whoami
I'm a second-year Computer Science PhD at The University of Illinois-Urbana Champaign. I currently work with Prof. Tianyin Xu on Agentic/Cloud Systems reliability.
I'm interested in the reliability of all kinds of systems:
- AIOps/Agentic Frameworks & Reliability: LangGraph, AIOpsLab, ITBench, etc.
- Traditional/Cloud-native Distributed systems: TiDB, Cassandra, etc.
- and so on...
Generally, I'm mostly interested in these questions, and I plan to use my PhD thesis to answer them:
- Does the system break? (yes, otherwise I'm out of work :))
- Why/How does the system break?
- How can we patch the failures or prevent them?
- How can we build systems that doesn't break as frequent? or potentially, at all?
A short bio can be found here.
Education
- Ph.D. in Computer Science (current student), University of Illinois Urbana-Champaign, 2024-?
- B.Sc. in Computer Science, The University of Chicago, 2020-2024
Publications
(*: co-primary authors)
- NeurIPS 2025
STRATUS: A Multi-agent System for Autonomous Reliability Engineering of Modern Clouds. [pdf]
- Yinfang Chen*, Jiaqi Pan*, Jackson Clark*, Yiming Su*, Noah Zheutlin, Bhavya Bhavya, Rohan Arora, Yu Deng, Saurabh Jha, Tianyin Xu
- NSDI '26
Who Watches the Watchers? On the Reliability of Softwarizing Cloud Application Management [pdf]
- Jiawei Tyler Gu, Zhen Tang, Yiming Su, Bogdan A. Stoica, Xudong Sun, William X. Zheng, Yue Zhang, Akond Rahman, Chen Wang, and Tianyin Xu
- SOSP '24
If At First You Don’t Succeed, Try, Try, Again...? Insights and LLM-informed Tooling for Detecting Retry Bugs in Software Systems [pdf]
- Bogdan Alexandru Stoica*, Utsav Sethi*, Yiming Su, Cyrus Zhou, Shan Lu, Jonathan Mace, Madan Musuvathi, Suman Nath
- HotOS '23
HotGPT: How to Make Software Documentation More Useful with a Large Language Model? [pdf]
- Yiming Su, Chengcheng Wan, Utsav Sethi, Shan Lu, Madan Musuvathi, Suman Nath
Talks
- Co-host, IBM-Illinois Discovery Accelerator Institute Annual Meeting -- Workshop Demo on SRE AI Agent Cloud Incident Mitigation on ITBench. (Apr. 25, 2025)
- Speaker, HotGPT: How to Make Software Documentation More Useful with a Large Language Model? -- HotOS '23. (Me presenting our work)
Unless specifically noted, I do not own any of the images presented on this site. All rights go to their respective owner.
My past life... here