I'm a fourth-year CS Ph.D. student at Princeton University, advised by Prof. Ravi Netravali, and a member of the Princeton Systems for AI Lab (SAIL). My research focuses on systems and algorithms for efficient LLM inference. In particular, I work on model-system co-design for emerging generations of LLMs, including hybrid, reasoning, and diffusion LLMs, and on the runtime techniques needed to serve them at scale. Previously, I received B.S. degrees in CS and Math from UW-Madison, where I was fortunate to work with Prof. Shivaram Venkataraman on distributed ML training systems. I have also interned at Google, AWS, and MPI-INF. My research has been adopted by open-source LLM serving systems such as SGLang and vLLM, and has been recognized with an MLSys Outstanding Paper Award Honorable Mention, a Jane Street Graduate Research Fellowship Finalist Award, an MLCommons ML and Systems Rising Stars Award, and research grants from Google, a16z, Modal, and Lambda.
Sep 2018 - Dec 2021
B.S. in CS & Math
Advisor: Prof. Shivaram Venkataraman
Jun 2025 - Dec 2025
Sunnyvale, CA
Manager: Prof. Arvind Krishnamurthy
May 2024 - Dec 2024
Santa Clara, CA
Manager: Dr. Zhen Jia
Feb 2022 - Aug 2022
Saarbrücken, Germany
Advisor: Prof. Yiting Xia