I'm a 4th-year CS Ph.D. student at Princeton University, advised by Prof. Ravi Netravali. I am a member of the Princeton Systems for AI Lab (SAIL). I am broadly interested in systems and algorithms for efficient LLM inference. My recent work focuses on model-system co-design for new generations of LLMs, including hybrid, reasoning, and diffusion LLMs, with an emphasis on the runtime and serving techniques needed to make them efficient at scale. I received my B.S. in CS and Math from the University of Wisconsin-Madison, where I was fortunate to work with Prof. Shivaram Venkataraman on systems for distributed ML training. I have also interned at Google and AWS on efficient LLM inference and at MPI-INF on networks for ML training. My research has been recognized with an MLSys Outstanding Paper Award (Honorable Mention, 2025), a Jane Street Graduate Research Fellowship Finalist Award (2026), an MLCommons ML and Systems Rising Stars Award (2026), and research grants from Google, a16z, Modal, and Lambda.
Sep 2018 - Dec 2021
B.S. in CS & Math
Advisor: Prof. Shivaram Venkataraman
Jun 2025 - Dec 2025
Sunnyvale, CA
Manager: Prof. Arvind Krishnamurthy
May 2024 - Dec 2024
Santa Clara, CA
Manager: Dr. Zhen Jia
Feb 2022 - Aug 2022
Saarbrücken, Germany
Advisor: Prof. Yiting Xia