I'm a 4th-year CS Ph.D. student at Princeton University, advised by Prof. Ravi Netravali, and a member of the Princeton Systems for AI Lab (SAIL). My interests lie in systems and algorithms for efficient LLMs across the stack, from inference to pre-/post-training. My PhD work centers on model-system co-design for efficient LLM inference — studying how inference optimizations such as prefix caching and speculative decoding interact with emerging model architectures, including hybrid, reasoning, and diffusion LLMs. My research has been adopted by open-source LLM serving systems such as vLLM and SGLang , and has been recognized with an MLSys Outstanding Paper Honorable Mention, a Jane Street Graduate Research Fellowship Finalist Award, an MLCommons ML and Systems Rising Stars Award, and research grants from Google, a16z, Modal, and Lambda. Previously, I received B.S. degrees in CS and Math from UW-Madison, where I was fortunate to work with Prof. Shivaram Venkataraman on distributed ML training systems. I have also interned at Google, AWS, and MPI-INF.
Sep 2018 - Dec 2021
B.S. in CS & Math
Advisor: Prof. Shivaram Venkataraman
Jun 2025 - Dec 2025
Sunnyvale, CA
Manager: Prof. Arvind Krishnamurthy
May 2024 - Dec 2024
Santa Clara, CA
Manager: Dr. Zhen Jia
Feb 2022 - Aug 2022
Saarbrücken, Germany
Advisor: Prof. Yiting Xia