Hi there 👋
I’m an undergraduate student and researcher at the University of Washington, Seattle. At UW, I’m fortunate to be advised by Prof. Luis Ceze and mentored by Dr. Zihao Ye on building efficient LLM serving systems. I also work closely with Prof. Tianqi Chen at CMU Catalyst.
My research focuses on building efficient AI systems and creating AI-driven workflows that continuously optimize these systems.
🎓 I am actively seeking PhD positions for Fall 2026. Feel free to reach out!
News
[Oct, 2025] We are launching FlashInfer-Bench! Check out the blog post and leaderboard.
[May, 2025] I’m visiting Catalyst at CMU for the summer, hosted by Prof. Tianqi Chen. Feel free to connect and chat!
[Mar, 2025] Check out our latest blog post on Sorting-Free GPU Kernels for LLM Sampling!