News

[Oct, 2025]

We are launching FlashInfer-Bench! Check out the blog post and leaderboard.

[May, 2025]

I’m visiting Catalyst at CMU for the summer, hosted by Prof. Tianqi Chen. Feel free to connect & chat!

[Mar, 2025]

Check out our latest blog post on Sorting-Free GPU Kernels for LLM Sampling!