Research
FlashInfer-Bench is out!
We are launching FlashInfer-Bench! Check out the blog post and leaderboard.
Visiting CMU
I’m visiting Catalyst at CMU for the summer, hosted by Prof. Tianqi Chen. Feel free to reach out and chat!
Sampling Blog
Check out our latest blog post on Sorting-Free GPU Kernels for LLM Sampling!