FlashInfer-Bench: Building the Virtuous Cycle for AI-driven LLM Systems 10/21/2025 A benchmark and infrastructure that opens the pathway for AI to accelerate real-world AI deployment. Read more
Sorting-Free GPU Kernels for LLM Sampling 03/10/2025 Some cool LLM sampling algorithms and their implementations on GPUs. Read more