Skip to content

Commit e849edc

Browse files
Update _posts/2025-10-21-flashinfer-bench.md
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
1 parent a7f9b09 commit e849edc

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

_posts/2025-10-21-flashinfer-bench.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ FlashInfer-Bench is designed specifically to address these three challenges. As
4040

4141
- **Set the right goal — FlashInfer-Bench Dataset.** We collected and curated the most important kernels used in production, along with their real workloads, to create a dataset. This allows us to measure kernel performance under real traffic conditions and further makes it possible to concretely improve production systems.
4242

43-
- **Establish a 0-day production path First-class FlashInfer Integration.** We build first-class integration with FlashInfer an open LLM kernel library widely used in major LLM inference engines. Our solution can dynamically replace the FlashInfer Kernels with the best-performing kernels, as evaluated using the FlashInfer Trace and the FlashInfer-Bench dataset. This enables activating the best kernels in LLM engines and testing end-to-end performance with minimal effort.
43+
- **Establish a 0-day production path First-class FlashInfer Integration.** We build first-class integration with FlashInfer an open LLM kernel library widely used in major LLM inference engines. Our solution can dynamically replace the FlashInfer Kernels with the best-performing kernels, as evaluated using the FlashInfer Trace and the FlashInfer-Bench dataset. This enables activating the best kernels in LLM engines and testing end-to-end performance with minimal effort.
4444

4545
In the rest of the post, we will walk through the details about the three main elements.
4646

0 commit comments

Comments
 (0)