Skip to content

Commit add945d

Browse files
xlxl
authored andcommitted
update performance and usage pic
1 parent ea17b0c commit add945d

File tree

3 files changed

+2
-2
lines changed

3 files changed

+2
-2
lines changed

blog/2025-10-29-sglang-jax.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -73,10 +73,10 @@ We benchmarked SGLang-Jax against vLLM-TPU. Full instructions are available [her
7373
We used `Qwen/Qwen3-32B`, TPU v6e-4, SGLang-jax (version: main-af32f095880ff676ed23eec19bc79584b5e20717), and vLLM-tpu (vllm-tpu==0.11.1).
7474

7575
### Results
76+
<img src="/images/blog/sglang_jax/performance_results.png" style="display:block; margin: auto; width: 85%;"></img>
77+
<p style="color:gray; text-align: center;">match vllm-tpu on prefill because of similar kernel optimizations. outperform vllm-tpu on decode thanks to overlap scheduler. </p>
7678

7779
Todo:
78-
- only show one group of results 4k input, 1k output
79-
- message: match vllm-tpu on prefill because of similar kernel optimizations. outperform vllm-tpu on decode thanks to overlap scheduler.
8080
- (optional) show some TPUs vs. GPUs.
8181

8282
## Usage
-80.3 KB
Loading
156 KB
Loading

0 commit comments

Comments
 (0)