Skip to content

Commit eef62fa

Browse files
committed
Minor
Signed-off-by: WoosukKwon <[email protected]>
1 parent e83a5be commit eef62fa

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

_posts/2025-01-26-v1-alpha-release.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -136,8 +136,9 @@ Given that the kernels used for V0 and V1 are almost identical, the performance
136136
</picture>
137137
</p>
138138

139-
We evaluated the performance on VLMs by testing Qwen2-VL using the [VisionArena dataset](https://arxiv.org/abs/2412.08687).
139+
We evaluated the performance on VLMs by testing Qwen2-VL using the [VisionArena](https://arxiv.org/abs/2412.08687) dataset.
140140
V1 delivered even larger speedups over V0, thanks its improved VLM support, driven by two key improvements: offloading input processing to a separate process and implementing more flexible scheduling for multimodal queries.
141+
We would also like to point out that prefix caching is now natively supported for multimodal models in V1, but will skip the benchmark results here.
141142

142143
- **Looking Forward**
143144

0 commit comments

Comments
 (0)