[Doc] Update more docs with respect to V1 #29188
base: main
Conversation
Signed-off-by: DarkLight1337 <[email protected]>
Documentation preview: https://vllm--29188.org.readthedocs.build/en/29188/
Code Review
This pull request provides a good set of updates to the documentation for vLLM V1. The changes refactor and consolidate information, making the documentation clearer and more up-to-date. I've identified a couple of minor but important typos in the markdown files that affect rendering and link functionality. Addressing these will ensure the documentation is presented correctly to users.
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
/gemini review
Code Review
This pull request updates various documentation files to reflect the current status of vLLM V1, consolidating information into a central v1_guide.md. The changes are mostly documentation updates and look good. However, I found a significant contradiction in v1_guide.md regarding the status of 'Prompt Logprobs with Prefix Caching', which could confuse users. Please see the specific comment for details.
> #### Prompt Logprobs with Prefix Caching
>
> For each item, our progress towards V1 support falls into one of the following states:
>
> Logprobs are not cached. For a request requiring prompt logprobs, the engine will ignore the prefix cache and recompute the prefill of the full prompt to generate the logprobs.
There is a contradiction in the documentation regarding 'Prompt Logprobs with Prefix Caching'. This section states that for requests with prompt logprobs, 'the engine will ignore the prefix cache'. However, the feature table on line 150 indicates that 'Prompt Logprobs with Prefix Caching' is '🟢 Functional'. These two statements are conflicting. Please clarify the correct behavior and update the documentation to be consistent.
cc @njhill
The statement here is correct. I think it's OK to leave it as functional (not optimized). You can link the Prompt Logprobs with Prefix Caching to this section if you want.
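To make the behavior being discussed concrete, here is a minimal sketch of a request that asks for prompt logprobs while prefix caching is enabled (the model name and prompt are illustrative assumptions, not taken from this PR):

```python
from vllm import LLM, SamplingParams

# Illustrative model choice; any model works for this sketch.
llm = LLM(model="facebook/opt-125m", enable_prefix_caching=True)

# Asking for prompt logprobs: per the discussion above, the engine
# recomputes the prefill of the full prompt instead of reusing the
# prefix cache, so the request is functional but not optimized.
params = SamplingParams(max_tokens=8, prompt_logprobs=1)

outputs = llm.generate(["The capital of France is"], params)
print(outputs[0].prompt_logprobs)  # per-token logprobs for the prompt
```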
```diff
  vLLM does not guarantee the reproducibility of the results by default, for the sake of performance. To achieve
- reproducible results, you need to turn off multiprocessing to make the scheduling deterministic by setting `VLLM_ENABLE_V1_MULTIPROCESSING=0`.
+ reproducible results, consider enabling [batch invariance](../features/batch_invariance.md) as the scheduling
+ cannot be made deterministic without using offline mode and setting `VLLM_ENABLE_V1_MULTIPROCESSING=0`.
```
can you make it clearer? IMO:
- for online serving, you need batch invariance
- for offline serving, you need either batch invariance or `VLLM_ENABLE_V1_MULTIPROCESSING=0` (see the sketch below)
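As a sketch of the offline route in the second bullet, assuming the offline `LLM` API; the model name and seed here are illustrative, not from this PR:

```python
import os

# Disable V1 engine multiprocessing before importing vLLM so that
# scheduling is deterministic (offline-serving route discussed above).
os.environ["VLLM_ENABLE_V1_MULTIPROCESSING"] = "0"

from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")
params = SamplingParams(temperature=0.8, seed=42, max_tokens=16)

print(llm.generate(["Hello, my name is"], params)[0].outputs[0].text)
```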
Purpose
Update various documentation pages to better match the current status of V1. Various sections have been moved into the V1 page so people can more easily see the differences between V0 and V1.
cc @tdoublep can you update the status of prefix caching support for hybrid models? Feel free to update this PR directly if it hasn't been merged yet.
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
`supported_models.md` and `examples` for a new model.