add log for request trace #26284

wanx7130 · 2025-10-06T08:30:36Z

Purpose

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

github-actions · 2025-10-06T08:30:47Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

chatgpt-codex-connector · 2025-10-06T08:33:11Z

vllm/v1/engine/core.py

    def _handle_client_request(self, request_type: EngineCoreRequestType,
                               request: Any) -> None:
        """Dispatch request from client."""
-
+        logger.info("[wxl debug] EngineCore handling client request %s.", request.request_id)
        if request_type == EngineCoreRequestType.ADD:
            self.add_request(request)
        elif request_type == EngineCoreRequestType.ABORT:


Abort requests crash when logging request_id

The new info log dereferences request.request_id before inspecting the request type. When an abort is submitted the request argument is a list[str] of request IDs (see the ABORT branch immediately below), so this line raises AttributeError and prevents _handle_client_request from reaching self.abort_requests. Any client attempting to cancel a request will now crash the engine core loop. Guard the log by checking the request type or by logging the value without assuming a request_id attribute.

Useful? React with 👍 / 👎.

gemini-code-assist

Code Review

This pull request introduces several logging statements for request tracing. While adding logs is beneficial for debugging, many of the new logs are at the INFO level and contain a [wxl debug] prefix, indicating they are temporary and not suitable for a production environment. These should be removed before merging. Additionally, some existing DEBUG logs have been promoted to INFO level in performance-sensitive areas, which could lead to excessive logging and performance degradation. These should be reverted to DEBUG level.

gemini-code-assist · 2025-10-06T08:33:16Z

vllm/v1/core/sched/scheduler.py

+                        logger.info("[wxl debug] %s has already received kvcache and enter into waiting status.",
+                                    request.request_id)


This log message appears to be for debugging purposes, as indicated by the [wxl debug] prefix. Such temporary debugging code should be removed before merging into the main branch to avoid cluttering production logs.

gemini-code-assist · 2025-10-06T08:33:16Z

vllm/v1/core/sched/scheduler.py

+                        logger.info(
+                            "[wxl debug] %s is still in WAITING_FOR_REMOTE_KVS state.",
                            request.request_id)


This log message appears to be for debugging purposes, as indicated by the [wxl debug] prefix, and has been elevated from DEBUG to INFO level. This change could lead to excessive logging in production. It should be reverted to the original DEBUG level and the debug prefix should be removed.

Suggested change

logger.info(

"[wxl debug] %s is still in WAITING_FOR_REMOTE_KVS state.",

request.request_id)

logger.debug(

"%s is still in WAITING_FOR_REMOTE_KVS state.",

request.request_id)

gemini-code-assist · 2025-10-06T08:33:16Z

vllm/v1/engine/async_llm.py

    ) -> RequestOutputCollector:
        """Add new request to the AsyncLLM."""
-
+        logger.info("[wxl debug] Request %s start add_request.", request_id)


This log message appears to be for debugging purposes, as indicated by the [wxl debug] prefix. Such temporary debugging code should be removed before merging into the main branch to avoid cluttering production logs.

gemini-code-assist · 2025-10-06T08:33:16Z

vllm/v1/engine/async_llm.py

        """

        try:
+            logger.info("[wxl debug] Request %s start generate.", request_id)


This log message appears to be for debugging purposes, as indicated by the [wxl debug] prefix. Such temporary debugging code should be removed before merging into the main branch to avoid cluttering production logs.

gemini-code-assist · 2025-10-06T08:33:16Z

vllm/v1/engine/core.py

                               request: Any) -> None:
        """Dispatch request from client."""
-
+        logger.info("[wxl debug] EngineCore handling client request %s.", request.request_id)


This log message appears to be for debugging purposes, as indicated by the [wxl debug] prefix. Such temporary debugging code should be removed before merging into the main branch to avoid cluttering production logs.

gemini-code-assist · 2025-10-06T08:33:16Z

vllm/v1/engine/output_processor.py

                self._update_stats_from_finished(req_state, finish_reason,
                                                 iteration_stats)

+            logger.info("[wxl debug] Request %s outputprocessor processed output and will return to client.", req_id)


This log message appears to be for debugging purposes, as indicated by the [wxl debug] prefix. Such temporary debugging code should be removed before merging into the main branch to avoid cluttering production logs.

gemini-code-assist · 2025-10-06T08:33:17Z

vllm/v1/engine/processor.py

        data_parallel_rank: Optional[int] = None,
    ) -> tuple[Optional[str], EngineCoreRequest]:
-
+        logger.info("[wxl debug] Request %s start process_inputs.", request_id)


This log message appears to be for debugging purposes, as indicated by the [wxl debug] prefix. Such temporary debugging code should be removed before merging into the main branch to avoid cluttering production logs.

gemini-code-assist · 2025-10-06T08:33:17Z

vllm/distributed/kv_transfer/kv_connector/v1/nixl_connector.py

+                    logger.info(
+                        "Transfer KVCache for req %s has completed with handle %s",
+                        req_id, handle)


This informational log is inside a loop that iterates over transfer handles. In a production environment with many transfers, this could lead to excessive logging and performance degradation. Please consider changing this log to the DEBUG level.

Suggested change

logger.info(

"Transfer KVCache for req %s has completed with handle %s",

req_id, handle)

logger.debug(

"Transfer KVCache for req %s has completed with handle %s",

req_id, handle)

gemini-code-assist · 2025-10-06T08:33:17Z

vllm/distributed/kv_transfer/kv_connector/v1/nixl_connector.py

+                    logger.info(
+                        "Transfer KVCache for req %s still in progress with handle %s",
+                        req_id, handle)


This informational log is inside a loop that iterates over transfer handles. In a production environment with many transfers, this could lead to excessive logging and performance degradation. Please consider changing this log to the DEBUG level.

Suggested change

logger.info(

"Transfer KVCache for req %s still in progress with handle %s",

req_id, handle)

logger.debug(

"Transfer KVCache for req %s still in progress with handle %s",

req_id, handle)

gemini-code-assist · 2025-10-06T08:33:17Z

vllm/distributed/kv_transfer/kv_connector/v1/nixl_connector.py

        for req_id, meta in metadata.requests.items():
            remote_engine_id = meta.remote_engine_id
-            logger.debug(
+            logger.info(


Changing this log from DEBUG to INFO could result in verbose logging in production, as start_load_kv is called frequently. This might impact performance. It's recommended to revert this to DEBUG level.

Suggested change

logger.info(

logger.debug(

add log for request trace

bdc22d5

wanx7130 requested review from WoosukKwon, robertgshaw2-redhat, njhill, ywang96, comaniac and alexm-redhat as code owners October 6, 2025 08:30

mergify bot added v1 kv-connector labels Oct 6, 2025

mergify bot mentioned this pull request Oct 6, 2025

V0.9.2 log trace #26283

Closed

5 tasks

chatgpt-codex-connector bot reviewed Oct 6, 2025

View reviewed changes

gemini-code-assist bot reviewed Oct 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

add log for request trace #26284

add log for request trace #26284

wanx7130 commented Oct 6, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Oct 6, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Oct 6, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Oct 6, 2025

Uh oh!

gemini-code-assist bot Oct 6, 2025

Uh oh!

gemini-code-assist bot Oct 6, 2025

Uh oh!

gemini-code-assist bot Oct 6, 2025

Uh oh!

gemini-code-assist bot Oct 6, 2025

Uh oh!

gemini-code-assist bot Oct 6, 2025

Uh oh!

gemini-code-assist bot Oct 6, 2025

Uh oh!

gemini-code-assist bot Oct 6, 2025

Uh oh!

gemini-code-assist bot Oct 6, 2025

Uh oh!

gemini-code-assist bot Oct 6, 2025

Uh oh!

Uh oh!

		logger.info("[wxl debug] %s has already received kvcache and enter into waiting status.",
		request.request_id)

Uh oh!

add log for request trace #26284

Are you sure you want to change the base?

add log for request trace #26284

Conversation

wanx7130 commented Oct 6, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

github-actions bot commented Oct 6, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

wanx7130 commented Oct 6, 2025 •

edited by github-actions bot

Loading