[Bug]: V0.9.1 开启full_cuda_graph: true 后会hang住

### Your current environment

<details>
<summary>The output of `python collect_env.py`</summary>

```text
Your output of above commands here
```

</details>


### 🐛 Describe the bug

vllm-ascend 版本：v0.9.1
模型：Qwen2.5-32B-Instruct
在采用 --compilation-config '{"full_cuda_graph": true}' 后，推理会hang住，通过 py-spy dump 发现卡在 npu 的graph_task_update_end 位置，该如何定位解决呢？

<img width="1866" height="698" alt="Image" src="https://github.com/user-attachments/assets/f0b0d1ac-a7fc-4913-a9a4-1adf797eef35" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Bug]: V0.9.1 开启full_cuda_graph: true 后会hang住 #4180

Your current environment

🐛 Describe the bug

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Bug]: V0.9.1 开启full_cuda_graph: true 后会hang住 #4180

Description

Your current environment

🐛 Describe the bug

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions