Skip to content

Commit f9ca21b

Browse files
committed
update intro
Signed-off-by: youkaichao <[email protected]>
1 parent 8a9c57d commit f9ca21b

File tree

1 file changed

+4
-2
lines changed

1 file changed

+4
-2
lines changed

_posts/2025-08-11-cuda-debugging.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,9 +7,11 @@ image: /assets/logos/vllm-logo-text-light.png
77

88
# Introduction
99

10-
vLLM is a high-performance inference engine for large language models. Today we will cover some of the debugging tools to help users debug complicated issues in vLLM, such as illegal memory accesses.
10+
Have you ever felt you are developing attention kernel and your tests oftens runs into illegal memory access (IMA for short) and you have no idea how to debug? We definitely felt this pain again and again while working on vLLM, a high-performance inference engine for LLM models.
1111

12-
Illegal memory access (IMA for short), is probably the most common and frustrating issue. For example, here’s an error from PyTorch:
12+
If you are one of the developers who have faced this issue, this blog is for you! We will uncover some of advanced debugging techniques we use that can help users debug complicated issues in vLLM, such as IMA.
13+
14+
For example, here’s an error from PyTorch:
1315

1416
```text
1517
RuntimeError: CUDA error: an illegal memory access was encountered

0 commit comments

Comments
 (0)