Commit 9769c02

amend data type
Signed-off-by: tunjiantan <[email protected]>
1 parent cc0466f commit 9769c02

1 file changed: +3 -3 lines changed

_posts/2024-10-23-vllm-serving-amd.md

Lines changed: 3 additions & 3 deletions
@@ -14,7 +14,7 @@ author: "Guest Post by Embedded LLM and Hot Aisle Inc."
 &nbsp; &nbsp;
 <img src="/assets/figures/vllm-serving-amd/405b2.png" width="35%">
 </picture><br>
-vLLM vs. TGI performance comparison for Llama 3.1 405B on 8 x MI300X (FP16, 32 QPS).
+vLLM vs. TGI performance comparison for Llama 3.1 405B on 8 x MI300X (BF16, 32 QPS).
 </p>
 
 <p align="center">
@@ -24,7 +24,7 @@ vLLM vs. TGI performance comparison for Llama 3.1 405B on 8 x MI300X (FP16, 32 Q
 &nbsp; &nbsp;
 <img src="/assets/figures/vllm-serving-amd/70b2.png" width="35%">
 </picture><br>
-vLLM vs. TGI performance comparison for Llama 3.1 70B on 8 x MI300X (FP16, 32 QPS).
+vLLM vs. TGI performance comparison for Llama 3.1 70B on 8 x MI300X (BF16, 32 QPS).
 </p>
 
 ### Introduction
@@ -49,7 +49,7 @@ Even in the default configuration, vLLM shows superior performance compared to T
 <picture>
 <img src="/assets/figures/vllm-serving-amd/introduction/Mean TTFT (ms).png" width="70%">
 </picture><br>
-vLLM vs. TGI performance for Llama 3.1 405B on 8 x MI300X (FP16, QPS 16, 32, 1000; see Appendix for commands).
+vLLM vs. TGI performance for Llama 3.1 405B on 8 x MI300X (BF16, QPS 16, 32, 1000; see Appendix for commands).
 </p>
 
 ### How to run vLLM with Optimal Performance