### What this PR does / why we need it?
- Update the deterministic calculation settings
- Update the list of supported devices
### Does this PR introduce _any_ user-facing change?
- Users should update `ray` and `protobuf` when using Ray as the distributed backend
- Users should switch to `export HCCL_DETERMINISTIC=true` when enabling deterministic calculation
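
As a minimal sketch of the change above, the snippet below sets the four variables from the updated FAQ and prints them back for a quick check. The variable names and values are taken from this PR's diff; the verification loop is only an illustration and is not part of the FAQ. Note that shell assignments must not have spaces around `=`, which is why the earlier `export NAME = 1` form did not work.

```bash
# Deterministic-calculation settings, as listed in the updated FAQ.
export LCCL_DETERMINISTIC=1
export HCCL_DETERMINISTIC=true
export ATB_MATMUL_SHUFFLE_K_ENABLE=0
export ATB_LLM_LCOC_ENABLE=0

# Illustrative check (an assumption, not from the FAQ): print each variable
# as seen by child processes, to confirm it was exported as intended.
for name in LCCL_DETERMINISTIC HCCL_DETERMINISTIC \
            ATB_MATMUL_SHUFFLE_K_ENABLE ATB_LLM_LCOC_ENABLE; do
  printf '%s=%s\n' "$name" "$(printenv "$name")"
done
```

Run this in the same shell session that later launches the workload, since exported variables only reach processes started from that shell.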
### How was this patch tested?
N/A
- vLLM version: v0.10.0
- vLLM main: vllm-project/vllm@ea1292a
Signed-off-by: MengqingCao <[email protected]>
docs/source/faqs.md: 5 additions & 5 deletions
@@ -10,7 +10,7 @@
 
 ### 1. What devices are currently supported?
 
-Currently, **ONLY** Atlas A2 series(Ascend-cann-kernels-910b),Atlas A2 series(Atlas-A3-cann-kernels) and Atlas 300I(Ascend-cann-kernels-310p) series are supported:
+Currently, **ONLY** Atlas A2 series(Ascend-cann-kernels-910b),Atlas A3 series(Atlas-A3-cann-kernels) and Atlas 300I(Ascend-cann-kernels-310p) series are supported:
 
 - Atlas A2 Training series (Atlas 800T A2, Atlas 900 A2 PoD, Atlas 200T A2 Box16, Atlas 300T A2)
 - Atlas 800I A2 Inference series (Atlas 800I A2)
@@ -161,10 +161,10 @@ for output in outputs:
 2. Set the following enveriments parameters:
 
 ```bash
-export LCCL_DETERMINISTIC = 1
-export HCCL_DETERMINISTIC = 1
-export ATB_MATMUL_SHUFFLE_K_ENABLE = 0
-export ATB_LLM_LCOC_ENABLE = 0
+export LCCL_DETERMINISTIC=1
+export HCCL_DETERMINISTIC=true
+export ATB_MATMUL_SHUFFLE_K_ENABLE=0
+export ATB_LLM_LCOC_ENABLE=0
 ```
 
 ### 19. How to fix the error "ImportError: Please install vllm[audio] for audio support" for Qwen2.5-Omni model?