Skip to content

Commit e94363c

Browse files
committed
Enable bitsandbytes quantization on warp size 32 AMD GPUs
Signed-off-by: sstamenk <[email protected]>
1 parent 03ee481 commit e94363c

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

vllm/platforms/rocm.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -185,6 +185,9 @@ class RocmPlatform(Platform):
185185
"petit_nvfp4",
186186
"torchao",
187187
]
188+
# bitsandbytes is not supported on GPUs with warp size 64 (gfx9)
189+
if not on_gfx9():
190+
supported_quantization += ["bitsandbytes"]
188191

189192
@classmethod
190193
def get_vit_attn_backend(

0 commit comments

Comments
 (0)