Skip to content

Commit e1b2110

Browse files
committed
Enable marlin as default GPTQ kernel
Signed-off-by: Chih-Chieh Yang <[email protected]> Signed-off-by: Chih-Chieh-Yang <[email protected]>
1 parent 480646b commit e1b2110

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

server/text_generation_server/utils/layers.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@
1515
HAS_BITS_AND_BYTES = False
1616
EXLLAMA_VERSION = None
1717
HAS_GPTQ_CUDA = False
18-
GPTQ_CUDA_TYPE = os.getenv("GPTQ_CUDA_TYPE", "exllama").lower()
18+
GPTQ_CUDA_TYPE = os.getenv("GPTQ_CUDA_TYPE", "marlin").lower()
1919
GPTQ_CUDA_LINEAR = None
2020

2121
if torch.cuda.is_available():

0 commit comments

Comments
 (0)