Commit 4e3b931

deepseek coder v2 lite without any quantization
1 parent c4fb4b9 commit 4e3b931

1 file changed, +10 -0 lines changed

refact_known_models/huggingface.py

Lines changed: 10 additions & 0 deletions
@@ -233,4 +233,14 @@
         "T": 8192,
         "filter_caps": ["chat"],
     },
+    "deepseek-coder-v2/16b/instruct": {
+        "backend": "transformers",
+        "model_path": "deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct",
+        "model_class_kwargs": {
+            "torch_dtype": "bf16",
+        },
+        "required_memory_mb": 80000,
+        "T": 16384,  # in fact this model can handle 128K context
+        "filter_caps": ["completion", "chat"],
+    },
 }
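
The new entry runs the model in bf16 without quantization and reserves 80000 MB. As a rough sanity check on that figure, the sketch below estimates the size of the raw weights alone. It is a minimal sketch, not code from the refact repository: `estimate_weights_mb` and `BYTES_PER_PARAM` are hypothetical names, and the 16e9 parameter count is an assumption read off the "16b" in the model key.

```python
# Hypothetical helper for illustration; not part of refact_known_models.
# Bytes occupied by one parameter for a few common dtypes.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2}


def estimate_weights_mb(num_params: float, dtype: str) -> int:
    """Approximate size of the raw weights in MB (10**6 bytes)."""
    return round(num_params * BYTES_PER_PARAM[dtype] / 1e6)


# The entry added by this commit, copied from the diff.
entry = {
    "backend": "transformers",
    "model_path": "deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct",
    "model_class_kwargs": {"torch_dtype": "bf16"},
    "required_memory_mb": 80000,
    "T": 16384,
    "filter_caps": ["completion", "chat"],
}

# Assumption: about 16e9 parameters, taken from the "16b" in the key.
weights_mb = estimate_weights_mb(16e9, entry["model_class_kwargs"]["torch_dtype"])
headroom_mb = entry["required_memory_mb"] - weights_mb

print(f"weights: ~{weights_mb} MB, headroom: ~{headroom_mb} MB")
```

Under these assumptions the weights take roughly 32000 MB, so the remaining part of `required_memory_mb` would be headroom for activations, the KV cache at `T = 16384`, and framework overhead. The estimate ignores all of those, so it is only a lower bound on actual usage.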