Commit 47854b1

update README (#960)
* Improve interface compatibility
  Add Dockerfile for Linux platform
* del unimportant comments
* Update README.md
* add DeepSeek-R1-Distill-Qwen-14B-int4-ov
* Does not support setting top_k
  update DeepSeek model Modelfile
* update README
  DeepSeek-R1-Distill-Qwen-1.5B uses the model with gs=32 by default
1 parent 04349d5 commit 47854b1

1 file changed, +1 -1 lines changed


modules/ollama_openvino/README.md

Lines changed: 1 addition & 1 deletion
@@ -611,7 +611,7 @@ docker run --rm -it ollama_openvino:v1
 The native Ollama only supports models in the GGUF format, the Ollama-OV invoke OpenVINO GenAI which requires models in the OpenVINO format. Therefore, we have enabled support for OpenVINO model files in Ollama. For public LLMs, you can access and download OpenVINO IR model from HuggingFace or ModelScope:
 | Model | Parameters | Size |Compression | Download | Device |
 | ------------------ | ---------- | ----- | -----------|-------------------- |----------|
-| DeepSeek-R1-Distill-Qwen-1.5B-int4-ov | 1.5B | 1.1GB | INT4_SYM_128 | [ModelScope](https://modelscope.cn/models/zhaohb/DeepSeek-R1-Distill-Qwen-1.5B-int4-ov/summary) | CPU, GPU, NPU(base) |
+| DeepSeek-R1-Distill-Qwen-1.5B-int4-ov | 1.5B | 1.4GB | INT4_ASYM_32 | [ModelScope](https://modelscope.cn/models/zhaohb/DeepSeek-R1-Distill-Qwen-1.5B-int4-gs-32-ov) | CPU, GPU, NPU(base) |
 | DeepSeek-R1-Distill-Qwen-1.5B-int4-ov-npu | 1.5B | 1.1GB | INT4_SYM_CW | [ModelScope](https://modelscope.cn/models/zhaohb/DeepSeek-R1-Distill-Qwen-1.5B-int4-ov-npu/summary) | NPU(best) |
 | DeepSeek-R1-Distill-Qwen-7B-int4-ov | 7B | 4.3GB | INT4_SYM_128 | [ModelScope](https://modelscope.cn/models/zhaohb/DeepSeek-R1-Distill-Qwen-7B-int4-ov) | CPU, GPU, NPU(base) |
 | DeepSeek-R1-Distill-Qwen-7B-int4-ov-npu | 7B | 4.1GB | INT4_SYM_CW | [ModelScope](https://modelscope.cn/models/zhaohb/DeepSeek-R1-Distill-Qwen-7B-int4-ov-npu) | NPU(best) |
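
Note on the Compression column touched by this change: the labels appear to encode precision, quantization mode, and group size, which matches the "gs=32" wording in the commit message. The sketch below is illustrative only and is not part of the repository. `Int4Scheme`, `parse_label`, and `overhead_bits_per_weight` are hypothetical names, reading `CW` as channel-wise is an assumption, and the 16-bit scale and 4-bit zero point are assumed storage widths that the README does not state. It shows why a smaller group size with asymmetric quantization would be expected to store more metadata per weight, which is consistent in direction with the listed size going from 1.1GB to 1.4GB, without claiming to account for the full difference.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Int4Scheme:
    """Hypothetical container for a parsed Compression label."""
    symmetric: bool
    group_size: Optional[int]  # None stands for "CW" (assumed: channel-wise)


def parse_label(label: str) -> Int4Scheme:
    """Split labels such as 'INT4_SYM_128', 'INT4_ASYM_32', 'INT4_SYM_CW'."""
    precision, mode, group = label.split("_")
    if precision != "INT4" or mode not in ("SYM", "ASYM"):
        raise ValueError(f"unrecognized compression label: {label}")
    group_size = None if group == "CW" else int(group)
    return Int4Scheme(symmetric=(mode == "SYM"), group_size=group_size)


def overhead_bits_per_weight(scheme: Int4Scheme,
                             scale_bits: int = 16,
                             zero_point_bits: int = 4) -> float:
    """Extra metadata bits per weight on top of the 4-bit value.

    Assumes one scale per group, plus one zero point per group in
    asymmetric mode. Both default bit widths are assumptions.
    """
    if scheme.group_size is None:
        raise ValueError("channel-wise overhead depends on the channel length")
    per_group = scale_bits + (0 if scheme.symmetric else zero_point_bits)
    return per_group / scheme.group_size


old = parse_label("INT4_SYM_128")
new = parse_label("INT4_ASYM_32")
print(overhead_bits_per_weight(old), overhead_bits_per_weight(new))
```

Under these assumptions the per-weight metadata is 16/128 bits for the old entry and 20/32 bits for the new one. Real file sizes also depend on layers kept at higher precision, so treat the numbers as a rough direction rather than a prediction.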
