File tree Expand file tree Collapse file tree 1 file changed +5
-5
lines changed Expand file tree Collapse file tree 1 file changed +5
-5
lines changed Original file line number Diff line number Diff line change @@ -39,15 +39,15 @@ python ./examples/llava/llava_surgery.py -m path/to/MobileVLM-1.7B
39393 . Use ` convert_image_encoder_to_gguf.py ` with ` --projector-type ldp ` (for ** V2** please use ` --projector-type ldpv2 ` ) to convert the LLaVA image encoder to GGUF:
4040
4141``` sh
42- python ./examples/llava/convert_image_encoder_to_gguf \
42+ python ./examples/llava/convert_image_encoder_to_gguf.py \
4343 -m path/to/clip-vit-large-patch14-336 \
4444 --llava-projector path/to/MobileVLM-1.7B/llava.projector \
4545 --output-dir path/to/MobileVLM-1.7B \
4646 --projector-type ldp
4747```
4848
4949``` sh
50- python ./examples/llava/convert_image_encoder_to_gguf \
50+ python ./examples/llava/convert_image_encoder_to_gguf.py \
5151 -m path/to/clip-vit-large-patch14-336 \
5252 --llava-projector path/to/MobileVLM-1.7B_V2/llava.projector \
5353 --output-dir path/to/MobileVLM-1.7B_V2 \
@@ -57,12 +57,12 @@ python ./examples/llava/convert_image_encoder_to_gguf \
57574 . Use ` examples/convert_legacy_llama.py ` to convert the LLaMA part of LLaVA to GGUF:
5858
5959``` sh
60- python ./examples/convert_legacy_llama.py path/to/MobileVLM-1.7B
60+ python ./examples/convert_legacy_llama.py path/to/MobileVLM-1.7B --skip-unknown
6161```
6262
63- 5 . Use ` quantize ` to convert LLaMA part's DataType from ` fp16 ` to ` q4_k `
63+ 5 . Use ` quantize ` to convert LLaMA part's DataType from ` fp32 ` to ` q4_k `
6464``` sh
65- ./llama-quantize path/to/MobileVLM-1.7B/ggml-model-f16 .gguf path/to/MobileVLM-1.7B/ggml-model-q4_k.gguf q4_k_s
65+ ./llama-quantize path/to/MobileVLM-1.7B/ggml-model-F32 .gguf path/to/MobileVLM-1.7B/ggml-model-q4_k.gguf q4_k_s
6666```
6767
6868Now both the LLaMA part and the image encoder is in the ` MobileVLM-1.7B ` directory.
You can’t perform that action at this time.
0 commit comments