Skip to content

Commit 6687d38

Browse files
committed
clarification in multimodal.md
1 parent 9c192cf commit 6687d38

File tree

2 files changed

+5
-1
lines changed

2 files changed

+5
-1
lines changed

convert_hf_to_gguf.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2668,7 +2668,7 @@ def modify_tensors(self, data_torch: Tensor, name: str, bid: int | None) -> Iter
26682668
if "language_model." in name:
26692669
name = name.replace("language_model.", "") # for InternVL
26702670
if name.startswith("mlp") or name.startswith("multi_modal_projector") \
2671-
or name.startswith("vision_model") or name.startswith("audio_tower"):
2671+
or name.startswith("vision_model") or name.startswith("audio_tower"):
26722672
# skip vision and audio tensors
26732673
return []
26742674
yield from super().modify_tensors(data_torch, name, bid)

docs/multimodal.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -89,4 +89,8 @@ NOTE: some models may require large context window, for example: `-c 8192`
8989
# Ultravox 0.5
9090
(tool_name) -hf ggml-org/ultravox-v0_5-llama-3_2-1b-GGUF
9191
(tool_name) -hf ggml-org/ultravox-v0_5-llama-3_1-8b-GGUF
92+
93+
# Qwen2-Audio and SeaLLM-Audio
94+
# note: no pre-quantized GGUF this model, as they have very poor result
95+
# ref: https://github.com/ggml-org/llama.cpp/pull/13760
9296
```

0 commit comments

Comments
 (0)