Conversation

@blap commented Sep 18, 2024

GGUF conversion for HF1BitLLM/Llama3-8B-1.58-100B-tokens: https://huggingface.co/HF1BitLLM/Llama3-8B-1.58-100B-tokens/discussions/3

github-actions bot added the python (python script changes) label Sep 18, 2024
@compilade (Collaborator)

I appreciate the initiative, but this won't really be mergeable as-is. As I've noted in https://huggingface.co/HF1BitLLM/Llama3-8B-1.58-100B-tokens/discussions/3, this is a "very ad-hoc patch, [and it] probably only works for this model".

This is because it only handles safetensors files, and it only special-cases the particular packed format used by this model when lazy conversion is enabled (so it won't work with --no-lazy).
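For context, that special case essentially amounts to unpacking the model's packed ternary weights before writing them out. Here is a minimal sketch of that kind of unpacking, assuming 2-bit fields packed four per byte (least-significant field first) with an offset of 1; the actual layout used by HF1BitLLM/Llama3-8B-1.58-100B-tokens may differ:

```python
# Hedged sketch: unpack ternary (1.58-bit) weights stored four per byte.
# Assumes 2 bits per value, least-significant field first, with stored
# values {0, 1, 2} mapping to {-1, 0, +1}; the model's real packing may differ.
import numpy as np

def unpack_ternary(packed: np.ndarray) -> np.ndarray:
    """Expand a uint8 array into 4x as many int8 values in {-1, 0, +1}."""
    shifts = np.array([0, 2, 4, 6], dtype=np.uint8)
    fields = (packed[..., None] >> shifts) & 0b11  # four 2-bit fields per byte
    return fields.reshape(*packed.shape[:-1], -1).astype(np.int8) - 1
```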

I think the way forward for this kind of thing would be to explicitly support the quantization_config field in config.json to load quantized models. I'm not sure whether transformers code can be called in a way that's compatible with lazy conversion (i.e. streaming dequantization instead of dequantizing the whole model at once in memory), or whether each quantization type will need to be implemented manually.
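As a rough illustration of that direction, a converter could read quantization_config from config.json and stream tensors one at a time, dequantizing each before it is written. This is only a sketch under those assumptions; the dequantize dispatcher below is a hypothetical placeholder, not an existing llama.cpp or transformers API:

```python
# Minimal sketch: dispatch on quantization_config and dequantize tensors one
# at a time (streaming), instead of materializing the whole model in memory.
import json
from pathlib import Path
from typing import Iterator

import numpy as np
from safetensors import safe_open

def dequantize(tensor: np.ndarray, name: str, quant_cfg: dict) -> np.ndarray:
    # Hypothetical per-method dispatch; each quant_method ("bitnet", "gptq",
    # "awq", ...) would need its own streaming-friendly implementation.
    raise NotImplementedError(f"quant_method {quant_cfg.get('quant_method')!r}")

def iter_dequantized(model_dir: str) -> Iterator[tuple[str, np.ndarray]]:
    config = json.loads((Path(model_dir) / "config.json").read_text())
    quant_cfg = config.get("quantization_config")  # absent on unquantized models
    for shard in sorted(Path(model_dir).glob("*.safetensors")):
        with safe_open(shard, framework="np") as f:
            for tensor_name in f.keys():
                tensor = f.get_tensor(tensor_name)  # one tensor at a time
                if quant_cfg is not None:
                    tensor = dequantize(tensor, tensor_name, quant_cfg)
                yield tensor_name, tensor
```

Streaming one tensor at a time keeps peak memory bounded by the largest tensor, which is what makes it compatible with lazy conversion.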

blap closed this by deleting the head repository Sep 19, 2024
