-
Notifications
You must be signed in to change notification settings - Fork 603
Closed
Labels
questionFurther information is requestedFurther information is requested
Description
Your Question
slime/tools/convert_hf_to_int4.py
Lines 72 to 80 in bd70add
| ignore_patterns = [ | |
| "re:.*lm_head.*", | |
| "re:.*norm.*", | |
| "re:.*embed.*", | |
| "re:.*self_attn.*", | |
| "re:.*shared_experts.*", | |
| "re:.*mlp\\.(gate|up|gate_up|down)_proj.*", | |
| "re:.*mlp\\.gate\\.*", | |
| ] |
I'm not familiar to quantization. Could you please explain the reason why these parameters are ignored during INT4 quantization?
What I've Tried
N/A
Environment (if relevant)
- slime version:
- Python version:
- PyTorch version:
- CUDA/ROCm version:
- GPU type and count:
- OS:
Additional Context
No response
Pre-submission Checklist
- I have read the CONTRIBUTING.md and understand the collaboration scope.
- I have read the documentation and FAQ and my question is not answered there.
- I have searched for existing issues and my question has not been asked before.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
questionFurther information is requestedFurther information is requested