[Question] Why ignore mlp projection weight in `convert_hf_to_int4.py`

### Your Question

https://github.com/THUDM/slime/blob/bd70add6ab27e773294081fd260888f003984146/tools/convert_hf_to_int4.py#L72-L80

I'm not familiar to quantization. Could you please explain the reason why these parameters are ignored during INT4 quantization?

### What I've Tried

N/A

### Environment (if relevant)

- slime version:
- Python version:
- PyTorch version:
- CUDA/ROCm version:
- GPU type and count:
- OS:


### Additional Context

_No response_

### Pre-submission Checklist

- [x] I have read the [CONTRIBUTING.md](https://github.com/THUDM/slime/blob/main/CONTRIBUTING.md) and understand the collaboration scope.
- [x] I have read the [documentation](https://thudm.github.io/slime/) and [FAQ](https://thudm.github.io/slime/en/get_started/qa.html) and my question is not answered there.
- [x] I have searched for [existing issues](https://github.com/THUDM/slime/issues) and my question has not been asked before.

	ignore_patterns = [
	"re:.lm_head.",
	"re:.norm.",
	"re:.embed.",
	"re:.self_attn.",
	"re:.shared_experts.",
	"re:.mlp\\.(gate\|up\|gate_up\|down)_proj.",
	"re:.mlp\\.gate\\.",
	]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] Why ignore mlp projection weight in `convert_hf_to_int4.py` #1640

Your Question

What I've Tried

Environment (if relevant)

Additional Context

Pre-submission Checklist

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Question] Why ignore mlp projection weight in convert_hf_to_int4.py #1640

Description

Your Question

What I've Tried

Environment (if relevant)

Additional Context

Pre-submission Checklist

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

[Question] Why ignore mlp projection weight in `convert_hf_to_int4.py` #1640