-
Notifications
You must be signed in to change notification settings - Fork 30k
Open
Labels
Description
System Info
transformers
version: 4.52.4- Platform: Linux-5.15.0-94-generic-x86_64-with-glibc2.35
- Python version: 3.11.11
- Huggingface_hub version: 0.30.1
- Safetensors version: 0.5.3
- Accelerate version: 1.4.0
- Accelerate config: not found
- DeepSpeed version: 0.16.4
- PyTorch version (GPU?): 2.6.0+cu124 (True)
- Tensorflow version (GPU?): not installed (NA)
- Flax version (CPU?/GPU?/TPU?): not installed (NA)
- Jax version: not installed
- JaxLib version: not installed
- Using distributed or parallel set-up in script?: no
- Using GPU in script?: no
- GPU type: NVIDIA H100 80GB HBM3
Who can help?
trainer: @zach-huggingface @SunMarc
Information
- The official example scripts
- My own modified scripts
Tasks
- An officially supported task in the
examples
folder (such as GLUE/SQuAD, ...) - My own task or dataset (give details below)
Reproduction
transformers/src/transformers/modeling_utils.py
Line 5382 in 43d3b19
model._initialize_missing_keys(checkpoint_keys, ignore_mismatched_sizes, is_quantized) |
The function name is init missing keys, but the argument given is all keys.
Expected behavior
checkpoint_keys -> missing_keys + mismatched_keys
MengAiDev