Support for Zero3 or Zero3 Offload? Error when loading model state_dict #38

Hello,

I encountered an error while loading a model with the following code in `clip_encoder.py`:

```python
self.vision_tower.load_state_dict(torch.load(self.clip_model), strict=False)
```
The error message is as follows:
[rank0]:     raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
[rank0]: RuntimeError: Error(s) in loading state_dict for CLIP:
[rank0]:        size mismatch for visual.trunk.stem.0.weight: copying a param with shape torch.Size([192, 3, 4, 4]) from checkpoint, the shape in current model is torch.Size([0])


This happens only when I use `scripts/zero3_offload.json` or `scripts/zero3.json`.
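
To narrow this down, here is a minimal sketch of a check that separates genuine shape mismatches from parameters that only look empty. It assumes that a model-side shape of `torch.Size([0])` means the parameter has been partitioned by ZeRO-3, which I have not confirmed for this repository. The helper name `classify_mismatches` is hypothetical, and it works on plain name-to-shape mappings so it runs without a GPU or DeepSpeed.

```python
# Hypothetical helper, not part of this repository: classify shape mismatches
# between a checkpoint and a model before calling load_state_dict.
# Assumption: a model-side shape of (0,) marks a ZeRO-3 partitioned placeholder.


def classify_mismatches(model_shapes, checkpoint_shapes):
    """Return (partitioned, real_mismatch) lists of parameter names.

    Both arguments map parameter names to shape tuples, for example
    {name: tuple(p.shape) for name, p in model.named_parameters()} and
    {name: tuple(t.shape) for name, t in torch.load(path).items()}.
    """
    partitioned, real_mismatch = [], []
    for name, ckpt_shape in checkpoint_shapes.items():
        model_shape = model_shapes.get(name)
        if model_shape is None or model_shape == ckpt_shape:
            # Keys absent from the model are ignored with strict=False,
            # and equal shapes load without error.
            continue
        if model_shape == (0,):
            partitioned.append(name)    # looks like an empty placeholder
        else:
            real_mismatch.append(name)  # shapes genuinely disagree
    return partitioned, real_mismatch


# Shapes taken from the error message above.
model_shapes = {"visual.trunk.stem.0.weight": (0,)}
checkpoint_shapes = {"visual.trunk.stem.0.weight": (192, 3, 4, 4)}

partitioned, real_mismatch = classify_mismatches(model_shapes, checkpoint_shapes)
print("partitioned placeholders:", partitioned)
print("real mismatches:", real_mismatch)
```

If every reported name ends up in the first list, the checkpoint itself is probably fine and the weights would need to be loaded before the parameters are partitioned, or while they are temporarily gathered (DeepSpeed provides `deepspeed.zero.GatheredParameters` for that purpose). I have not verified either approach against this codebase.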
