Skip to content

Commit 8abfe00

Browse files
max_shard_size="5GB" for colab running
1 parent 45b6cb6 commit 8abfe00

File tree

1 file changed

+4
-3
lines changed

1 file changed

+4
-3
lines changed

scripts/convert_cogview3_to_diffusers.py

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -6,15 +6,15 @@
66
77
Example usage:
88
python scripts/convert_cogview3_to_diffusers.py \
9-
--original_state_dict_repo_id "THUDM/cogview3" \
9+
--original_state_dict_repo_id "THUDM/cogview3-sat" \
1010
--filename "cogview3.pt" \
1111
--transformer \
1212
--output_path "./cogview3_diffusers" \
1313
--dtype "bf16"
1414
1515
Alternatively, if you have a local checkpoint:
1616
python scripts/convert_cogview3_to_diffusers.py \
17-
--checkpoint_path '/raid/.cache/huggingface/models--ZP2HF--CogView3-SAT/snapshots/ca86ce9ba94f9a7f2dd109e7a59e4c8ad04121be/cogview3plus_3b/1/mp_rank_00_model_states.pt' \
17+
--checkpoint_path 'your path/cogview3plus_3b/1/mp_rank_00_model_states.pt' \
1818
--transformer \
1919
--output_path "/raid/yiyi/cogview3_diffusers" \
2020
--dtype "bf16"
@@ -26,6 +26,7 @@
2626
--transformer: Flag to convert the transformer model.
2727
--output_path: The path to save the converted model.
2828
--dtype: The dtype to save the model in (default: "bf16", options: "fp16", "bf16", "fp32").
29+
Default is "bf16" because CogView3 uses bfloat16 for Training.
2930
3031
Note: You must provide either --original_state_dict_repo_id or --checkpoint_path.
3132
"""
@@ -173,7 +174,7 @@ def main(args):
173174
transformer.load_state_dict(converted_transformer_state_dict, strict=True)
174175

175176
print(f"Saving CogView3 Transformer in Diffusers format in {args.output_path}/transformer")
176-
transformer.to(dtype).save_pretrained(f"{args.output_path}/transformer")
177+
transformer.to(dtype).save_pretrained(f"{args.output_path}/transformer", max_shard_size="5GB")
177178

178179
if len(original_ckpt) > 0:
179180
print(f"Warning: {len(original_ckpt)} keys were not converted and will be saved as is: {original_ckpt.keys()}")

0 commit comments

Comments
 (0)