Commit 8e1ff89
Universal Checkpoint: BC for older DeepSpeed (bigscience-workshop#271)
* Enable universal ckpting
* Update run scripts
* Address PR feedback
* Remove line
* Fix white lines
* Remove redundant changes
* Apply to gpt_model only
* Code cleanup
* Code cleanup
* Update training.py
Co-authored-by: Michael Wyatt <mrwyattii@gmail.com>
* Update training.py
Co-authored-by: Michael Wyatt <mrwyattii@gmail.com>
* Log loss_scale only valid for fp16
* Add README and bf16 scripts
* Visualization docs
* Support older DS
* Handle uni_ckpt import error
* Revert changes
---------
Co-authored-by: Michael Wyatt <mrwyattii@gmail.com>

1 parent ad0e1fd commit 8e1ff89
File tree
2 files changed (+35, -34 lines):
- examples_deepspeed/universal_checkpointing
- megatron/model