Commit 8c1889e

Merge pull request #54 from bigcode-project/loubnabnl-patch-1
Add tokens-per-second-gpu to the printed logs instead of just wandb
2 parents 3e22c9f + 1397ac0 commit 8c1889e

1 file changed: +1 -0 lines changed

megatron/training.py

Lines changed: 1 addition & 0 deletions
@@ -666,6 +666,7 @@ def training_log(loss_dict, total_loss_dict, learning_rate, iteration,
         log_string += ' number of nan iterations: {:3d} |'.format(
             total_loss_dict[nan_iters_key])
         log_string += ' TFLOPs: {:.2f} |'.format(tflops)
+        log_string += ' tokens-per-second-per-gpu: {:.2f} |'.format(tokens_per_sec_per_gpu)
         if args.log_memory_to_tensorboard and mem_stats is not None:
             log_string += ' mem-reserved (GB): {:.2f} |'.format(mem_stats["reserved_bytes.all.current"]*1e-9)
         total_loss_dict[advanced_iters_key] = 0
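
The diff only shows the new log field, not how `tokens_per_sec_per_gpu` is computed elsewhere in `training_log`. The sketch below shows one plausible way to derive such a value and format it as the added line does. The helper names `tokens_per_second_per_gpu` and `append_throughput`, their parameters, and the formula itself (tokens per iteration divided by iteration time and GPU count) are assumptions for illustration, not code taken from the repository.

```python
def tokens_per_second_per_gpu(global_batch_size, seq_length,
                              elapsed_time_per_iteration, world_size):
    """Hypothetical helper: average tokens processed per second on each GPU.

    Assumes every sample holds seq_length tokens and that the work is spread
    evenly over world_size GPUs. The real training.py may compute it differently.
    """
    if elapsed_time_per_iteration <= 0:
        raise ValueError("elapsed_time_per_iteration must be positive")
    if world_size <= 0:
        raise ValueError("world_size must be positive")
    tokens_per_iteration = global_batch_size * seq_length
    return tokens_per_iteration / (elapsed_time_per_iteration * world_size)


def append_throughput(log_string, tflops, tokens_per_sec_per_gpu):
    """Hypothetical helper: append the two throughput fields in the diff's format."""
    log_string += ' TFLOPs: {:.2f} |'.format(tflops)
    log_string += ' tokens-per-second-per-gpu: {:.2f} |'.format(tokens_per_sec_per_gpu)
    return log_string


if __name__ == "__main__":
    # Illustrative numbers only, not measurements from any real run.
    rate = tokens_per_second_per_gpu(
        global_batch_size=512, seq_length=2048,
        elapsed_time_per_iteration=8.0, world_size=64)
    print(append_throughput(' iteration 10 |', 150.0, rate))
```

With the field in the printed string, throughput can be read directly from stdout or a log file on runs where wandb is not enabled.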