Actions: NVIDIA/Megatron-LM
Actions
2,500+ workflow runs
2,500+ workflow runs
scatter_gather_tensors_in_pipeline argument
Community Bot
#9753:
Issue comment #4140 (comment)
created
by
copy-pr-bot
bot
mtp_use_repeated_layer behavior for GPT models
Community Bot
#9740:
Issue comment #3965 (comment)
created
by
Phlip79