- 
                Notifications
    You must be signed in to change notification settings 
- Fork 228
Pull requests: bigscience-workshop/Megatron-DeepSpeed
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
      Startup: add argument-consistency checks & summary table (Fixes #124)
      
    
      
  
        
          #409
            opened Jun 20, 2025  by
            MagellaX
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      fix(training): correct rank-zero log messages, Print total model size once at startup (rank-0) – Fixes #123
      
    
        
          #408
            opened Jun 20, 2025  by
            MagellaX
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Bump black from 21.4b0 to 24.3.0
        
              
                dependencies
  Pull requests that update a dependency file 
        
      
    
        
          #402
            opened Mar 20, 2024  by
            dependabot
            bot
        
        
            
    
  
    Loading…
 
        
        
      
    
      a branch combining  layer-norm-auto-sync and ds_ckpt_reshape
      
    
        
          #292
            opened Jun 29, 2022  by
            stas00
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Sync 4 layer norms - bf16, fp32, optimizer states on restart
      
    
        
          #274
            opened Mar 28, 2022  by
            tjruwase
            
        
        
            
    
  
    Loading…
 
        
        
      
    Previous Next
  
  
  ProTip!
  What’s not been updated in a month: updated:<2025-09-30.