[feat] implement record_stream when using CUDA streams during group offloading
          
            #14230
        
      pr_flax_dependency_test.yml
on: pull_request
  
    
      check_flax_dependencies
    
          
              18s