You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add amax as default per-tensor scaling factor for fp8_gemm benchmark (#339)
Summary:
Pull Request resolved: #339
Add `amax` (absolute maximum) as the default scaling factor for per-tensor scaling with fp8 workloads, as is used in practice. Also add a command-line argument that allows the user to define a scaling factor for per-tensor scaling.
Reviewed By: NikhilAPatel
Differential Revision: D80577628
0 commit comments