Skip to content

v0.4.4

Choose a tag to compare

@feifeibear feifeibear released this 08 Dec 03:19
· 100 commits to master since this release
f5fee95

The system is successfully evaluated on a multi-node system.
The benchmark scripts are integrated with memory-centric tiling borrowed from DeepSpeed.
It trains an 18B model on WeChat Yard.