[Tutorial][PTD] Deprecate Training Transformer models using Distributed Data Parallel and Pipeline Parallelism
and redirect the page to parallelism APIs
#167
Job | Run time |
---|---|
7s | |
7s |