[Tutorial][PTD] Deprecate Training Transformer models using Distributed Data Parallel and Pipeline Parallelism
and redirect the page to parallelism APIs
#3356
Job | Run time |
---|---|
5m 20s | |
5m 21s | |
5m 20s | |
5m 20s | |
5m 20s | |
5m 18s | |
5m 19s | |
5m 19s | |
5m 20s | |
5m 21s | |
5m 23s | |
5m 21s | |
5m 21s | |
5m 21s | |
5m 22s | |
1s | |
1h 20m 7s |