Commit 3cd99f4

missyliushanyi15 authored and committed
update multi_cluster/index_en.rst (#9790)
* Update doc/v2/howto/cluster/multi_cluster/index_en.rst
* update
1 parent 2d57158 commit 3cd99f4

1 file changed: doc/v2/howto/cluster/multi_cluster/index_en.rst (26 additions, 10 deletions)

@@ -1,19 +1,35 @@
 Use different clusters
 ======================
 
-PaddlePaddle supports running jobs on several platforms including:
-- `Kubernetes <http://kubernetes.io>`_ open-source system for automating deployment, scaling, and management of containerized applications from Google.
-- `OpenMPI <https://www.open-mpi.org>`_ Mature high performance parallel computing framework.
-- `Fabric <http://www.fabfile.org>`_ A cluster management tool. Write scripts to submit jobs or manage the cluster.
+The user's cluster environment is not the same. To facilitate everyone's deployment, we provide a variety of cluster deployment methods to facilitate the submission of cluster training tasks, which will be introduced as follows:
 
-We'll introduce cluster job management on these platforms. The examples can be found under `cluster_train_v2 <https://github.com/PaddlePaddle/Paddle/tree/develop/paddle/scripts/cluster_train_v2>`_ .
+`Kubernetes <http://kubernetes.io>`_ is a scheduling framework of Google open source container cluster, supporting a complete cluster solution for large-scale cluster production environment. The following guidelines show PaddlePaddle's support for Kubernetes:
 
-These cluster platforms provide API or environment variables for training processes, when the job is dispatched to different nodes. Like node ID, IP or total number of nodes etc.
+.. toctree::
+  :maxdepth: 1
+
+  k8s_cn.md
+  k8s_distributed_cn.md
+
+`OpenMPI <https://www.open-mpi.org>`_ is a mature high-performance parallel computing framework, which is widely used in the field of HPC. The following guide describes how to use OpenMPI to build PaddlePaddle's cluster training task:
 
 .. toctree::
   :maxdepth: 1
 
-  fabric_en.md
-  openmpi_en.md
-  k8s_en.md
-  k8s_aws_en.md
+  openmpi_cn.md
+
+`Fabric <http://www.fabfile.org>`_ is a convenient tool for program deployment and management. We provide a way to deploy and manage with Fabric. If you want to know more about it, please read the following guidelines:
+
+.. toctree::
+  :maxdepth: 1
+
+  fabric_cn.md
+
+We also support the deployment of PaddlePaddle on AWS. Learn more about:
+
+.. toctree::
+  :maxdepth: 1
+
+  k8s_aws_cn.md
+
+The examples can be found under `cluster_train_v2 <https://github.com/PaddlePaddle/Paddle/tree/develop/paddle/scripts/cluster_train_v2>`_ .
