Skip to content

Commit 33c8ff7

Browse files
Updated FT docs (#822)
Minor updates for better rendering in FT docs --------- Signed-off-by: Ann Kuruvilla <akuruvil@qti.qualcomm.com>
1 parent 9bcab61 commit 33c8ff7

File tree

1 file changed

+13
-5
lines changed

1 file changed

+13
-5
lines changed

docs/source/finetune.md

Lines changed: 13 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -75,6 +75,8 @@ This enables scaling training across multiple nodes.
7575

7676
Use servers with compatible/same network interface(eg:ethernet).
7777

78+
And supported only for linux servers now. Use servers connected to same switch for benefits in time while scaling.
79+
7880
```
7981
PYTHONUNBUFFERED: make python prints unbuffered, especially useful to identify progress (or lack thereof) for distributed tasks.This is optional and not compulsory
8082
```
@@ -102,13 +104,13 @@ Steps to run Multi Node Finetuning:
102104

103105
Run the following docker setup commands on both machines (server and client).
104106

105-
# Expose QAIC accelerator devices
107+
#### Expose QAIC accelerator devices
106108

107109
```
108110
devices=(/dev/accel/*)
109111
```
110112

111-
# Start Docker container
113+
#### Start Docker container
112114

113115
```
114116
sudo docker run -it \
@@ -127,18 +129,24 @@ In distributed ML setups, all nodes must resolve each other’s hostnames. If DN
127129

128130
2. Set QAIC Device Visibility
129131

130-
``` export QAIC_VISIBLE_DEVICES=$(seq -s, 0 63)
132+
```
133+
export QAIC_VISIBLE_DEVICES=$(seq -s, 0 63)
134+
131135
```
132136

133-
This exposes devices 0–63 to the training process.
137+
For example this sample command exposes devices 0–63 to the training process.
134138

135139
3. Activate the TORCH_QAIC Environment Inside the Container
136140

137141
```
138142
source /opt/torch-qaic-env/bin/activate
139143
```
140144

141-
4. Verify that the Qefficient Library is installed
145+
4. Verify that the Qefficient Library is installed:
146+
147+
```
148+
pip install -e .
149+
```
142150

143151

144152
5. Use below command on host server

0 commit comments

Comments
 (0)