Optionally, you can install flash-attn to use Flash Attention 2.
This requires a CUDA compiler (`nvcc`) to be set up on your system.
```bash
# (Optional) flash-attn
pip install flash-attn==2.5.8
```
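
Because the flash-attn install depends on a CUDA compiler, it can help to check for one before running `pip`. The following is a minimal sketch, not part of the repository: `check_flash_attn_prereqs` is a hypothetical helper name, and it only tests whether `nvcc` is on your `PATH`. It does not verify the CUDA version.

```bash
# Hypothetical pre-flight check (an assumption for illustration, not repository code).
check_flash_attn_prereqs() {
    # nvcc is the CUDA compiler driver; look it up on PATH without running it.
    if command -v nvcc >/dev/null 2>&1; then
        echo "nvcc found: $(command -v nvcc)"
    else
        echo "nvcc not found on PATH"
        return 1
    fi
}

# Report the result; skip the optional install when no compiler is available.
check_flash_attn_prereqs || echo "Skipping the optional flash-attn install."
```

After a successful install, `python -c "import flash_attn"` should exit without an error.
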
## Docker container

If you prefer to run your experiments in a reproducible container, you can use our pre-built Docker image, which contains the repository and pre-installed dependencies.
@@ -86,13 +104,13 @@ The [multiaddress](https://github.com/multiformats/multiaddr) strings listed aft
## Stopping hivemind runs

The current implementation of hivemind doesn't handle the Ctrl+C keyboard interrupt well. You can stop the runs using `pkill`:
```bash
pkill -f torchrun
```
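
If some processes survive the default signal, one possible approach is to wait briefly and then escalate. The sketch below assumes `pgrep` and `pkill` from procps are available. `stop_runs`, its default pattern, and its grace period are hypothetical choices for illustration, not something the repository provides.

```bash
# Hypothetical helper (an assumption for illustration, not repository code).
# Usage: stop_runs [pattern] [grace_seconds]
stop_runs() {
    local pattern="${1:-torchrun}"
    local grace="${2:-10}"   # seconds to wait before escalating to SIGKILL

    # Nothing to do if no process command line matches the pattern.
    if ! pgrep -f "$pattern" >/dev/null; then
        echo "no processes matching '$pattern'"
        return 0
    fi

    # pkill sends SIGTERM by default, giving processes a chance to exit cleanly.
    pkill -f "$pattern" || true

    # Poll once per second until the processes exit or the grace period ends.
    local waited=0
    while pgrep -f "$pattern" >/dev/null && [ "$waited" -lt "$grace" ]; do
        sleep 1
        waited=$((waited + 1))
    done

    # Force-kill anything that is still running.
    if pgrep -f "$pattern" >/dev/null; then
        pkill -KILL -f "$pattern" || true
    fi
    echo "stopped processes matching '$pattern'"
}
```

Note that `-f` matches against the full command line, so any process whose arguments contain the pattern is affected. Running `pgrep -f torchrun` first shows which process IDs would be signalled.
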
## Resuming from checkpoint

To resume from a checkpoint, you can pass the `--resume-from-checkpoint` argument to the training script, e.g.