add arxiv link to readme

samsja · web-flow · commit b45995395f38 · 2024-07-10T18:02:07.000-07:00
diff --git a/README.md b/README.md
@@ -1,6 +1,6 @@
 # OpenDiLoCo
 
-This repository contains the training code and experiment results for the paper "OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training".
+This repository contains the training code and experiment results for the paper [OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training](https://arxiv.org/abs/2407.07852).
 
 # Setup
 
@@ -303,4 +303,4 @@ We recommend using `bf16` to avoid scaling and desynchronization issues with hiv
 
 3. `torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate...`
     A possible culprit is that your `--per-device-train-batch-size` is too high.
-    Try a smaller value.
+    Try a smaller value.