README.md (1 addition, 1 deletion)
@@ -104,7 +104,7 @@ We disable all speedup methods by default. Here are details of some key arguments
- `--enable_modulate_kernel`: Whether to enable the modulate kernel optimization, which speeds up training. The default value is `False`. Recommended for GPUs below the H100.
- `--enable_layernorm_kernel`: Whether to enable the layernorm kernel optimization, which speeds up training. The default value is `False`. Enabling it is recommended.
- `--enable_flashattn`: Whether to enable FlashAttention, which speeds up training. The default value is `False`. Enabling it is recommended.
-- `--sequence_parallel_size`: The sequence parallelism size. Will enable sequence parallelism when setting a value > 1. The defualt value is 1. Recommend to disable it if memory is enough.
+- `--sequence_parallel_size`: The sequence parallelism size. Setting a value > 1 enables sequence parallelism. The default value is 1. Recommended to keep it disabled if memory is sufficient.
For more details on configuring the training process, please refer to our code.
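
The repository's actual argument parsing may be defined differently, but as a rough illustration of how these options fit together, here is a minimal argparse sketch consistent with the descriptions above. Only the flag names and their defaults come from the README; the parser setup and help strings are assumptions.

```python
# Illustrative sketch only -- not the repository's actual parser.
# Flag names and defaults follow the README; everything else is assumed.
import argparse

parser = argparse.ArgumentParser(
    description="Speedup-related training options (illustrative)")
parser.add_argument("--enable_modulate_kernel", action="store_true",
                    help="Enable the modulate kernel optimization (disabled by default).")
parser.add_argument("--enable_layernorm_kernel", action="store_true",
                    help="Enable the layernorm kernel optimization (disabled by default).")
parser.add_argument("--enable_flashattn", action="store_true",
                    help="Enable FlashAttention (disabled by default).")
parser.add_argument("--sequence_parallel_size", type=int, default=1,
                    help="Sequence parallelism size; values > 1 enable sequence parallelism.")

args = parser.parse_args()
print(args)
```

Under this sketch, an invocation would pass something like `--enable_layernorm_kernel --enable_flashattn --sequence_parallel_size 2`; with the default value of 1, sequence parallelism stays disabled.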