Commit 43798966 authored by Tri Dao

[Docs] Fix formatting

parent 3c7cbfc1
@@ -156,13 +156,13 @@ python run.py experiment=pile/gpt3-2.7B-flash-hdim128 trainer.devices=8 # 2.7B
```
The default parameters are set for 8 x A100 80GB. We train with bf16 by default.
-To train with rotary embedding, run the experiments `pile/gpt3{s,m,l,xl**-flash-rotary**.
+To train with rotary embedding, run the experiments `pile/gpt3{s,m,l,xl}-flash-rotary`.
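For instance, expanding one of the experiment names above and reusing the `trainer.devices` flag from the earlier command (a sketch; the device count is illustrative):
```
# Sketch: GPT-3-small with rotary embedding on 8 GPUs; experiment name
# expanded from pile/gpt3{s,m,l,xl}-flash-rotary, device count illustrative.
python run.py experiment=pile/gpt3s-flash-rotary trainer.devices=8
```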
### Training options
**Gradient accumulation**: to adjust device batch size to fit into GPU memory
(the global batch size stays the same, and gradient accumulation is calculated
-automatically), set `datamodule.batch_size=blah**.
+automatically), set `datamodule.batch_size=blah`.
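A sketch of what this looks like in practice (the batch size value is illustrative, not a recommendation):
```
# Sketch: fit the 2.7B model into memory with a smaller per-device batch;
# gradient accumulation is computed automatically so the global batch size
# stays the same. The value 4 is illustrative.
python run.py experiment=pile/gpt3-2.7B-flash-hdim128 trainer.devices=8 datamodule.batch_size=4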
**Multi-node**: to train on multiple nodes, add `trainer.num_nodes=blah`.
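As a sketch, assuming `trainer.num_nodes` composes with `trainer.devices` in the usual PyTorch Lightning way (`trainer.devices` counts GPUs per node; the node count here is illustrative):
```
# Sketch: the same experiment on 2 nodes x 8 GPUs each (node count illustrative).
python run.py experiment=pile/gpt3-2.7B-flash-hdim128 trainer.devices=8 trainer.num_nodes=2
```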