Unverified Commit 740a1574 authored by Leandro von Werra's avatar Leandro von Werra Committed by GitHub
Browse files

fix link in performance docs (#17419)

parent 284fc6c0
...@@ -30,7 +30,7 @@ Training transformer models efficiently requires an accelerator such as a GPU or ...@@ -30,7 +30,7 @@ Training transformer models efficiently requires an accelerator such as a GPU or
Training large models on a single GPU can be challenging but there are a number of tools and methods that make it feasible. In this section methods such as mixed precision training, gradient accumulation and checkpointing, efficient optimizers, as well as strategies to determine the best batch size are discussed. Training large models on a single GPU can be challenging but there are a number of tools and methods that make it feasible. In this section methods such as mixed precision training, gradient accumulation and checkpointing, efficient optimizers, as well as strategies to determine the best batch size are discussed.
[Go to single GPU training section](perf_train_gpu_single) [Go to single GPU training section](perf_train_gpu_one)
### Multi-GPU ### Multi-GPU
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment