Commit c425d60b authored by Sylvain Gugger's avatar Sylvain Gugger
Browse files

Fix link to deepspeed config

parent 68209044
...@@ -1707,13 +1707,13 @@ Work is being done to enable estimating how much memory is needed for a specific ...@@ -1707,13 +1707,13 @@ Work is being done to enable estimating how much memory is needed for a specific
## Non-Trainer Deepspeed Integration ## Non-Trainer Deepspeed Integration
The [`~integrations.HfDeepSpeedConfig`] is used to integrate Deepspeed into the 🤗 Transformers core The [`~deepspeed.HfDeepSpeedConfig`] is used to integrate Deepspeed into the 🤗 Transformers core
functionality, when [`Trainer`] is not used. functionality, when [`Trainer`] is not used.
When using [`Trainer`] everything is automatically taken care of. When using [`Trainer`] everything is automatically taken care of.
When not using [`Trainer`], to efficiently deploy DeepSpeed stage 3, you must instantiate the When not using [`Trainer`], to efficiently deploy DeepSpeed stage 3, you must instantiate the
[`~integrations.HfDeepSpeedConfig`] object before instantiating the model. [`~deepspeed.HfDeepSpeedConfig`] object before instantiating the model.
For example for a pretrained model: For example for a pretrained model:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment