Commit 7eb083c2, authored Feb 26, 2021 by Stas Bekman. Committed by GitHub, Feb 26, 2021.
document the requirement to call for all ranks (#801)
parent 490e6f7c
Showing 2 changed files with 8 additions and 0 deletions
deepspeed/runtime/engine.py +5 -0
docs/_tutorials/getting-started.md +3 -0
deepspeed/runtime/engine.py
...
...
@@ -1471,6 +1471,11 @@ class DeepSpeedEngine(Module):
used if not provided. Tag name must be the same across all ranks.
client_state: Optional. State dictionary used for saving required training states in the client code.
save_latest: Optional. Save a file 'latest' pointing to the latest saved checkpoint.
+Important: all processes must call this method and not just the process with rank 0. It is
+because each process needs to save its master weights and scheduler+optimizer states. This
+method will hang waiting to synchronize with other processes if it's called just for the
+process with rank 0.
"""
# This is to make sure the checkpoint names are created without collision
...
...
docs/_tutorials/getting-started.md
...
...
@@ -127,6 +127,9 @@ accepts a client state dictionary `client_sd` for saving. These items can be
retrieved from `load_checkpoint` as a return argument. In the example above,
the `step` value is stored as part of the `client_sd`.
+Important: all processes must call this method and not just the process with rank 0. It is because
+each process needs to save its master weights and scheduler+optimizer states. This method will hang
+waiting to synchronize with other processes if it's called just for the process with rank 0.
## DeepSpeed Configuration
DeepSpeed features can be enabled, disabled, or configured using a config JSON
...
...
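The requirement documented in this commit can be illustrated with a minimal sketch. It assumes a training loop where each process holds its own engine and knows its rank. `StubEngine` and `checkpoint_step` are hypothetical names introduced here so the snippet runs without DeepSpeed installed. The stub's `save_checkpoint` parameters mirror the ones named in the docstring above, but it is not the library's implementation.

```python
# Sketch only: StubEngine is a hypothetical stand-in for the engine object
# a real training script would use. It records calls instead of writing files.
class StubEngine:
    def __init__(self, rank):
        self.rank = rank
        self.saved = []

    def save_checkpoint(self, save_dir, tag=None, client_state=None, save_latest=True):
        # A real engine would write this rank's weights and optimizer/scheduler
        # states here and then synchronize with the other ranks.
        self.saved.append((save_dir, tag, dict(client_state or {})))


def checkpoint_step(engine, save_dir, step, rank):
    """Save a checkpoint from this rank; meant to be called on every rank."""
    client_sd = {"step": step}
    # No `if rank == 0:` guard around this call: every process must make it,
    # and the tag is identical on all ranks.
    engine.save_checkpoint(save_dir, tag=f"step{step}", client_state=client_sd)
    # Work that really is rank-0-only (for example logging) can still be guarded.
    if rank == 0:
        return f"saved checkpoint step{step} to {save_dir}"
    return None


# Simulate four ranks inside one process, for illustration only.
engines = [StubEngine(rank) for rank in range(4)]
messages = [checkpoint_step(e, "checkpoints", 100, e.rank) for e in engines]
```

The design point is where the rank check goes: it wraps only the side effects that belong to one process, never the `save_checkpoint` call itself. Wrapping the call in a rank-0 guard is the pattern that, per the note above, hangs while waiting for the other processes.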