improve from_pretrained for zero3 multi gpus mode (#24964)

* improve from_pretrained for zero3 multi gpus mode
* Add check if torch.distributed.is_initialized
* Revert torch.distributed

---------

Co-authored-by: Stas Bekman <stas@stason.org>

ea41e18c · Ivan Sorokin · GitHub · 95f96b45 · ea41e18c
Unverified Commit ea41e18c authored Jul 21, 2023 by Ivan Sorokin Committed by GitHub Jul 21, 2023

Showing with 5 additions and 1 deletion

src/transformers/modeling_utils.py +5 -1

--- a/src/transformers/modeling_utils.py
+++ b/src/transformers/modeling_utils.py
@@ -457,7 +457,11 @@ def load_state_dict(checkpoint_file: Union[str, os.PathLike]):
            )
        return safe_load_file(checkpoint_file)
    try:
-        return torch.load(checkpoint_file, map_location="cpu")
+        if is_deepspeed_zero3_enabled() and torch.distributed.is_initialized() and torch.distributed.get_rank() > 0:
+            map_location = "meta"
+        else:
+            map_location = "cpu"
+        return torch.load(checkpoint_file, map_location=map_location)
    except Exception as e:
        try:
            with open(checkpoint_file) as f:
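
The branch added in this hunk can be read in isolation as a small decision rule: under DeepSpeed ZeRO-3 with an initialized process group, only rank 0 loads real tensors onto the CPU, while every other rank maps the checkpoint to the `meta` device, presumably so that those ranks do not each hold a full copy of the weights in host memory. The sketch below is illustrative only: `choose_map_location` and its three parameters are hypothetical names that stand in for `is_deepspeed_zero3_enabled()`, `torch.distributed.is_initialized()` and `torch.distributed.get_rank()`, so that the logic can be run without torch or a distributed launch.

```python
def choose_map_location(zero3_enabled: bool, dist_initialized: bool, rank: int) -> str:
    """Hypothetical helper mirroring the condition added in this commit.

    The arguments replace the real runtime checks so the rule can be
    exercised without torch, DeepSpeed, or multiple processes.
    """
    if zero3_enabled and dist_initialized and rank > 0:
        # Non-zero ranks under ZeRO-3: tensors are mapped to the meta
        # device, which carries shapes and dtypes but no storage.
        return "meta"
    # Rank 0, single-process runs, and non-ZeRO-3 runs keep the old behavior.
    return "cpu"


# Simulate a 4-process ZeRO-3 job: only rank 0 materializes the weights.
locations = [choose_map_location(True, True, rank) for rank in range(4)]
print(locations)  # ['cpu', 'meta', 'meta', 'meta']

# Without an initialized process group the previous "cpu" default applies.
print(choose_map_location(True, False, 0))  # cpu
```

Note that the `is_initialized()` check comes before `get_rank()` in the real condition, so the rank is only queried once a process group exists.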