Unverified Commit c9bae84e authored by Sourab Mangrulkar, committed by GitHub

Resolving AttributeError when using the FSDP RAM-efficient feature (#25820)

Fix bug: pass `model_to_load` instead of `model` when materializing meta-device parameters during FSDP RAM-efficient loading.
parent 77713d11
@@ -3574,11 +3574,11 @@ class PreTrainedModel(nn.Module, ModuleUtilsMixin, GenerationMixin, PushToHubMix
                             if param.device == torch.device("meta"):
                                 if not (is_quantized):
                                     set_module_tensor_to_device(
-                                        model, key, "cpu", torch.empty(*param.size(), dtype=dtype)
+                                        model_to_load, key, "cpu", torch.empty(*param.size(), dtype=dtype)
                                     )
                                 else:
                                     set_module_quantized_tensor_to_device(
-                                        model, key, "cpu", torch.empty(*param.size(), dtype=dtype)
+                                        model_to_load, key, "cpu", torch.empty(*param.size(), dtype=dtype)
                                     )
                     else:
                         error_msgs += _load_state_dict_into_model(model_to_load, state_dict, start_prefix)
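For context, here is a minimal, self-contained sketch of the operation the patched lines perform: using accelerate's set_module_tensor_to_device to replace a meta-device parameter with a real CPU tensor, as FSDP's RAM-efficient loading does on non-zero local ranks. The toy nn.Linear module and float32 dtype below are illustrative assumptions, not the PreTrainedModel code; the point of the fix is that the module passed as the first argument must be the same object the parameter names were taken from (model_to_load), otherwise the attribute lookup for `key` fails.

# Minimal sketch (illustrative, not the Transformers source).
import torch
from torch import nn
from accelerate.utils import set_module_tensor_to_device

# Hypothetical toy module created on the meta device: shapes only, no storage.
with torch.device("meta"):
    model_to_load = nn.Linear(4, 4)

for key, param in list(model_to_load.named_parameters()):
    if param.device == torch.device("meta"):
        # Materialize the parameter on CPU with an empty tensor of the same shape.
        # The first argument must be the module that owns `key` (model_to_load),
        # which is what the commit changes in the PreTrainedModel loading path.
        set_module_tensor_to_device(
            model_to_load, key, "cpu", torch.empty(*param.size(), dtype=torch.float32)
        )

print(model_to_load.weight.device)  # cpu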