"git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "0e83c9664b2a440ade59066a77fb01d0143e4d18"
Unverified Commit f4d4f0a1 authored by Laura Hanu's avatar Laura Hanu Committed by GitHub
Browse files

only load state dict when the checkpoint is not None (#16673)

parent d57da992
...@@ -1792,7 +1792,7 @@ class PreTrainedModel(nn.Module, ModuleUtilsMixin, GenerationMixin, PushToHubMix ...@@ -1792,7 +1792,7 @@ class PreTrainedModel(nn.Module, ModuleUtilsMixin, GenerationMixin, PushToHubMix
# load pt weights early so that we know which dtype to init the model under # load pt weights early so that we know which dtype to init the model under
if from_pt: if from_pt:
if not is_sharded: if not is_sharded and state_dict is None:
# Time to load the checkpoint # Time to load the checkpoint
state_dict = load_state_dict(resolved_archive_file) state_dict = load_state_dict(resolved_archive_file)
# set dtype to instantiate the model under: # set dtype to instantiate the model under:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment