Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenpangpang
transformers
Commits
7b702836
Unverified
Commit
7b702836
authored
Feb 05, 2024
by
Ziyang
Committed by
GitHub
Feb 05, 2024
Browse files
Support custom scheduler in deepspeed training (#26831)
Reuse trainer.create_scheduler to create scheduler for deepspeed
parent
ca8944c4
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
9 additions
and
7 deletions
+9
-7
src/transformers/integrations/deepspeed.py
src/transformers/integrations/deepspeed.py
+9
-7
No files found.
src/transformers/integrations/deepspeed.py
View file @
7b702836
...
...
@@ -14,7 +14,7 @@
"""
Integration with Deepspeed
"""
import
copy
import
importlib.metadata
as
importlib_metadata
import
importlib.util
import
weakref
...
...
@@ -27,7 +27,6 @@ from ..utils import is_accelerate_available, is_torch_available, logging
if
is_torch_available
():
import
torch
from
..optimization
import
get_scheduler
logger
=
logging
.
get_logger
(
__name__
)
...
...
@@ -341,12 +340,15 @@ def deepspeed_optim_sched(trainer, hf_deepspeed_config, args, num_training_steps
if
isinstance
(
optimizer
,
DummyOptim
):
def
_lr_scheduler_callable
(
optimizer
):
return
get_scheduler
(
trainer
.
args
.
lr_scheduler_type
,
optimizer
=
optimizer
,
num_warmup_steps
=
trainer
.
args
.
get_warmup_steps
(
num_training_steps
),
num_training_steps
=
num_training_steps
,
# create a shallow copy first, so later modifications do not affect original trainer
trainer_copy
=
copy
.
copy
(
trainer
)
# at the time _lr_scheduler_callable is called, trainer.lr_scheduler has been set
# update it to None so that we can re-create a new scheduler
trainer_copy
.
lr_scheduler
=
None
lr_scheduler
=
trainer_copy
.
create_scheduler
(
num_training_steps
=
num_training_steps
,
optimizer
=
optimizer
)
return
lr_scheduler
lr_scheduler
=
DummyScheduler
(
optimizer
,
lr_scheduler_callable
=
_lr_scheduler_callable
)
else
:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment