Unverified commit 03db5910 authored by Sourab Mangrulkar, committed by GitHub

shift torch dynamo handling to accelerate (#23168)

* mixed precision support via accelerate

* fix issues

* fix for the sharded ddp case

* fix flax and tf failing tests

* refactor the place to create `Accelerator` object

* move ddp prep to accelerate

* fix 😅

* resolving comments

* move fsdp handling to accelerate

* fixes

* fix saving

* shift torch dynamo handling to accelerate
parent 0b774074
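
For orientation, the handshake this commit sets up looks roughly like the sketch below: the Trainer side only exports `ACCELERATE_DYNAMO_*` environment variables, and the `Accelerator` it constructs later picks them up through its dynamo plugin and applies `torch.compile` inside `prepare()`. This is a minimal illustration outside the Trainer, not code from this PR; the toy model and the chosen backend/mode values are placeholders.

```python
import os

import torch
from accelerate import Accelerator

# Mirror what TrainingArguments now does: export the dynamo settings so that
# the Accelerator created later can discover them from the environment.
os.environ["ACCELERATE_DYNAMO_BACKEND"] = "inductor"   # placeholder backend
os.environ["ACCELERATE_DYNAMO_MODE"] = "default"       # placeholder mode

# The Accelerator reads ACCELERATE_DYNAMO_* at construction time and applies
# torch.compile during prepare(), after any DDP/FSDP wrapping it performs.
accelerator = Accelerator()
model = torch.nn.Linear(8, 2)
model = accelerator.prepare(model)  # returns a compiled module when a backend is set
```

This is why the Trainer no longer needs to call `torch.compile` itself: compilation now happens inside `accelerator.prepare()`, where Accelerate already guarantees it runs after DDP/FSDP wrapping.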
src/transformers/trainer.py
```diff
@@ -1559,11 +1559,6 @@ class Trainer:
             self.accelerator.ddp_handler = DistributedDataParallelKwargs(**kwargs)
 
-        # torch.compile() needs to be called after wrapping the model with FSDP or DDP
-        # to ensure that it accounts for the graph breaks required by those wrappers
-        if self.args.torch_compile:
-            model = torch.compile(model, backend=self.args.torch_compile_backend, mode=self.args.torch_compile_mode)
-
         return model
 
     def train(
```
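
For reference, the deleted block above amounts to the following standalone call, with literal values standing in for `self.args.torch_compile_backend` and `self.args.torch_compile_mode` (the toy model is a placeholder):

```python
import torch

# Stand-in for the (possibly DDP/FSDP-wrapped) model the Trainer had built.
model = torch.nn.Sequential(torch.nn.Linear(8, 8), torch.nn.ReLU(), torch.nn.Linear(8, 2))

backend = "inductor"  # stood for self.args.torch_compile_backend
mode = "default"      # stood for self.args.torch_compile_mode

# The removed Trainer code handed the already-wrapped module to torch.compile.
compiled = torch.compile(model, backend=backend, mode=mode)

out = compiled(torch.randn(4, 8))  # first call triggers graph capture and compilation
```

After this commit the same compilation step is performed by Accelerate inside `accelerator.prepare()`, so the ordering concern documented in the removed comment no longer needs to live in the Trainer.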
src/transformers/training_args.py
```diff
@@ -1371,6 +1371,15 @@ class TrainingArguments:
             self.torch_compile = True
         if self.torch_compile and self.torch_compile_backend is None:
             self.torch_compile_backend = "inductor"
+
+        # accelerate integration for torch compile
+        if self.torch_compile:
+            # set env vars for accelerate
+            prefix = "ACCELERATE_DYNAMO_"
+            os.environ[prefix + "BACKEND"] = self.torch_compile_backend
+            if self.torch_compile_mode is not None:
+                os.environ[prefix + "MODE"] = self.torch_compile_mode
+
         if self.framework == "pt" and is_torch_available() and self.torch_compile:
             if is_torch_tf32_available():
                 if self.tf32 is None and not (self.fp16 or self.bf16):
```
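
Taken together with the defaulting logic above, enabling `torch_compile` on `TrainingArguments` is now enough to populate the Accelerate-side variables. A minimal check of that observable effect, assuming a PyTorch build recent enough for `torch.compile` (the `output_dir` value is a placeholder), might look like:

```python
import os

from transformers import TrainingArguments

# In a fresh process, enabling torch_compile defaults the backend to "inductor"
# and exports it for the Accelerator that the Trainer will create later.
args = TrainingArguments(output_dir="tmp_trainer", torch_compile=True)

print(args.torch_compile_backend)               # "inductor"
print(os.environ["ACCELERATE_DYNAMO_BACKEND"])  # "inductor"

# The mode variable is only exported when torch_compile_mode is passed explicitly.
print(os.environ.get("ACCELERATE_DYNAMO_MODE"))  # None here
```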