Unverified Commit e52f1cb6 authored by Serizao, committed by GitHub

Update training_args.py - addition of self.distributed_state when using XPU (#25999)



* Update training_args.py

The distributed state was missing, so lines 1813-1814 failed because the value was undefined.

* Update training_args.py
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

---------
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
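
For context, a minimal sketch of the failure mode this patch fixes. The class and method below are hypothetical stand-ins for `TrainingArguments._setup_devices`; only the missing-attribute pattern comes from this patch, the rest is illustrative:

```python
# Hypothetical stand-in for TrainingArguments: before the patch, the XPU
# branch configured the device but never assigned self.distributed_state,
# so a later read of that attribute raised an error.
class ArgsBeforePatch:
    def _setup_devices(self):
        # XPU branch: device configured, distributed_state never set
        self.device_name = "xpu:0"


class ArgsAfterPatch:
    def _setup_devices(self):
        self.device_name = "xpu:0"
        # the fix: create the state object in this branch as well
        self.distributed_state = object()  # stands in for accelerate.PartialState


args = ArgsBeforePatch()
args._setup_devices()
try:
    _ = args.distributed_state  # the read around lines 1813-1814 that failed
except AttributeError as exc:
    print("before patch:", exc)

args = ArgsAfterPatch()
args._setup_devices()
print("after patch:", args.distributed_state is not None)
```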
parent 0fced067
@@ -1803,6 +1803,7 @@ class TrainingArguments:
             torch.cuda.set_device(device)
         elif is_torch_xpu_available() and "ACCELERATE_USE_XPU" not in os.environ:
             os.environ["ACCELERATE_USE_XPU"] = "true"
+            self.distributed_state = PartialState(timeout=timedelta(seconds=self.ddp_timeout))
             device = torch.device("xpu:0")
             self._n_gpu = 1
         elif is_sagemaker_dp_enabled():
...
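
For readers unfamiliar with the added call: `PartialState` comes from the `accelerate` library, and extra keyword arguments such as `timeout` are forwarded to the distributed process-group setup when one is initialized. A hedged usage sketch, assuming `accelerate` is installed; the 1800-second value is illustrative, standing in for `self.ddp_timeout`:

```python
from datetime import timedelta

from accelerate import PartialState

# Mirrors the added line; in the real patch the timeout comes from
# self.ddp_timeout. The timeout only takes effect when a distributed
# backend is actually initialized; on a single process it is unused.
state = PartialState(timeout=timedelta(seconds=1800))
print(state.device)            # e.g. an XPU device on an XPU machine
print(state.distributed_type)  # DistributedType.NO when single-process
```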