Unverified Commit b340d907 authored by Jiewen Tan's avatar Jiewen Tan Committed by GitHub

[PyTorch/XLA] Fix extra TPU compilations introduced by recent changes (#29158)

* tmp

* Remove debug step

* Fix a typo

* Move to is_torch_xla_available
parent 1e21c4fb
@@ -1364,7 +1364,7 @@ class PreTrainedModel(nn.Module, ModuleUtilsMixin, GenerationMixin, PushToHubMix
                 hard_check_only=False,
                 check_device_map=check_device_map,
             )
-        elif requested_attn_implementation in [None, "sdpa"]:
+        elif requested_attn_implementation in [None, "sdpa"] and not is_torch_xla_available():
             # use_flash_attention_2 takes priority over SDPA, hence SDPA treated in this elif.
             config = cls._check_and_enable_sdpa(
                 config,
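The diff gates the SDPA probe on XLA availability: on TPU, attempting to check and enable SDPA can trigger extra graph compilations, so the SDPA branch is skipped when torch_xla is present. A minimal sketch of that dispatch pattern, with the availability flag passed in explicitly (the function and config shape here are illustrative stand-ins, not the actual transformers internals):

```python
def select_attn_implementation(requested, xla_available, config):
    """Sketch of the attention-backend dispatch gated on XLA.

    `requested` is the user-requested implementation (or None),
    `xla_available` stands in for is_torch_xla_available(), and
    `config` is a plain dict standing in for the model config.
    """
    if requested == "flash_attention_2":
        config["attn_implementation"] = "flash_attention_2"
    elif requested in [None, "sdpa"] and not xla_available:
        # use_flash_attention_2 takes priority over SDPA, hence this elif.
        # Skipped under XLA to avoid extra TPU compilations.
        config["attn_implementation"] = "sdpa"
    else:
        # Fall back to the eager attention path.
        config["attn_implementation"] = "eager"
    return config


print(select_attn_implementation("sdpa", False, {}))  # SDPA allowed off-XLA
print(select_attn_implementation("sdpa", True, {}))   # falls back under XLA
```

The key design point is that `None` and `"sdpa"` share a branch (SDPA is the default when nothing is requested), and the XLA check short-circuits both to the eager fallback.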