[BugFix][CPU] Fix `TorchSDPABackendImpl` doesn't have `use_irope` (#21200)

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>

[BugFix][CPU] Fix `TorchSDPABackendImpl` doesn't have `use_irope` (#21200)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
468e2400 · Lucas Wilkinson · GitHub · dcc6cfb9 · 468e2400
Unverified Commit 468e2400 authored Jul 19, 2025 by Lucas Wilkinson Committed by GitHub Jul 18, 2025
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 1 deletion

vllm/v1/worker/gpu_model_runner.py vllm/v1/worker/gpu_model_runner.py +2 -1

No files found.
--- a/vllm/v1/worker/gpu_model_runner.py
+++ b/vllm/v1/worker/gpu_model_runner.py
@@ -2668,7 +2668,8 @@ class GPUModelRunner(LoRAModelRunnerMixin):
            # TODO: Support other attention modules, e.g., cross-attention
            if attn_module.attn_type == AttentionType.DECODER:
                use_local_attention = (self.attention_chunk_size is not None
-                                       and attn_module.impl.use_irope)
+                                       and getattr(attn_module.impl,
+                                                   "use_irope", False))
                if attn_module.sliding_window is not None:
                    kv_cache_spec[layer_name] = SlidingWindowSpec(
                        block_size=block_size,