[CI/Build][AMD] Skip test on test_hybrid_attention_mamba_tensor_shapes on...

[CI/Build][AMD] Skip test on test_hybrid_attention_mamba_tensor_shapes on ROCm, requires FLASHINFER (#29995) Signed-off-by: Randall Smith <ransmith@amd.com> Co-authored-by: Randall Smith <ransmith@amd.com>

[CI/Build][AMD] Skip test on test_hybrid_attention_mamba_tensor_shapes on...
[CI/Build][AMD] Skip test on test_hybrid_attention_mamba_tensor_shapes on ROCm, requires FLASHINFER (#29995) Signed-off-by: Randall Smith <ransmith@amd.com> Co-authored-by: Randall Smith <ransmith@amd.com>
f2f4cea6 · rasmith · GitHub · dfdda967 · f2f4cea6
Unverified Commit f2f4cea6 authored Dec 04, 2025 by rasmith Committed by GitHub Dec 04, 2025
Show whitespace changes
Inline Side-by-side

Showing with 4 additions and 0 deletions

tests/v1/worker/test_gpu_model_runner.py tests/v1/worker/test_gpu_model_runner.py +4 -0

No files found.
--- a/tests/v1/worker/test_gpu_model_runner.py
+++ b/tests/v1/worker/test_gpu_model_runner.py
@@ -761,6 +761,10 @@ def test_init_kv_cache_with_kv_sharing_valid():
    assert kv_cache_config_after_init.kv_cache_groups[0].layer_names[1] == layer_1


+@pytest.mark.skipif(
+    current_platform.is_rocm(),
+    reason="Attention backend FLASHINFER is not supported on ROCm.",
+)
 def test_hybrid_attention_mamba_tensor_shapes(monkeypatch):
    """
    The GPU model runner creates different views into the