[MRV2] Enable PP CUDA graph test (#37830)

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

[MRV2] Enable PP CUDA graph test (#37830)
Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
43877a62 · Woosuk Kwon · GitHub · 63f49b8b · 43877a62
Unverified Commit 43877a62 authored Mar 22, 2026 by Woosuk Kwon Committed by GitHub Mar 22, 2026
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 3 deletions

.buildkite/test_areas/model_runner_v2.yaml .buildkite/test_areas/model_runner_v2.yaml +2 -3

No files found.
--- a/.buildkite/test_areas/model_runner_v2.yaml
+++ b/.buildkite/test_areas/model_runner_v2.yaml
@@ -87,13 +87,12 @@ steps:
    - vllm/v1/worker/gpu/
    - vllm/v1/worker/gpu_worker.py
    - tests/distributed/test_pipeline_parallel.py
-    #- tests/distributed/test_pp_cudagraph.py
+    - tests/distributed/test_pp_cudagraph.py
  commands:
    - set -x
    - export VLLM_USE_V2_MODEL_RUNNER=1
    - pytest -v -s distributed/test_pipeline_parallel.py -k "not ray and not Jamba"
-    # TODO: Uncomment once https://github.com/vllm-project/vllm/pull/35162 is merged.
+    - pytest -v -s distributed/test_pp_cudagraph.py -k "not ray"
-    #- pytest -v -s distributed/test_pp_cudagraph.py -k "not ray"
 - label: Model Runner V2 Spec Decode
  timeout_in_minutes: 30