Commit 43877a62 authored by Woosuk Kwon, committed by GitHub

[MRV2] Enable PP CUDA graph test (#37830)


Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
parent 63f49b8b
@@ -87,13 +87,12 @@ steps:
 - vllm/v1/worker/gpu/
 - vllm/v1/worker/gpu_worker.py
 - tests/distributed/test_pipeline_parallel.py
-#- tests/distributed/test_pp_cudagraph.py
+- tests/distributed/test_pp_cudagraph.py
 commands:
 - set -x
 - export VLLM_USE_V2_MODEL_RUNNER=1
 - pytest -v -s distributed/test_pipeline_parallel.py -k "not ray and not Jamba"
-# TODO: Uncomment once https://github.com/vllm-project/vllm/pull/35162 is merged.
-#- pytest -v -s distributed/test_pp_cudagraph.py -k "not ray"
+- pytest -v -s distributed/test_pp_cudagraph.py -k "not ray"
 - label: Model Runner V2 Spec Decode
 timeout_in_minutes: 30