OpenDAS / vllm_cscc
Commit d7166e74 (Unverified)
Authored Mar 04, 2026 by Stefano Castagnetta
Committed by GitHub, Mar 04, 2026
[CI] Add Blackwell AsyncTP correctness test (#35871)
Signed-off-by: Stefano Castagnetta <scastagnetta@nvidia.com>
parent 417fd28f
Changes: 2 changed files with 15 additions and 0 deletions
.buildkite/test_areas/compile.yaml  +10 -0
tests/compile/correctness_e2e/test_async_tp.py  +5 -0
.buildkite/test_areas/compile.yaml @ d7166e74
@@ -36,6 +36,16 @@ steps:
   - export VLLM_TEST_CLEAN_GPU_MEMORY=1
   - pytest -v -s tests/compile/correctness_e2e/test_async_tp.py

+- label: AsyncTP Correctness Tests (B200)
+  timeout_in_minutes: 50
+  working_dir: "/vllm-workspace/"
+  device: b200
+  optional: true
+  num_devices: 2
+  commands:
+  - export VLLM_TEST_CLEAN_GPU_MEMORY=1
+  - pytest -v -s tests/compile/correctness_e2e/test_async_tp.py
+
 - label: Distributed Compile Unit Tests (2xH100)
   timeout_in_minutes: 20
   working_dir: "/vllm-workspace/"
tests/compile/correctness_e2e/test_async_tp.py @ d7166e74
@@ -31,7 +31,12 @@ def test_async_tp_pass_correctness(
     distributed_backend: str,
     eager_mode: bool,
     num_gpus_available: int,
+    monkeypatch,
 ):
+    # Disable FlashInfer FP8 scaled_mm kernel as it is incompatible with
+    # async TP patterns. No-op on H100 (kernel requires CC >= 100).
+    monkeypatch.setenv("VLLM_DISABLED_KERNELS", "FlashInferFP8ScaledMMLinearKernel")
+
     model_info = HF_EXAMPLE_MODELS.find_hf_info(model_id)
     model_info.check_transformers_version(on_fail="skip")
     model_info.check_available_online(on_fail="skip")
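The test change relies on a scoped environment override: the variable is set for the duration of the test and restored afterwards. The following is a minimal sketch of that idea using only the standard library (`unittest.mock.patch.dict` stands in for pytest's `monkeypatch.setenv`). The helpers `disabled_kernels` and `capability_as_int` are hypothetical and not part of vLLM. The sketch assumes the variable is read as a comma-separated list and that "CC >= 100" encodes compute capability as major * 10 + minor.

```python
import os
from unittest import mock

ENV_VAR = "VLLM_DISABLED_KERNELS"
KERNEL = "FlashInferFP8ScaledMMLinearKernel"


def disabled_kernels() -> list[str]:
    # Assumption: the variable holds a comma-separated list of kernel names.
    raw = os.environ.get(ENV_VAR, "")
    return [name.strip() for name in raw.split(",") if name.strip()]


def capability_as_int(major: int, minor: int) -> int:
    # Assumption: "CC >= 100" encodes compute capability as major * 10 + minor.
    return major * 10 + minor


def run_with_kernel_disabled() -> list[str]:
    # patch.dict restores os.environ when the block exits, comparable to
    # monkeypatch.setenv being undone at test teardown.
    with mock.patch.dict(os.environ, {ENV_VAR: KERNEL}):
        return disabled_kernels()


if __name__ == "__main__":
    print(run_with_kernel_disabled())
    print(capability_as_int(9, 0) >= 100)   # H100, compute capability 9.0
    print(capability_as_int(10, 0) >= 100)  # B200, compute capability 10.0
```

Under these assumptions, the override only matters on hardware that meets the capability threshold, which is consistent with the diff comment describing it as a no-op on H100.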