Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
56de443d
Unverified
Commit
56de443d
authored
Apr 05, 2026
by
Kevin H. Luu
Committed by
GitHub
Apr 05, 2026
Browse files
[ci] Switch some CI jobs to H200 MIG slices (#38956)
parent
4dd49b06
Changes
14
Show whitespace changes
Inline
Side-by-side
Showing
14 changed files
with
25 additions
and
0 deletions
+25
-0
.buildkite/test_areas/basic_correctness.yaml
.buildkite/test_areas/basic_correctness.yaml
+1
-0
.buildkite/test_areas/benchmarks.yaml
.buildkite/test_areas/benchmarks.yaml
+1
-0
.buildkite/test_areas/cuda.yaml
.buildkite/test_areas/cuda.yaml
+1
-0
.buildkite/test_areas/engine.yaml
.buildkite/test_areas/engine.yaml
+2
-0
.buildkite/test_areas/entrypoints.yaml
.buildkite/test_areas/entrypoints.yaml
+2
-0
.buildkite/test_areas/expert_parallelism.yaml
.buildkite/test_areas/expert_parallelism.yaml
+1
-0
.buildkite/test_areas/kernels.yaml
.buildkite/test_areas/kernels.yaml
+1
-0
.buildkite/test_areas/misc.yaml
.buildkite/test_areas/misc.yaml
+2
-0
.buildkite/test_areas/models_basic.yaml
.buildkite/test_areas/models_basic.yaml
+1
-0
.buildkite/test_areas/models_language.yaml
.buildkite/test_areas/models_language.yaml
+2
-0
.buildkite/test_areas/models_multimodal.yaml
.buildkite/test_areas/models_multimodal.yaml
+4
-0
.buildkite/test_areas/pytorch.yaml
.buildkite/test_areas/pytorch.yaml
+2
-0
.buildkite/test_areas/ray_compat.yaml
.buildkite/test_areas/ray_compat.yaml
+1
-0
.buildkite/test_areas/spec_decode.yaml
.buildkite/test_areas/spec_decode.yaml
+4
-0
No files found.
.buildkite/test_areas/basic_correctness.yaml
View file @
56de443d
...
...
@@ -4,6 +4,7 @@ depends_on:
steps
:
-
label
:
Basic Correctness
timeout_in_minutes
:
30
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/basic_correctness/test_basic_correctness
...
...
.buildkite/test_areas/benchmarks.yaml
View file @
56de443d
...
...
@@ -4,6 +4,7 @@ depends_on:
steps
:
-
label
:
Benchmarks CLI Test
timeout_in_minutes
:
20
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/benchmarks/
...
...
.buildkite/test_areas/cuda.yaml
View file @
56de443d
...
...
@@ -4,6 +4,7 @@ depends_on:
steps
:
-
label
:
Platform Tests (CUDA)
timeout_in_minutes
:
15
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/cuda
...
...
.buildkite/test_areas/engine.yaml
View file @
56de443d
...
...
@@ -4,6 +4,7 @@ depends_on:
steps
:
-
label
:
Engine
timeout_in_minutes
:
15
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/engine
...
...
@@ -25,6 +26,7 @@ steps:
-
label
:
e2e Scheduling (1 GPU)
timeout_in_minutes
:
30
device
:
h200_18gb
source_file_dependencies
:
-
vllm/v1/
-
tests/v1/e2e/general/
...
...
.buildkite/test_areas/entrypoints.yaml
View file @
56de443d
...
...
@@ -61,6 +61,7 @@ steps:
-
label
:
Entrypoints Integration (API Server openai - Part 3)
timeout_in_minutes
:
50
device
:
h200_18gb
working_dir
:
"
/vllm-workspace/tests"
source_file_dependencies
:
-
vllm/
...
...
@@ -105,6 +106,7 @@ steps:
-
label
:
OpenAI API Correctness
timeout_in_minutes
:
30
device
:
h200_18gb
source_file_dependencies
:
-
csrc/
-
vllm/entrypoints/openai/
...
...
.buildkite/test_areas/expert_parallelism.yaml
View file @
56de443d
...
...
@@ -4,6 +4,7 @@ depends_on:
steps
:
-
label
:
EPLB Algorithm
timeout_in_minutes
:
15
device
:
h200_18gb
working_dir
:
"
/vllm-workspace/tests"
source_file_dependencies
:
-
vllm/distributed/eplb
...
...
.buildkite/test_areas/kernels.yaml
View file @
56de443d
...
...
@@ -4,6 +4,7 @@ depends_on:
steps
:
-
label
:
vLLM IR Tests
timeout_in_minutes
:
10
device
:
h200_18gb
working_dir
:
"
/vllm-workspace/"
source_file_dependencies
:
-
vllm/ir
...
...
.buildkite/test_areas/misc.yaml
View file @
56de443d
...
...
@@ -19,6 +19,7 @@ steps:
-
label
:
V1 Sample + Logits
timeout_in_minutes
:
30
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/v1/sample
...
...
@@ -86,6 +87,7 @@ steps:
-
label
:
Regression
timeout_in_minutes
:
20
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/test_regression
...
...
.buildkite/test_areas/models_basic.yaml
View file @
56de443d
...
...
@@ -4,6 +4,7 @@ depends_on:
steps
:
-
label
:
Basic Models Tests (Initialization)
timeout_in_minutes
:
45
device
:
h200_18gb
torch_nightly
:
true
source_file_dependencies
:
-
vllm/
...
...
.buildkite/test_areas/models_language.yaml
View file @
56de443d
...
...
@@ -67,6 +67,7 @@ steps:
-
label
:
Language Models Test (PPL)
timeout_in_minutes
:
110
device
:
h200_18gb
optional
:
true
source_file_dependencies
:
-
vllm/
...
...
@@ -90,6 +91,7 @@ steps:
-
label
:
Language Models Test (MTEB)
timeout_in_minutes
:
110
device
:
h200_18gb
optional
:
true
source_file_dependencies
:
-
vllm/
...
...
.buildkite/test_areas/models_multimodal.yaml
View file @
56de443d
...
...
@@ -4,6 +4,7 @@ depends_on:
steps
:
-
label
:
"
Multi-Modal
Models
(Standard)
1:
qwen2"
timeout_in_minutes
:
45
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/models/multimodal
...
...
@@ -19,6 +20,7 @@ steps:
-
label
:
"
Multi-Modal
Models
(Standard)
2:
qwen3
+
gemma"
timeout_in_minutes
:
45
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/models/multimodal
...
...
@@ -77,6 +79,7 @@ steps:
-
label
:
Multi-Modal Processor
# 44min
timeout_in_minutes
:
60
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/models/multimodal
...
...
@@ -131,6 +134,7 @@ steps:
-
label
:
Multi-Modal Models (Extended Pooling)
optional
:
true
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/models/multimodal/pooling
...
...
.buildkite/test_areas/pytorch.yaml
View file @
56de443d
...
...
@@ -49,6 +49,7 @@ steps:
-
label
:
PyTorch Fullgraph
timeout_in_minutes
:
30
device
:
h200_18gb
source_file_dependencies
:
-
vllm/
-
tests/compile
...
...
@@ -60,6 +61,7 @@ steps:
# if this test fails, it means the nightly torch version is not compatible with some
# of the dependencies. Please check the error message and add the package to whitelist
# in /vllm/tools/pre_commit/generate_nightly_torch_test.py
device
:
h200_18gb
soft_fail
:
true
source_file_dependencies
:
-
requirements/nightly_torch_test.txt
...
...
.buildkite/test_areas/ray_compat.yaml
View file @
56de443d
...
...
@@ -7,6 +7,7 @@ steps:
# If this fails, it means the PR introduces a dependency that
# conflicts with Ray's dependency constraints.
# See https://github.com/vllm-project/vllm/issues/33599
device
:
h200_18gb
soft_fail
:
true
timeout_in_minutes
:
10
source_file_dependencies
:
...
...
.buildkite/test_areas/spec_decode.yaml
View file @
56de443d
...
...
@@ -4,6 +4,7 @@ depends_on:
steps
:
-
label
:
Spec Decode Eagle
timeout_in_minutes
:
30
device
:
h200_18gb
source_file_dependencies
:
-
vllm/v1/spec_decode/
-
vllm/v1/worker/gpu/spec_decode/
...
...
@@ -13,6 +14,7 @@ steps:
-
label
:
Spec Decode Speculators + MTP
timeout_in_minutes
:
30
device
:
h200_18gb
source_file_dependencies
:
-
vllm/v1/spec_decode/
-
vllm/v1/worker/gpu/spec_decode/
...
...
@@ -23,6 +25,7 @@ steps:
-
label
:
Spec Decode Ngram + Suffix
timeout_in_minutes
:
30
device
:
h200_18gb
source_file_dependencies
:
-
vllm/v1/spec_decode/
-
vllm/v1/worker/gpu/spec_decode/
...
...
@@ -32,6 +35,7 @@ steps:
-
label
:
Spec Decode Draft Model
timeout_in_minutes
:
30
device
:
h200_18gb
source_file_dependencies
:
-
vllm/v1/spec_decode/
-
vllm/v1/worker/gpu/spec_decode/
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment