Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
bfa59be8
Unverified
Commit
bfa59be8
authored
Oct 22, 2025
by
Nicolò Lucchesi
Committed by
GitHub
Oct 22, 2025
Browse files
[CI] Nixl integration tests DP-EP (#27199)
Signed-off-by:
NickLucche
<
nlucches@redhat.com
>
parent
265ecb05
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
10 additions
and
1 deletion
+10
-1
tests/v1/kv_connector/nixl_integration/run_accuracy_test.sh
tests/v1/kv_connector/nixl_integration/run_accuracy_test.sh
+9
-1
tests/v1/kv_connector/nixl_integration/tp_config_sweep_accuracy_test.sh
...nnector/nixl_integration/tp_config_sweep_accuracy_test.sh
+1
-0
No files found.
tests/v1/kv_connector/nixl_integration/run_accuracy_test.sh
View file @
bfa59be8
...
...
@@ -178,8 +178,16 @@ run_tests_for_model() {
--port
$PORT
\
--enforce-eager
\
--gpu-memory-utilization
$GPU_MEMORY_UTILIZATION
\
--tensor-parallel-size
$DECODER_TP_SIZE
\
--kv-transfer-config '
$KV_CONFIG
'"
# DP-EP attention mode
if
[[
-z
"
$DP_EP
"
]]
;
then
BASE_CMD
=
"
${
BASE_CMD
}
--tensor-parallel-size
$DECODER_TP_SIZE
"
else
echo
"DP-EP Attention enabled, deploying with dp=DECODER_TP_SIZE and tp=1"
BASE_CMD
=
"
${
BASE_CMD
}
--data-parallel-size
$DECODER_TP_SIZE
\
--tensor-parallel-size 1 --enable-expert-parallel"
fi
if
[
-n
"
$model_args
"
]
;
then
FULL_CMD
=
"
$BASE_CMD
$model_args
"
...
...
tests/v1/kv_connector/nixl_integration/tp_config_sweep_accuracy_test.sh
View file @
bfa59be8
...
...
@@ -10,6 +10,7 @@ configs=(
"GPU_MEMORY_UTILIZATION=0.6 PREFILLER_TP_SIZE=1 DECODER_TP_SIZE=2"
"GPU_MEMORY_UTILIZATION=0.8 MODEL_NAMES=deepseek-ai/deepseek-vl2-tiny"
# MLA case
"GPU_MEMORY_UTILIZATION=0.8 PREFILLER_TP_SIZE=1 DECODER_TP_SIZE=2 MODEL_NAMES=deepseek-ai/deepseek-vl2-tiny"
"DP_EP=1 GPU_MEMORY_UTILIZATION=0.8 PREFILLER_TP_SIZE=1 DECODER_TP_SIZE=2 MODEL_NAMES=deepseek-ai/deepseek-vl2-tiny"
# MLA+P-TP1, D-DPEP=2 (TP=1)
)
run_tests
()
{
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment