OpenDAS / vllm_cscc
Commit 9a3101b2 (Unverified)
Authored Nov 21, 2025 by Charlie Fu
Committed by GitHub, Nov 21, 2025
[Rocm][CI] Fix DeekSeek V2-Lite Accuracy CI (#29135)
Signed-off-by: charlifu <charlifu@amd.com>
parent d5dbdbfc
Showing 2 changed files with 21 additions and 2 deletions
.buildkite/scripts/scheduled_integration_test/deepseek_v2_lite_ep_eplb.sh +11 -1
.buildkite/scripts/scheduled_integration_test/qwen30b_a3b_fp8_block_ep.sh +10 -1
.buildkite/scripts/scheduled_integration_test/deepseek_v2_lite_ep_eplb.sh
...
@@ -17,7 +17,17 @@ wait_for_server() {
 }
 MODEL="deepseek-ai/DeepSeek-V2-lite"
-BACKENDS=("deepep_high_throughput" "deepep_low_latency")
+# Set BACKENDS based on platform
+if command -v rocm-smi &> /dev/null || [[ -d /opt/rocm ]] || [[ -n "${ROCM_PATH:-}" ]]; then
+  # ROCm platform
+  BACKENDS=("allgather_reducescatter")
+  # Disable MOE padding for ROCm since it is causing eplb to fail
+  export VLLM_ROCM_MOE_PADDING=0
+else
+  # Non-ROCm platform (CUDA/other)
+  BACKENDS=("deepep_high_throughput" "deepep_low_latency")
+fi
 cleanup() {
   if [[ -n "${SERVER_PID:-}" ]] && kill -0 "${SERVER_PID}" 2>/dev/null; then
...
.buildkite/scripts/scheduled_integration_test/qwen30b_a3b_fp8_block_ep.sh
...
@@ -17,7 +17,16 @@ wait_for_server() {
 }
 MODEL="QWen/Qwen3-30B-A3B-FP8"
-BACKENDS=("deepep_high_throughput" "deepep_low_latency")
+# Set BACKENDS based on platform
+if command -v rocm-smi &> /dev/null || [[ -d /opt/rocm ]] || [[ -n "${ROCM_PATH:-}" ]]; then
+  # ROCm platform
+  BACKENDS=("allgather_reducescatter")
+  # Disable MOE padding for ROCm since it is causing eplb to fail
+  export VLLM_ROCM_MOE_PADDING=0
+else
+  # Non-ROCm platform (CUDA/other)
+  BACKENDS=("deepep_high_throughput" "deepep_low_latency")
+fi
 cleanup() {
   if [[ -n "${SERVER_PID:-}" ]] && kill -0 "${SERVER_PID}" 2>/dev/null; then
...
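Both scripts carry the same platform check, so the logic can be read on its own. The following is only a sketch of that check pulled into standalone functions. The names `is_rocm_platform` and `list_backends` are hypothetical and do not appear in the commit. The three ROCm signals, the backend names, and the `VLLM_ROCM_MOE_PADDING` variable are taken from the diff above.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the commit): succeeds when any of the three
# ROCm signals checked in the diff is present.
is_rocm_platform() {
  command -v rocm-smi > /dev/null 2>&1 || [ -d /opt/rocm ] || [ -n "${ROCM_PATH:-}" ]
}

# Hypothetical helper: prints the backend names for the detected platform,
# one per line, mirroring the two BACKENDS assignments in the diff.
list_backends() {
  if is_rocm_platform; then
    printf '%s\n' "allgather_reducescatter"
  else
    printf '%s\n' "deepep_high_throughput" "deepep_low_latency"
  fi
}

# Export the padding override in the current shell, as the diff does on ROCm.
if is_rocm_platform; then
  export VLLM_ROCM_MOE_PADDING=0
fi

# Iterate over the selected backends; the real scripts start a server per backend.
for backend in $(list_backends); do
  echo "would test backend: ${backend}"
done
```

Because the check treats a non-empty `ROCM_PATH` as sufficient, setting that variable forces the ROCm branch even on a machine without `rocm-smi` or `/opt/rocm`, which is convenient for a dry run but also means a stray `ROCM_PATH` on a CUDA host would select the wrong backends.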