Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
bc0a5a0c
Unverified
Commit
bc0a5a0c
authored
Dec 24, 2025
by
Vadim Gimpelson
Committed by
GitHub
Dec 23, 2025
Browse files
[CI] Add Qwen3-Next-FP8 to Blackwell model tests (#31049)
Signed-off-by:
Vadim Gimpelson
<
vadim.gimpelson@gmail.com
>
parent
bfa2c0bb
Changes
4
Hide whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
14 additions
and
0 deletions
+14
-0
tests/evals/gsm8k/configs/Qwen3-Next-FP8-EP2.yaml
tests/evals/gsm8k/configs/Qwen3-Next-FP8-EP2.yaml
+11
-0
tests/evals/gsm8k/configs/models-blackwell.txt
tests/evals/gsm8k/configs/models-blackwell.txt
+1
-0
tests/evals/gsm8k/test_gsm8k_correctness.py
tests/evals/gsm8k/test_gsm8k_correctness.py
+1
-0
tests/utils.py
tests/utils.py
+1
-0
No files found.
tests/evals/gsm8k/configs/Qwen3-Next-FP8-EP2.yaml
0 → 100644
View file @
bc0a5a0c
model_name
:
"
Qwen/Qwen3-Next-80B-A3B-Instruct-FP8"
accuracy_threshold
:
0.85
num_questions
:
1319
num_fewshot
:
5
server_args
:
>-
--max-model-len 4096
--tensor-parallel-size 2
--enable-expert-parallel
--async-scheduling
env
:
VLLM_USE_FLASHINFER_MOE_FP8
:
"
1"
tests/evals/gsm8k/configs/models-blackwell.txt
View file @
bc0a5a0c
...
...
@@ -4,3 +4,4 @@ Qwen1.5-MoE-W4A16-CT.yaml
DeepSeek-V2-Lite-Instruct-FP8.yaml
Qwen3-30B-A3B-NVFP4.yaml
Qwen3-Next-80B-A3B-NVFP4-EP2.yaml
Qwen3-Next-FP8-EP2.yaml
tests/evals/gsm8k/test_gsm8k_correctness.py
View file @
bc0a5a0c
...
...
@@ -71,6 +71,7 @@ def test_gsm8k_correctness(config_filename):
print
(
f
"Number of questions:
{
eval_config
[
'num_questions'
]
}
"
)
print
(
f
"Number of few-shot examples:
{
eval_config
[
'num_fewshot'
]
}
"
)
print
(
f
"Server args:
{
' '
.
join
(
server_args
)
}
"
)
print
(
f
"Environment variables:
{
env_dict
}
"
)
# Launch server and run evaluation
with
RemoteOpenAIServer
(
...
...
tests/utils.py
View file @
bc0a5a0c
...
...
@@ -106,6 +106,7 @@ class RemoteOpenAIServer:
env
.
update
(
env_dict
)
serve_cmd
=
[
"vllm"
,
"serve"
,
model
,
*
vllm_serve_args
]
print
(
f
"Launching RemoteOpenAIServer with:
{
' '
.
join
(
serve_cmd
)
}
"
)
print
(
f
"Environment variables:
{
env
}
"
)
self
.
proc
:
subprocess
.
Popen
=
subprocess
.
Popen
(
serve_cmd
,
env
=
env
,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment