OpenDAS
vllm_cscc
Commit 8e1fd5ba
Authored Mar 03, 2026 by Matthew Bonanni
Committed by GitHub, Mar 03, 2026

[CI] Bump `num_speculative_tokens` to 3 in nightly DeepSeek tests (#35882)

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>

Parent: ae88468b
Showing 4 changed files with 4 additions and 4 deletions

tests/evals/gsm8k/configs/DeepSeek-R1-DP.yaml    +1 -1
tests/evals/gsm8k/configs/DeepSeek-R1-TP.yaml    +1 -1
tests/evals/gsm8k/configs/DeepSeek-V3.2-DP.yaml  +1 -1
tests/evals/gsm8k/configs/DeepSeek-V3.2-TP.yaml  +1 -1

tests/evals/gsm8k/configs/DeepSeek-R1-DP.yaml
@@ -8,4 +8,4 @@ server_args: >-
   --max-model-len 4096
   --data-parallel-size 8
   --enable-expert-parallel
-  --speculative-config '{"method":"mtp","num_speculative_tokens":1}'
+  --speculative-config '{"method":"mtp","num_speculative_tokens":3}'

tests/evals/gsm8k/configs/DeepSeek-R1-TP.yaml
@@ -8,4 +8,4 @@ server_args: >-
   --max-model-len 4096
   --tensor-parallel-size 8
   --enable-expert-parallel
-  --speculative-config '{"method":"mtp","num_speculative_tokens":1}'
+  --speculative-config '{"method":"mtp","num_speculative_tokens":3}'

tests/evals/gsm8k/configs/DeepSeek-V3.2-DP.yaml
@@ -8,4 +8,4 @@ server_args: >-
   --max-model-len 4096
   --data-parallel-size 8
   --enable-expert-parallel
-  --speculative-config '{"method":"mtp","num_speculative_tokens":1}'
+  --speculative-config '{"method":"mtp","num_speculative_tokens":3}'

tests/evals/gsm8k/configs/DeepSeek-V3.2-TP.yaml
@@ -8,4 +8,4 @@ server_args: >-
   --max-model-len 4096
   --tensor-parallel-size 8
   --enable-expert-parallel
-  --speculative-config '{"method":"mtp","num_speculative_tokens":1}'
+  --speculative-config '{"method":"mtp","num_speculative_tokens":3}'
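
The value passed to `--speculative-config` in all four configs is a JSON object embedded in a shell-quoted argument string. As a rough illustration of how the bumped value could be checked, the sketch below splits such a string with shell quoting rules and parses the JSON payload. The helper name `speculative_config` and the `SERVER_ARGS` sample are hypothetical and written for this example; they are not part of the commit or of vLLM's test harness.

```python
import json
import shlex


def speculative_config(server_args: str) -> dict:
    """Return the JSON payload that follows --speculative-config, or {} if absent.

    Hypothetical helper for illustration only; not vLLM code.
    """
    tokens = shlex.split(server_args)  # honours the single quotes around the JSON
    for i, tok in enumerate(tokens):
        if tok == "--speculative-config" and i + 1 < len(tokens):
            return json.loads(tokens[i + 1])
    return {}


# Sample argument string modelled on the DP configs after this commit.
SERVER_ARGS = (
    "--max-model-len 4096 --data-parallel-size 8 --enable-expert-parallel "
    """--speculative-config '{"method":"mtp","num_speculative_tokens":3}'"""
)

config = speculative_config(SERVER_ARGS)
print(config["method"], config["num_speculative_tokens"])
```

Using `shlex.split` rather than a plain whitespace split keeps the quoted JSON together as one token, which matters if the payload ever contains spaces.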