Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
6acb7a62
Unverified
Commit
6acb7a62
authored
May 30, 2025
by
Rabi Mishra
Committed by
GitHub
May 30, 2025
Browse files
[Misc]Fix benchmarks/README.md for speculative decoding (#18897)
Signed-off-by:
rabi
<
ramishra@redhat.com
>
parent
4f4a6b84
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
6 additions
and
6 deletions
+6
-6
benchmarks/README.md
benchmarks/README.md
+6
-6
No files found.
benchmarks/README.md
View file @
6acb7a62
...
@@ -146,9 +146,9 @@ python3 vllm/benchmarks/benchmark_serving.py \
...
@@ -146,9 +146,9 @@ python3 vllm/benchmarks/benchmark_serving.py \
```
bash
```
bash
VLLM_USE_V1
=
1 vllm serve meta-llama/Meta-Llama-3-8B-Instruct
\
VLLM_USE_V1
=
1 vllm serve meta-llama/Meta-Llama-3-8B-Instruct
\
--
ngram_prompt_lookup_min
2
\
--
speculative-config
$'{"method": "ngram",
--ngram-
prompt
-
lookup
-
max
5
\
"num_speculative_tokens": 5, "
prompt
_
lookup
_
max
": 5,
--speculative_config
'{"model": "[ngram]", "num_speculative_tokens": 5}
"prompt_lookup_min": 2}'
```
```
```
bash
```
bash
...
@@ -273,9 +273,9 @@ python3 vllm/benchmarks/benchmark_throughput.py \
...
@@ -273,9 +273,9 @@ python3 vllm/benchmarks/benchmark_throughput.py \
--output-len
=
100
\
--output-len
=
100
\
--num-prompts
=
2048
\
--num-prompts
=
2048
\
--async-engine
\
--async-engine
\
--
ngram_prompt_lookup_min
=
2
\
--
speculative-config
$'{"method": "ngram",
--ngram-
prompt
-
lookup
-
max
=
5
\
"num_speculative_tokens": 5, "
prompt
_
lookup
_
max
": 5,
--speculative_config
'{"model": "[ngram]", "num_speculative_tokens": 5}
"prompt_lookup_min": 2}'
```
```
```
```
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment