Unverified commit 4693a343, authored by Calvin Chen, committed by GitHub

[Doc] cleanup deprecated flag for doc (#18715)


Signed-off-by: calvin chen <120380290@qq.com>
parent bbd9a84d
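
As a minimal sketch of the form this commit moves the docs to: the two removed flags (`--speculative-model` and `--num_speculative_tokens`) become keys of a single JSON value passed to `--speculative_config`. The variable names `SPEC_CONFIG` and `CMD` are illustrative and not part of the commit, and the sketch assumes the JSON value is meant to be closed with a matching single quote. It prints the command for inspection rather than running it.

```shell
# Illustrative only: SPEC_CONFIG and CMD are hypothetical names, not from the commit.
# Keep the JSON in one single-quoted string so the shell treats it as a single argument.
SPEC_CONFIG='{"model": "[ngram]", "num_speculative_tokens": 5}'

# Assemble the command as text; the inner single quotes are literal inside double quotes.
CMD="vllm serve meta-llama/Meta-Llama-3-8B-Instruct --speculative_config '$SPEC_CONFIG'"

# Print instead of executing, since vllm may not be installed where this is run.
echo "$CMD"
```

Holding the JSON in a variable makes an unbalanced quote easy to spot before the command is pasted into a terminal.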
@@ -146,10 +146,9 @@ python3 vllm/benchmarks/benchmark_serving.py \
 ``` bash
 VLLM_USE_V1=1 vllm serve meta-llama/Meta-Llama-3-8B-Instruct \
---speculative-model "[ngram]" \
 --ngram_prompt_lookup_min 2 \
 --ngram-prompt-lookup-max 5 \
---num_speculative_tokens 5
+--speculative_config '{"model": "[ngram]", "num_speculative_tokens": 5}
 ```
 ``` bash
@@ -274,10 +273,9 @@ python3 vllm/benchmarks/benchmark_throughput.py \
 --output-len=100 \
 --num-prompts=2048 \
 --async-engine \
---speculative-model="[ngram]" \
 --ngram_prompt_lookup_min=2 \
 --ngram-prompt-lookup-max=5 \
---num_speculative_tokens=5
+--speculative_config '{"model": "[ngram]", "num_speculative_tokens": 5}
 ```
 ```
......