"vscode:/vscode.git/clone" did not exist on "73e3949d07cb11f0f9ebfb85cf0ee39d29db6be4"
Unverified Commit 8a579408 authored by minmin's avatar minmin Committed by GitHub
Browse files

[Misc] Update benchmark_prefix_caching.py fixed example usage (#11920)


Signed-off-by: default avatarRen MinMin <renmm6@chinaunicom.cn>
Co-authored-by: default avatarRen MinMin <renmm6@chinaunicom.cn>
parent 46fa98cc
...@@ -10,7 +10,8 @@ Fixed example usage: ...@@ -10,7 +10,8 @@ Fixed example usage:
--model meta-llama/Llama-2-7b-chat-hf \ --model meta-llama/Llama-2-7b-chat-hf \
--enable-prefix-caching \ --enable-prefix-caching \
--num-prompts 1 \ --num-prompts 1 \
--repeat-count 100 --repeat-count 100 \
--input-length-range 128:256
ShareGPT example usage: ShareGPT example usage:
# This command samples 20 prompts with input lengths # This command samples 20 prompts with input lengths
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment