Unverified Commit 5959564f authored by Kuntai Du's avatar Kuntai Du Committed by GitHub
Browse files

Doc fix in `benchmark_long_document_qa_throughput.py` (#11933)


Signed-off-by: default avatarKuntai Du <kuntai@uchicago.edu>
parent f33e033e
...@@ -2,8 +2,7 @@ ...@@ -2,8 +2,7 @@
Offline benchmark to test the long document QA throughput. Offline benchmark to test the long document QA throughput.
Example usage: Example usage:
# This command run the vllm with 50GB CPU memory for offloading # This workload samples 8 different prompts with a default input
# The workload samples 8 different prompts with a default input
# length of 20000 tokens, then replicates each prompt 2 times # length of 20000 tokens, then replicates each prompt 2 times
# in random order. # in random order.
python benchmark_long_document_qa_throughput.py \ python benchmark_long_document_qa_throughput.py \
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment