[bugfix]fix empty prompts for async-engine mode in benchmark throughput (#27494)

Signed-off-by: Lucia Fang <fanglu@fb.com>

[bugfix]fix empty prompts for async-engine mode in benchmark throughput (#27494)
Signed-off-by: Lucia Fang <fanglu@fb.com>
315b860a · Lucia Fang · GitHub · 87c41c26 · 315b860a
Unverified Commit 315b860a authored Oct 26, 2025 by Lucia Fang Committed by GitHub Oct 26, 2025
Show whitespace changes
Inline Side-by-side

Showing with 1 addition and 0 deletions

vllm/benchmarks/throughput.py vllm/benchmarks/throughput.py +1 -0

No files found.
--- a/vllm/benchmarks/throughput.py
+++ b/vllm/benchmarks/throughput.py
@@ -221,6 +221,7 @@ async def run_vllm_async(
                    detokenize=not disable_detokenize,
                )
            )
+            prompts.append(prompt)
            lora_requests.append(request.lora_request)
        generators = []