Commit 4293c00b authored by Matthew Bonanni, committed by GitHub

[Benchmarks] Fix attention benchmark smoke test (#34269)


Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
parent 506ad7d7
@@ -22,9 +22,10 @@ steps:
   device: b200
   num_gpus: 2
   optional: true
+  working_dir: "/vllm-workspace/"
   timeout_in_minutes: 10
   source_file_dependencies:
   - benchmarks/attention_benchmarks/
   - vllm/v1/attention/
   commands:
-  - python benchmarks/attention_benchmarks/benchmark.py --backends flash flashinfer --batch-specs "8q1s1k" --repeats 1 --warmup-iters 1
+  - python3 benchmarks/attention_benchmarks/benchmark.py --backends flash flashinfer --batch-specs "8q1s1k" --repeats 1 --warmup-iters 1