Unverified Commit 781dc944 authored by Casper's avatar Casper Committed by GitHub
Browse files

Fix typo in benchmark command (#161)

parent 5c7e3682
......@@ -212,7 +212,7 @@ These benchmarks showcase the speed and memory usage of processing context (pref
- Tested with AutoAWQ version 0.1.6
- GPU: RTX 4090 (AMD Ryzen 9 7950X)
- Command: `python examples/benchmark --model_path <hf_model> --batch_size 1`
- Command: `python examples/benchmark.py --model_path <hf_model> --batch_size 1`
- 🟢 for GEMV, 🔵 for GEMM, 🔴 for avoid using
| Model Name | Size | Version | Batch Size | Prefill Length | Decode Length | Prefill tokens/s | Decode tokens/s | Memory (VRAM) |
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment