Unverified Commit 781dc944 authored by Casper's avatar Casper Committed by GitHub
Browse files

Fix typo in benchmark command (#161)

parent 5c7e3682
...@@ -212,7 +212,7 @@ These benchmarks showcase the speed and memory usage of processing context (pref ...@@ -212,7 +212,7 @@ These benchmarks showcase the speed and memory usage of processing context (pref
- Tested with AutoAWQ version 0.1.6 - Tested with AutoAWQ version 0.1.6
- GPU: RTX 4090 (AMD Ryzen 9 7950X) - GPU: RTX 4090 (AMD Ryzen 9 7950X)
- Command: `python examples/benchmark --model_path <hf_model> --batch_size 1` - Command: `python examples/benchmark.py --model_path <hf_model> --batch_size 1`
- 🟢 for GEMV, 🔵 for GEMM, 🔴 for avoid using - 🟢 for GEMV, 🔵 for GEMM, 🔴 for avoid using
| Model Name | Size | Version | Batch Size | Prefill Length | Decode Length | Prefill tokens/s | Decode tokens/s | Memory (VRAM) | | Model Name | Size | Version | Batch Size | Prefill Length | Decode Length | Prefill tokens/s | Decode tokens/s | Memory (VRAM) |
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment