Fix Falcon benchmark format

dc99d2fc · Casper Hansen · 7cf3c790 · dc99d2fc
Commit dc99d2fc authored Sep 13, 2023 by Casper Hansen

Showing with 5 additions and 4 deletions

README.md +5 -4

--- a/README.md
+++ b/README.md
@@ -228,10 +228,11 @@ generation_output = model.generate(
 ### Falcon 7B
-Note: Fast generation, fast context processing
+- Note: Fast generation, fast context processing
-GPU: NVIDIA GeForce RTX 3090
+- GPU: NVIDIA GeForce RTX 3090
-Command: `python examples/benchmark.py --model_path casperhansen/falcon-7b-awq --quant_file awq_model_w4_g64.pt`
+- Command: `python examples/benchmark.py --model_path casperhansen/falcon-7b-awq --quant_file awq_model_w4_g64.pt`
-Version: GEMM
+- Version: GEMM
 |   Batch Size |   Prefill Length |   Decode Length |   Prefill tokens/s |   Decode tokens/s | Memory (VRAM)    |
 |-------------:|-----------------:|----------------:|-------------------:|------------------:|:-----------------|
 |            1 |               32 |              32 |            466.826 |           95.1413 | 4.47 GB (18.88%) |