Update README.md

Commit d76125bf (unverified), authored Sep 13, 2023 by Casper, committed by GitHub Sep 13, 2023

Showing with 1 addition and 1 deletion

README.md +1 -1

--- a/README.md
+++ b/README.md
@@ -74,7 +74,7 @@ Under examples, you can find examples of how to quantize, run inference, and ben

 ### INT4 GEMM vs INT4 GEMV vs FP16

-There are two versions of AWQ: GEMM and GEMV. Both names to how matrix multiplication runs under the hood. We suggest the following:
+There are two versions of AWQ: GEMM and GEMV. Both names relate to how matrix multiplication runs under the hood. We suggest the following:

 - GEMV (quantized): Best for small context, batch size 1, highest number of tokens/s.
 - GEMM (quantized): Best for larger context, up to batch size 8, faster than GEMV on batch size > 1, slower than GEMV on batch size = 1.
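
Note on the hunk above: the batch-size guidance in the two context bullets can be sketched as a small selection helper. This is only an illustration of the rule as written in the README lines shown here. The function name `suggest_awq_version` is hypothetical and not part of any library, and the sketch ignores context length, which the README also mentions as a factor.

```python
from typing import Optional


def suggest_awq_version(batch_size: int) -> Optional[str]:
    """Hypothetical helper: map a batch size to the kernel the README suggests."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    if batch_size == 1:
        # README: GEMV gives the highest number of tokens/s at batch size 1.
        return "GEMV"
    if batch_size <= 8:
        # README: GEMM is faster than GEMV for batch size > 1, up to batch size 8.
        return "GEMM"
    # The lines shown in this hunk give no suggestion beyond batch size 8.
    return None


if __name__ == "__main__":
    for bs in (1, 4, 8, 16):
        print(bs, suggest_awq_version(bs))
```

Returning `None` above batch size 8 is a deliberate choice: the visible lines stop at that bound, so the sketch does not guess what the README recommends beyond it.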