Super tiny little typo fix (#10633)

2b0879bf · fzyzcjy · GitHub · ed46f143 · 2b0879bf
Unverified Commit 2b0879bf authored Nov 25, 2024 by fzyzcjy Committed by GitHub Nov 25, 2024

Showing with 1 addition and 1 deletion

docs/source/quantization/fp8_e5m2_kvcache.rst +1 -1

--- a/docs/source/quantization/fp8_e5m2_kvcache.rst
+++ b/docs/source/quantization/fp8_e5m2_kvcache.rst
@@ -4,7 +4,7 @@ FP8 E5M2 KV Cache
 ==================

 The int8/int4 quantization scheme requires additional scale GPU memory storage, which reduces the expected GPU memory benefits.
-The FP8 data format retains 2~3 mantissa bits and can convert float/fp16/bflaot16 and fp8 to each other.
+The FP8 data format retains 2~3 mantissa bits and can convert float/fp16/bfloat16 and fp8 to each other.

 Here is an example of how to enable this feature: