Unverified Commit 2b0879bf authored by fzyzcjy's avatar fzyzcjy Committed by GitHub
Browse files

Super tiny little typo fix (#10633)

parent ed46f143
...@@ -4,7 +4,7 @@ FP8 E5M2 KV Cache ...@@ -4,7 +4,7 @@ FP8 E5M2 KV Cache
================== ==================
The int8/int4 quantization scheme requires additional scale GPU memory storage, which reduces the expected GPU memory benefits. The int8/int4 quantization scheme requires additional scale GPU memory storage, which reduces the expected GPU memory benefits.
The FP8 data format retains 2~3 mantissa bits and can convert float/fp16/bflaot16 and fp8 to each other. The FP8 data format retains 2~3 mantissa bits and can convert float/fp16/bfloat16 and fp8 to each other.
Here is an example of how to enable this feature: Here is an example of how to enable this feature:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment