[Doc] Update supported_hardware.rst (#7276)

6d944202 · Michael Goin · GitHub · fc1493a0 · 6d944202
Unverified Commit 6d944202 authored Aug 07, 2024 by Michael Goin Committed by GitHub Aug 07, 2024
Show whitespace changes
Inline Side-by-side

Showing with 15 additions and 13 deletions

docs/source/quantization/supported_hardware.rst docs/source/quantization/supported_hardware.rst +15 -13

No files found.
--- a/docs/source/quantization/supported_hardware.rst
+++ b/docs/source/quantization/supported_hardware.rst
@@ -5,18 +5,20 @@ Supported Hardware for Quantization Kernels

 The table below shows the compatibility of various quantization implementations with different hardware platforms in vLLM:

-==============  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========
+=====================  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========
 Implementation         Volta   Turing   Ampere   Ada    Hopper  AMD GPU  Intel GPU  x86 CPU  AWS Inferentia  Google TPU
-==============  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========
-AQLM            ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+=====================  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========
 AWQ                    ❌      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-DeepSpeedFP     ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-FP8             ❌      ❌       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-Marlin          ❌      ❌       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
 GPTQ                   ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-SqueezeLLM      ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+Marlin (GPTQ/AWQ/FP8)  ❌      ❌       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+INT8 (W8A8)            ❌      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+FP8 (W8A8)             ❌      ❌       ❌       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+AQLM                   ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
 bitsandbytes           ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-==============  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========
+DeepSpeedFP            ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+GGUF                   ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+SqueezeLLM             ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+=====================  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========

 Notes:
 ^^^^^^