Revert "[Doc] Update supported_hardware.rst (#7276)" (#7467)

e20233d3 · Woosuk Kwon · GitHub · d6e634f3 · e20233d3
Unverified Commit e20233d3 authored Aug 13, 2024 by Woosuk Kwon Committed by GitHub Aug 13, 2024
Show whitespace changes
Inline Side-by-side

Showing with 13 additions and 15 deletions

docs/source/quantization/supported_hardware.rst docs/source/quantization/supported_hardware.rst +13 -15

No files found.
--- a/docs/source/quantization/supported_hardware.rst
+++ b/docs/source/quantization/supported_hardware.rst
@@ -5,20 +5,18 @@ Supported Hardware for Quantization Kernels

 The table below shows the compatibility of various quantization implementations with different hardware platforms in vLLM:

-=====================  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========
+==============  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========
 Implementation  Volta   Turing   Ampere   Ada    Hopper  AMD GPU  Intel GPU  x86 CPU  AWS Inferentia  Google TPU
-=====================  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========
-AWQ                    ❌      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-GPTQ                   ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-Marlin (GPTQ/AWQ/FP8)  ❌      ❌       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-INT8 (W8A8)            ❌      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-FP8 (W8A8)             ❌      ❌       ❌       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+==============  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========
 AQLM            ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-bitsandbytes           ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+AWQ             ❌      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
 DeepSpeedFP     ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-GGUF                   ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+FP8             ❌      ❌       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+Marlin          ❌      ❌       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+GPTQ            ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
 SqueezeLLM      ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
-=====================  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========
+bitsandbytes    ✅      ✅       ✅       ✅     ✅      ❌        ❌         ❌       ❌              ❌
+==============  ======  =======  =======  =====  ======  =======  =========  =======  ==============  ==========

 Notes:
 ^^^^^^