[Doc] Installed version of llmcompressor for int8/fp8 quantization (#11103)

Signed-off-by: Guangda Liu <bingps@users.noreply.github.com>
Co-authored-by: Guangda Liu <bingps@users.noreply.github.com>

Commit fd222206 authored Dec 11, 2024 by bingps; committed by GitHub Dec 11, 2024

Showing 2 changed files with 3 additions and 3 deletions

docs/source/quantization/fp8.rst +1 -1

docs/source/quantization/int8.rst +2 -2

--- a/docs/source/quantization/fp8.rst
+++ b/docs/source/quantization/fp8.rst
@@ -45,7 +45,7 @@ To produce performant FP8 quantized models with vLLM, you'll need to install the

 .. code-block:: console

-   $ pip install llmcompressor==0.1.0
+   $ pip install llmcompressor

 Quantization Process
 --------------------

--- a/docs/source/quantization/int8.rst
+++ b/docs/source/quantization/int8.rst
@@ -19,7 +19,7 @@ To use INT8 quantization with vLLM, you'll need to install the `llm-compressor <

 .. code-block:: console

-   $ pip install llmcompressor==0.1.0
+   $ pip install llmcompressor

 Quantization Process
 --------------------