Commit fd222206 authored by bingps, committed by GitHub

[Doc] Installed version of llmcompressor for int8/fp8 quantization (#11103)


Signed-off-by: Guangda Liu <bingps@users.noreply.github.com>
Co-authored-by: Guangda Liu <bingps@users.noreply.github.com>
parent b2f77545
@@ -45,7 +45,7 @@ To produce performant FP8 quantized models with vLLM, you'll need to install the
 .. code-block:: console
-$ pip install llmcompressor==0.1.0
+$ pip install llmcompressor
 Quantization Process
 --------------------
@@ -19,7 +19,7 @@ To use INT8 quantization with vLLM, you'll need to install the `llm-compressor <
 .. code-block:: console
-$ pip install llmcompressor==0.1.0
+$ pip install llmcompressor
 Quantization Process
 --------------------
@@ -142,4 +142,4 @@ Best Practices
 Troubleshooting and Support
 ---------------------------
-If you encounter any issues or have feature requests, please open an issue on the ``vllm-project/llm-compressor`` GitHub repository.
\ No newline at end of file
+If you encounter any issues or have feature requests, please open an issue on the ``vllm-project/llm-compressor`` GitHub repository.
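
Because this change drops the `==0.1.0` pin, `pip install llmcompressor` resolves to whichever release is current at install time. A minimal sketch for checking which version ended up in an environment, assuming only the Python standard library; the helper name `installed_version` is hypothetical and is not part of vLLM or llm-compressor:

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version string of a distribution, or None if absent.

    `installed_version` is an illustrative helper, not a vLLM or
    llm-compressor API.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("llmcompressor")
    if found is None:
        print("llmcompressor is not installed; run: pip install llmcompressor")
    else:
        print(f"llmcompressor {found} is installed")
```

Recording the reported version alongside a quantized checkpoint may help when reproducing results later, since an unpinned install can differ between machines.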