"vscode:/vscode.git/clone" did not exist on "07b39d1cb93178b093bc6f1af924781703da66a5"
Commit 2358ca52 (unverified), authored by Harry Mellor, committed by GitHub

[Doc]: Improve feature tables (#13224)


Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
parent 8cf97f86
.vertical-table-header th.head:not(.stub) {
  writing-mode: sideways-lr;
  white-space: nowrap;
  max-width: 0;
  p {
    margin: 0;
  }
}
@@ -78,8 +78,12 @@ html_theme_options = {
     'use_repository_button': True,
     'use_edit_page_button': True,
 }
+# Add any paths that contain custom static files (such as style sheets) here,
+# relative to this directory. They are copied after the builtin static files,
+# so a file named "default.css" will overwrite the builtin "default.css".
 html_static_path = ["_static"]
 html_js_files = ["custom.js"]
+html_css_files = ["custom.css"]
 myst_url_schemes = {
     'http': None,
@@ -121,11 +125,6 @@ if READTHEDOCS_VERSION_TYPE == "tag":
     if os.path.exists(header_file):
         os.remove(header_file)
-# Add any paths that contain custom static files (such as style sheets) here,
-# relative to this directory. They are copied after the builtin static files,
-# so a file named "default.css" will overwrite the builtin "default.css".
-# html_static_path = ['_static']
 # Generate additional rst documentation here.
 def setup(app):
......
@@ -4,8 +4,14 @@
 The tables below show mutually exclusive features and the support on some hardware.
+The symbols used have the following meanings:
+- ✅ = Full compatibility
+- 🟠 = Partial compatibility
+- ❌ = No compatibility
 :::{note}
-Check the '✗' with links to see tracking issue for unsupported feature/hardware combination.
+Check the ❌ or 🟠 with links to see tracking issue for unsupported feature/hardware combination.
 :::
 ## Feature x Feature
@@ -29,6 +35,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
:header-rows: 1 :header-rows: 1
:stub-columns: 1 :stub-columns: 1
:widths: auto :widths: auto
:class: vertical-table-header
- * Feature - * Feature
* [CP](#chunked-prefill) * [CP](#chunked-prefill)
@@ -48,7 +55,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* beam-search * beam-search
* <abbr title="Guided Decoding">guided dec</abbr> * <abbr title="Guided Decoding">guided dec</abbr>
- * [CP](#chunked-prefill) - * [CP](#chunked-prefill)
* *
* *
* *
* *
@@ -66,7 +73,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
- * [APC](#automatic-prefix-caching) - * [APC](#automatic-prefix-caching)
* *
* *
* *
* *
* *
@@ -82,9 +89,9 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
- * [LoRA](#lora-adapter) - * [LoRA](#lora-adapter)
* [](gh-pr:9057)
* *
* *
*
* *
* *
* *
@@ -102,7 +109,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* *
* *
* *
* *
@@ -118,9 +125,9 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
- * [SD](#spec_decode) - * [SD](#spec_decode)
* *
* *
* *
*
* *
*
* *
* *
* *
@@ -138,7 +145,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* *
* *
* *
* *
@@ -150,13 +157,13 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
- * <abbr title="Pooling Models">pooling</abbr> - * <abbr title="Pooling Models">pooling</abbr>
* *
* *
* *
* *
* *
* *
* *
* *
* *
* *
@@ -167,14 +174,14 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
- * <abbr title="Encoder-Decoder Models">enc-dec</abbr> - * <abbr title="Encoder-Decoder Models">enc-dec</abbr>
* *
* [](gh-issue:7366) * [](gh-issue:7366)
* *
* *
* [](gh-issue:7366) * [](gh-issue:7366)
*
* *
* *
*
* *
* *
* *
@@ -190,9 +197,9 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* *
*
* *
*
* *
* *
* *
@@ -205,12 +212,12 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* [](gh-pr:8199)
* *
* *
*
*
* *
* *
*
* *
* *
* *
@@ -222,49 +229,49 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* *
*
*
*
* *
*
*
* *
* *
*
* *
* *
* *
* *
* *
- * multi-step - * multi-step
* *
* *
* *
*
*
*
*
*
* *
*
* *
*
*
* *
* [](gh-issue:8198)
* *
*
* *
* *
* *
* *
- * <abbr title="Multimodal Inputs">mm</abbr> - * <abbr title="Multimodal Inputs">mm</abbr>
* *
* [](gh-pr:8348) * [🟠](gh-pr:8348)
* [](gh-pr:7199) * [🟠](gh-pr:4194)
* ? *
* ? *
* *
* *
* *
* *
* *
* *
* ? *
* *
* *
* *
* *
@@ -273,16 +280,16 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* [](gh-issue:6137) * [](gh-issue:6137)
* *
* *
* *
* *
* *
* ? *
* [](gh-issue:7968) * [](gh-issue:7968)
*
* *
*
* *
* *
- * beam-search - * beam-search
@@ -290,35 +297,35 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* [](gh-issue:6137) * [](gh-issue:6137)
* *
* *
* *
* *
* *
* ? *
* [](gh-issue:7968) * [](gh-issue:7968)
* ? *
*
* *
*
* *
- * <abbr title="Guided Decoding">guided dec</abbr> - * <abbr title="Guided Decoding">guided dec</abbr>
* *
* *
* ? *
* ? *
* [](gh-issue:11484) * [](gh-issue:11484)
* *
* *
* ? *
* *
* *
* *
* [](gh-issue:9893) * [](gh-issue:9893)
* ? *
*
* *
* *
*
::: :::
(feature-x-hardware)= (feature-x-hardware)=
@@ -339,7 +346,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* CPU * CPU
* AMD * AMD
- * [CP](#chunked-prefill) - * [CP](#chunked-prefill)
* [](gh-issue:2729) * [](gh-issue:2729)
* *
* *
* *
@@ -347,7 +354,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
- * [APC](#automatic-prefix-caching) - * [APC](#automatic-prefix-caching)
* [](gh-issue:3687) * [](gh-issue:3687)
* *
* *
* *
@@ -368,7 +375,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* [](gh-issue:8475) * [](gh-issue:8475)
* *
- * [SD](#spec_decode) - * [SD](#spec_decode)
* *
@@ -384,7 +391,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* *
* *
- * <abbr title="Pooling Models">pooling</abbr> - * <abbr title="Pooling Models">pooling</abbr>
* *
@@ -393,7 +400,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* ? *
- * <abbr title="Encoder-Decoder Models">enc-dec</abbr> - * <abbr title="Encoder-Decoder Models">enc-dec</abbr>
* *
* *
@@ -401,7 +408,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* *
- * <abbr title="Multimodal Inputs">mm</abbr> - * <abbr title="Multimodal Inputs">mm</abbr>
* *
* *
@@ -432,15 +439,15 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
* *
* *
* *
- * multi-step - * multi-step
* *
* *
* *
* *
* *
* [](gh-issue:8477) * [](gh-issue:8477)
* *
- * best-of - * best-of
* *
......
@@ -20,93 +20,93 @@ The table below shows the compatibility of various quantization implementations
* AWS Inferentia * AWS Inferentia
* Google TPU * Google TPU
- * AWQ - * AWQ
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
- * GPTQ - * GPTQ
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
- * Marlin (GPTQ/AWQ/FP8) - * Marlin (GPTQ/AWQ/FP8)
* *
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
* *
- * INT8 (W8A8) - * INT8 (W8A8)
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* ✅︎ * ✅︎
* *
* *
- * FP8 (W8A8) - * FP8 (W8A8)
* *
* *
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
- * AQLM - * AQLM
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
* *
- * bitsandbytes - * bitsandbytes
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
* *
- * DeepSpeedFP - * DeepSpeedFP
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
* *
- * GGUF - * GGUF
* ✅︎ * ✅︎
* ✅︎ * ✅︎
@@ -114,16 +114,16 @@ The table below shows the compatibility of various quantization implementations
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
::: :::
 - Volta refers to SM 7.0, Turing to SM 7.5, Ampere to SM 8.0/8.6, Ada to SM 8.9, and Hopper to SM 9.0.
-- "✅︎" indicates that the quantization method is supported on the specified hardware.
-- "✗" indicates that the quantization method is not supported on the specified hardware.
+- ✅︎ indicates that the quantization method is supported on the specified hardware.
+- ❌ indicates that the quantization method is not supported on the specified hardware.
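The SM-to-architecture mapping in the first bullet above can be written as a small lookup table. This is only an illustrative sketch: the names and version numbers come from that bullet, while `SM_TO_ARCH` and `arch_name` are hypothetical helpers that are not part of vLLM or of this commit.

```python
# Compute capability (major, minor) -> NVIDIA architecture name,
# exactly as listed in the note above. Nothing else is assumed.
SM_TO_ARCH = {
    (7, 0): "Volta",
    (7, 5): "Turing",
    (8, 0): "Ampere",
    (8, 6): "Ampere",
    (8, 9): "Ada",
    (9, 0): "Hopper",
}


def arch_name(major: int, minor: int) -> str:
    """Return the architecture name for a compute capability.

    Hypothetical helper: capabilities not listed in the note map to "unknown".
    """
    return SM_TO_ARCH.get((major, minor), "unknown")


if __name__ == "__main__":
    for cc in [(7, 0), (8, 6), (9, 0), (6, 1)]:
        print(cc, arch_name(*cc))
```

With PyTorch installed, `torch.cuda.get_device_capability()` returns such a `(major, minor)` tuple for the current device, which could be passed straight to the helper.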
 :::{note}
 This compatibility chart is subject to change as vLLM continues to evolve and expand its support for different hardware platforms and quantization methods.
......
@@ -28,10 +28,10 @@ The selected option sets the default pooler used to extract the final hidden sta
- * Embedding (`embed`) - * Embedding (`embed`)
* `LAST` * `LAST`
* ✅︎ * ✅︎
* *
- * Classification (`classify`) - * Classification (`classify`)
* `LAST` * `LAST`
* *
* ✅︎ * ✅︎
- * Sentence Pair Scoring (`score`) - * Sentence Pair Scoring (`score`)
* \* * \*
@@ -39,8 +39,8 @@ The selected option sets the default pooler used to extract the final hidden sta
* \* * \*
- * Reward Modeling (`reward`) - * Reward Modeling (`reward`)
* `ALL` * `ALL`
* *
* *
::: :::
\*The default pooler is always defined by the model. \*The default pooler is always defined by the model.
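The default pooling types visible in the table rows above can be summarized as a plain mapping. This is a sketch for reading the table, not vLLM's implementation: `DEFAULT_POOLING_TYPE` and `default_pooling_type` are hypothetical names, and `None` is used here to stand for the starred entry that the footnote says is defined by the model.

```python
from typing import Dict, Optional

# Task -> default pooling type, as shown in the table above.
# None stands for the starred entry (default defined by the model).
DEFAULT_POOLING_TYPE: Dict[str, Optional[str]] = {
    "embed": "LAST",
    "classify": "LAST",
    "score": None,
    "reward": "ALL",
}


def default_pooling_type(task: str) -> Optional[str]:
    """Look up the default pooling type for a task.

    Hypothetical helper: raises KeyError for tasks not listed in the table.
    """
    return DEFAULT_POOLING_TYPE[task]


if __name__ == "__main__":
    for task in DEFAULT_POOLING_TYPE:
        print(task, default_pooling_type(task))
```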
......