[Doc] Add V1 column to supported models list (#19523)

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Commit c742438f (unverified), authored Jun 12, 2025 by Cyrus Leung, committed by GitHub Jun 12, 2025

Showing with 168 additions and 160 deletions

docs/models/supported_models.md +143 -141

docs/usage/v1_guide.md +25 -19

--- a/docs/models/supported_models.md
+++ b/docs/models/supported_models.md
--- a/docs/usage/v1_guide.md
+++ b/docs/usage/v1_guide.md
@@ -67,37 +67,43 @@ For each item, our progress towards V1 support falls into one of the following s
 ### Models
 | Model Type                  | Status                                                                             |
-|-----------------|-----------------------------------------------------------------------------------|
+|-----------------------------|------------------------------------------------------------------------------------|
 | **Decoder-only Models**     | <nobr>🚀 Optimized</nobr>                                                          |
 | **Encoder-Decoder Models**  | <nobr>🟠 Delayed</nobr>                                                            |
 | **Embedding Models**        | <nobr>🚧 WIP ([PR #16188](https://github.com/vllm-project/vllm/pull/16188))</nobr> |
 | **Mamba Models**            | <nobr>🚧 WIP ([PR #19327](https://github.com/vllm-project/vllm/pull/19327))</nobr> |
 | **Multimodal Models**       | <nobr>🟢 Functional</nobr>                                                         |
-vLLM V1 currently excludes model architectures with the `SupportsV0Only` protocol,
+vLLM V1 currently excludes model architectures with the `SupportsV0Only` protocol.
-and the majority fall into the following categories:
+!!! tip
+    This corresponds to the V1 column in our [list of supported models][supported-models].
+See below for the status of models that are still not yet supported in V1.
+#### Embedding Models
-**Embedding Models**  
 The initial support will be provided by [PR #16188](https://github.com/vllm-project/vllm/pull/16188).
 Later, we will consider using [hidden states processor](https://github.com/vllm-project/vllm/issues/12249),
 which is based on [global logits processor](https://github.com/vllm-project/vllm/pull/13360)
 to enable simultaneous generation and embedding using the same engine instance in V1.
-**Mamba Models**  
+#### Mamba Models
 Models using selective state-space mechanisms instead of standard transformer attention (e.g., `MambaForCausalLM`, `JambaForCausalLM`)
 will be supported via [PR #19327](https://github.com/vllm-project/vllm/pull/19327).
-**Encoder-Decoder Models**  
+#### Encoder-Decoder Models
-vLLM V1 is currently optimized for decoder-only transformers.
-Models requiring cross-attention between separate encoder and decoder are not yet supported (e.g., `BartForConditionalGeneration`, `MllamaForConditionalGeneration`).
-For a complete list of supported models, see the [list of supported models](https://docs.vllm.ai/en/latest/models/supported_models.html).
+Models requiring cross-attention between separate encoder and decoder (e.g., `BartForConditionalGeneration`, `MllamaForConditionalGeneration`)
+are not yet supported.
 ### Features
 | Feature                                     | Status                                                                            |
-|-----------------|-----------------------------------------------------------------------------------|
+|---------------------------------------------|-----------------------------------------------------------------------------------|
 | **Prefix Caching**                          | <nobr>🚀 Optimized</nobr>                                                         |
 | **Chunked Prefill**                         | <nobr>🚀 Optimized</nobr>                                                         |
 | **LoRA**                                    | <nobr>🚀 Optimized</nobr>                                                         |