"vscode:/vscode.git/clone" did not exist on "992e5c3d34776cae25a9fecf0a45d51fa69d6dbc"
Unverified Commit 6f68c492 authored by 22quinn's avatar 22quinn Committed by GitHub
Browse files

[Doc] Update V1 user guide for embedding models (#19842)


Signed-off-by: default avatar22quinn <33176974+22quinn@users.noreply.github.com>
parent 47194606
...@@ -70,7 +70,7 @@ For each item, our progress towards V1 support falls into one of the following s ...@@ -70,7 +70,7 @@ For each item, our progress towards V1 support falls into one of the following s
|-----------------------------|------------------------------------------------------------------------------------| |-----------------------------|------------------------------------------------------------------------------------|
| **Decoder-only Models** | <nobr>🚀 Optimized</nobr> | | **Decoder-only Models** | <nobr>🚀 Optimized</nobr> |
| **Encoder-Decoder Models** | <nobr>🟠 Delayed</nobr> | | **Encoder-Decoder Models** | <nobr>🟠 Delayed</nobr> |
| **Embedding Models** | <nobr>🚧 WIP ([PR #16188](https://github.com/vllm-project/vllm/pull/16188))</nobr> | | **Embedding Models** | <nobr>🟢 Functional</nobr> |
| **Mamba Models** | <nobr>🚧 WIP ([PR #19327](https://github.com/vllm-project/vllm/pull/19327))</nobr> | | **Mamba Models** | <nobr>🚧 WIP ([PR #19327](https://github.com/vllm-project/vllm/pull/19327))</nobr> |
| **Multimodal Models** | <nobr>🟢 Functional</nobr> | | **Multimodal Models** | <nobr>🟢 Functional</nobr> |
...@@ -80,11 +80,11 @@ vLLM V1 currently excludes model architectures with the `SupportsV0Only` protoco ...@@ -80,11 +80,11 @@ vLLM V1 currently excludes model architectures with the `SupportsV0Only` protoco
This corresponds to the V1 column in our [list of supported models][supported-models]. This corresponds to the V1 column in our [list of supported models][supported-models].
See below for the status of models that are still not yet supported in V1. See below for the status of models that are not yet supported or have more features planned in V1.
#### Embedding Models #### Embedding Models
The initial support will be provided by [PR #16188](https://github.com/vllm-project/vllm/pull/16188). The initial basic support is now functional.
Later, we will consider using [hidden states processor](https://github.com/vllm-project/vllm/issues/12249), Later, we will consider using [hidden states processor](https://github.com/vllm-project/vllm/issues/12249),
which is based on [global logits processor](https://github.com/vllm-project/vllm/pull/13360) which is based on [global logits processor](https://github.com/vllm-project/vllm/pull/13360)
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment