Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
ba601102
Unverified
Commit
ba601102
authored
Sep 11, 2025
by
Russell Bryant
Committed by
GitHub
Sep 11, 2025
Browse files
[Docs] Update V1 doc to reflect whisper support (#24606)
Signed-off-by:
Russell Bryant
<
rbryant@redhat.com
>
parent
85df8afd
Changes
2
Show whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
5 additions
and
4 deletions
+5
-4
docs/models/supported_models.md
docs/models/supported_models.md
+1
-1
docs/usage/v1_guide.md
docs/usage/v1_guide.md
+4
-3
No files found.
docs/models/supported_models.md
View file @
ba601102
...
...
@@ -766,7 +766,7 @@ Speech2Text models trained specifically for Automatic Speech Recognition.
| Architecture | Models | Example HF Models |
[
LoRA
](
../features/lora.md
)
|
[
PP
](
../serving/parallelism_scaling.md
)
|
[
V1
](
gh-issue:8779
)
|
|--------------|--------|-------------------|----------------------|---------------------------|---------------------|
|
`WhisperForConditionalGeneration`
| Whisper |
`openai/whisper-small`
,
`openai/whisper-large-v3-turbo`
, etc. | | | |
|
`WhisperForConditionalGeneration`
| Whisper |
`openai/whisper-small`
,
`openai/whisper-large-v3-turbo`
, etc. | | |
✅︎
|
|
`VoxtralForConditionalGeneration`
| Voxtral (Mistral format) |
`mistralai/Voxtral-Mini-3B-2507`
,
`mistralai/Voxtral-Small-24B-2507`
, etc. | ✅︎ | ✅︎ | ✅︎ |
|
`Gemma3nForConditionalGeneration`
| Gemma3n |
`google/gemma-3n-E2B-it`
,
`google/gemma-3n-E4B-it`
, etc. | | | ✅︎ |
...
...
docs/usage/v1_guide.md
View file @
ba601102
...
...
@@ -83,7 +83,7 @@ based on assigned priority, with FCFS as a tie-breaker), configurable via the
| Model Type | Status |
|-----------------------------|------------------------------------------------------------------------------------|
|
**Decoder-only Models**
|
<nobr>
🚀 Optimized
</nobr>
|
|
**Encoder-Decoder Models**
|
<nobr>
🟠 Delayed
</nobr>
|
|
**Encoder-Decoder Models**
|
<nobr>
🟢 Whisper only
</nobr>
|
|
**Embedding Models**
|
<nobr>
🟢 Functional
</nobr>
|
|
**Mamba Models**
|
<nobr>
🟢 (Mamba-2), 🟢 (Mamba-1)
</nobr>
|
|
**Multimodal Models**
|
<nobr>
🟢 Functional
</nobr>
|
...
...
@@ -118,8 +118,9 @@ Please note that prefix caching is not yet supported for any of the above models
#### Encoder-Decoder Models
Models requiring cross-attention between separate encoder and decoder (e.g.,
`BartForConditionalGeneration`
,
`MllamaForConditionalGeneration`
)
are not yet supported.
Whisper is supported. Other models requiring cross-attention between separate
encoder and decoder (e.g.,
`BartForConditionalGeneration`
,
`MllamaForConditionalGeneration`
) are not yet supported.
### Features
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment