Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
5bbaf492
Unverified
Commit
5bbaf492
authored
Jul 30, 2025
by
Cyrus Leung
Committed by
GitHub
Jul 30, 2025
Browse files
[Doc] Update partial support (#21916)
Signed-off-by:
DarkLight1337
<
tlleungac@connect.ust.hk
>
parent
533db093
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
3 deletions
+4
-3
docs/features/compatibility_matrix.md
docs/features/compatibility_matrix.md
+4
-3
No files found.
docs/features/compatibility_matrix.md
View file @
5bbaf492
...
...
@@ -41,17 +41,18 @@ th:not(:first-child) {
|
[
LoRA
](
lora.md
)
| ✅ | ✅ | ✅ | | | | | | | | | | | |
|
[
SD
](
spec_decode.md
)
| ✅ | ✅ | ❌ | ✅ | | | | | | | | | | |
| CUDA graph | ✅ | ✅ | ✅ | ✅ | ✅ | | | | | | | | | |
|
[
pooling
](
../models/pooling_models.md
)
|
✅
\*
|
✅
\*
| ✅ | ❌ | ✅ | ✅ | | | | | | | | |
|
[
pooling
](
../models/pooling_models.md
)
|
🟠
\*
|
🟠
\*
| ✅ | ❌ | ✅ | ✅ | | | | | | | | |
|
<abbr
title=
"Encoder-Decoder Models"
>
enc-dec
</abbr>
| ❌ |
[
❌
](
gh-issue:7366
)
| ❌ |
[
❌
](
gh-issue:7366
)
| ✅ | ✅ | ✅ | | | | | | | |
|
<abbr
title=
"Logprobs"
>
logP
</abbr>
| ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ | | | | | | |
|
<abbr
title=
"Prompt Logprobs"
>
prmpt logP
</abbr>
| ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ | ✅ | | | | | |
|
<abbr
title=
"Async Output Processing"
>
async output
</abbr>
| ✅ | ✅ | ✅ | ❌ | ✅ | ❌ | ❌ | ✅ | ✅ | ✅ | | | | |
| multi-step | ❌ | ✅ | ❌ | ❌ | ✅ | ❌ | ❌ | ✅ | ✅ | ✅ | ✅ | | | |
|
[
mm
](
multimodal_inputs.md
)
| ✅ | ✅ |
[
🟠
](
gh-pr:4194
)
| ❔ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❔ | ✅ | | |
|
[
mm
](
multimodal_inputs.md
)
| ✅ | ✅ |
[
🟠
](
gh-pr:4194
)
<sup>
^
</sup>
| ❔ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❔ | ✅ | | |
| best-of | ✅ | ✅ | ✅ |
[
❌
](
gh-issue:6137
)
| ✅ | ❌ | ✅ | ✅ | ✅ | ❔ |
[
❌
](
gh-issue:7968
)
| ✅ | ✅ | |
| beam-search | ✅ | ✅ | ✅ |
[
❌
](
gh-issue:6137
)
| ✅ | ❌ | ✅ | ✅ | ✅ | ❔ |
[
❌
](
gh-issue:7968
)
| ❔ | ✅ | ✅ |
\*
Chunked prefill and prefix caching are only applicable to last-token pooling.
<sup>
^
</sup>
LoRA is only applicable to the language backbone of multimodal models.
[](
){
#feature-x-hardware }
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment