OpenDAS / vllm_cscc
Commit e2db1164 (unverified)
Authored Aug 24, 2025 by Cyrus Leung
Committed by GitHub, Aug 24, 2025
[Model] Enable BLOOM on V1 (#23488)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Parent: 416f0592
Showing 2 changed files with 3 additions and 3 deletions:
docs/models/supported_models.md (+1 -1)
vllm/model_executor/models/bloom.py (+2 -2)
docs/models/supported_models.md
@@ -328,7 +328,7 @@ th {
 | `BaiChuanForCausalLM` | Baichuan2, Baichuan | `baichuan-inc/Baichuan2-13B-Chat`, `baichuan-inc/Baichuan-7B`, etc. | ✅︎ | ✅︎ | ✅︎ |
 | `BailingMoeForCausalLM` | Ling | `inclusionAI/Ling-lite-1.5`, `inclusionAI/Ling-plus`, etc. | ✅︎ | ✅︎ | ✅︎ |
 | `BambaForCausalLM` | Bamba | `ibm-ai-platform/Bamba-9B-fp8`, `ibm-ai-platform/Bamba-9B` | ✅︎ | ✅︎ | ✅︎ |
-| `BloomForCausalLM` | BLOOM, BLOOMZ, BLOOMChat | `bigscience/bloom`, `bigscience/bloomz`, etc. | | ✅︎ | |
+| `BloomForCausalLM` | BLOOM, BLOOMZ, BLOOMChat | `bigscience/bloom`, `bigscience/bloomz`, etc. | | ✅︎ | ✅︎ |
 | `BartForConditionalGeneration` | BART | `facebook/bart-base`, `facebook/bart-large-cnn`, etc. | | | |
 | `MBartForConditionalGeneration` | mBART | `facebook/mbart-large-en-ro`, `facebook/mbart-large-50`, etc. | | | |
 | `ChatGLMModel`, `ChatGLMForConditionalGeneration` | ChatGLM | `zai-org/chatglm2-6b`, `zai-org/chatglm3-6b`, `ShieldLM-6B-chatglm3`, etc. | ✅︎ | ✅︎ | ✅︎ |
vllm/model_executor/models/bloom.py
@@ -43,7 +43,7 @@ from vllm.model_executor.model_loader.weight_utils import default_weight_loader
 from vllm.model_executor.sampling_metadata import SamplingMetadata
 from vllm.sequence import IntermediateTensors

-from .interfaces import SupportsPP, SupportsQuant, SupportsV0Only
+from .interfaces import SupportsPP, SupportsQuant
 from .utils import (AutoWeightsLoader, is_pp_missing_parameter,
                     make_empty_intermediate_tensors_factory, make_layers,
                     maybe_prefix)
@@ -313,7 +313,7 @@ class BloomModel(nn.Module):
         return loaded_params


-class BloomForCausalLM(nn.Module, SupportsPP, SupportsV0Only, SupportsQuant):
+class BloomForCausalLM(nn.Module, SupportsPP, SupportsQuant):

     def __init__(self, *, vllm_config: VllmConfig, prefix: str = ""):
         super().__init__()
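
The code change in this commit consists of dropping one marker base class (`SupportsV0Only`) from `BloomForCausalLM`. A minimal sketch of that marker-mixin pattern follows, assuming a simplified stand-in for how an engine might check such a marker. The names `V0OnlyMarker`, `PipelineParallelMarker`, `supports_v1`, `OldBloom`, and `NewBloom` are hypothetical and are not vLLM's actual API.

```python
class V0OnlyMarker:
    """Hypothetical marker mixin: inheriting it flags a model as legacy-engine only."""


class PipelineParallelMarker:
    """Hypothetical marker mixin for an unrelated capability."""


def supports_v1(model_cls: type) -> bool:
    # Assumption for this sketch: a model is treated as V1-capable
    # unless it inherits the V0-only marker.
    return not issubclass(model_cls, V0OnlyMarker)


# Before the change: the model class carries the V0-only marker.
class OldBloom(PipelineParallelMarker, V0OnlyMarker):
    pass


# After the change: the marker is removed; other capabilities are untouched.
class NewBloom(PipelineParallelMarker):
    pass


if __name__ == "__main__":
    print(supports_v1(OldBloom), supports_v1(NewBloom))
```

In this sketch the capability check depends only on the class hierarchy, so removing the marker from the base-class list is enough to change the result; no method bodies need to be edited.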