OpenDAS / vllm_cscc
Commit 5608e611 (Unverified)
Authored Oct 31, 2024 by Jee Jee Li; committed by GitHub on Oct 31, 2024
[Doc] Update Qwen documentation (#9869)
Parent: 3ea2dc2e
Changes: 2
Showing 2 changed files with 6 additions and 3 deletions
docs/source/models/supported_models.rst (+5 -2)
vllm/model_executor/models/qwen.py (+1 -1)
docs/source/models/supported_models.rst @ 5608e611
@@ -277,7 +277,7 @@ Text Generation
   * - :code:`QWenLMHeadModel`
     - Qwen
     - :code:`Qwen/Qwen-7B`, :code:`Qwen/Qwen-7B-Chat`, etc.
-    -
+    - ✅︎
     - ✅︎
   * - :code:`Qwen2ForCausalLM`
     - Qwen2
@@ -516,7 +516,7 @@ Text Generation
     - Qwen-VL
     - T + I\ :sup:`E+`
     - :code:`Qwen/Qwen-VL`, :code:`Qwen/Qwen-VL-Chat`, etc.
-    -
+    - ✅︎
     - ✅︎
   * - :code:`Qwen2AudioForConditionalGeneration`
     - Qwen2-Audio
@@ -540,6 +540,9 @@ Text Generation
 | :sup:`E` Pre-computed embeddings can be inputted for this modality.
 | :sup:`+` Multiple items can be inputted per text prompt for this modality.
+.. note::
+   vLLM currently only supports adding LoRA to the language backbone of multimodal models.
 .. note::
    For :code:`openbmb/MiniCPM-V-2`, the official repo doesn't work yet, so we need to use a fork (:code:`HwwwH/MiniCPM-V-2`) for now.
    For more details, please see: https://github.com/vllm-project/vllm/pull/4087#issuecomment-2250397630
vllm/model_executor/models/qwen.py @ 5608e611
@@ -1048,7 +1048,7 @@ class QWenVL(QWenBaseModel):
 @MULTIMODAL_REGISTRY.register_max_image_tokens(MAX_QWEN_IMG_TOKENS)
 @INPUT_REGISTRY.register_dummy_data(dummy_data_for_qwen)
 @INPUT_REGISTRY.register_input_processor(input_processor_for_qwen)
-class QWenLMHeadModel(QWenBaseModel):
+class QWenLMHeadModel(QWenBaseModel, SupportsLoRA):
     """
     QWenLMHeadModel is not only applicable to LLM but also to VL, which is not
     conducive to the current integration logic of LoRA in vLLM. Therefore, it
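The only code change in this file is adding `SupportsLoRA` to the base classes of `QWenLMHeadModel`. As a rough illustration of how a marker mixin like this can advertise a capability, here is a minimal, self-contained sketch. It does not import vLLM: `SupportsLoRA` is redefined below as a plain class, and `QWenLMHeadModelBefore`, `QWenLMHeadModelAfter`, and the `supports_lora` helper are hypothetical stand-ins written for this example, so the real interface in vLLM may differ.

```python
from typing import ClassVar


class SupportsLoRA:
    """Stand-in marker mixin (assumption): inheriting from it opts a model in."""

    supports_lora: ClassVar[bool] = True


class QWenBaseModel:
    """Stand-in for the real base class; carries no logic in this sketch."""


class QWenLMHeadModelBefore(QWenBaseModel):
    """Class declaration as it looked before the commit."""


class QWenLMHeadModelAfter(QWenBaseModel, SupportsLoRA):
    """Class declaration after the commit: the mixin is added as a second base."""


def supports_lora(model_cls: type) -> bool:
    """Hypothetical helper: report whether a model class carries the marker."""
    return getattr(model_cls, "supports_lora", False) is True


if __name__ == "__main__":
    print("before:", supports_lora(QWenLMHeadModelBefore))
    print("after: ", supports_lora(QWenLMHeadModelAfter))
```

Under these assumptions, a loader that gates LoRA on such a check would reject the old declaration and accept the new one, which matches the table change that marks `QWenLMHeadModel` as LoRA-capable.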