Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
b1ded114
Unverified
Commit
b1ded114
authored
Sep 28, 2025
by
Yuxuan Zhang
Committed by
GitHub
Sep 28, 2025
Browse files
Update GLM-4.5 Doc transformers version (#25830)
Signed-off-by:
zRzRzRzRzRzRzR
<
2448370773@qq.com
>
parent
f4e4088c
Changes
4
Hide whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
7 additions
and
5 deletions
+7
-5
docs/features/tool_calling.md
docs/features/tool_calling.md
+4
-2
docs/models/supported_models.md
docs/models/supported_models.md
+1
-1
tests/models/registry.py
tests/models/registry.py
+1
-1
vllm/model_executor/models/glm4_moe.py
vllm/model_executor/models/glm4_moe.py
+1
-1
No files found.
docs/features/tool_calling.md
View file @
b1ded114
...
@@ -323,8 +323,10 @@ Flags: `--tool-call-parser longcat`
...
@@ -323,8 +323,10 @@ Flags: `--tool-call-parser longcat`
Supported models:
Supported models:
*
`ZhipuAI/GLM-4.5`
*
`zai-org/GLM-4.5`
*
`ZhipuAI/GLM-4.5-Air`
*
`zai-org/GLM-4.5-Air`
*
`zai-org/GLM-4.6`
*
`zai-org/GLM-4.6-Air`
Flags:
`--tool-call-parser glm45`
Flags:
`--tool-call-parser glm45`
...
...
docs/models/supported_models.md
View file @
b1ded114
...
@@ -367,7 +367,7 @@ th {
...
@@ -367,7 +367,7 @@ th {
|
`Gemma3nForCausalLM`
| Gemma 3n |
`google/gemma-3n-E2B-it`
,
`google/gemma-3n-E4B-it`
, etc. | | | ✅︎ |
|
`Gemma3nForCausalLM`
| Gemma 3n |
`google/gemma-3n-E2B-it`
,
`google/gemma-3n-E4B-it`
, etc. | | | ✅︎ |
|
`GlmForCausalLM`
| GLM-4 |
`zai-org/glm-4-9b-chat-hf`
, etc. | ✅︎ | ✅︎ | ✅︎ |
|
`GlmForCausalLM`
| GLM-4 |
`zai-org/glm-4-9b-chat-hf`
, etc. | ✅︎ | ✅︎ | ✅︎ |
|
`Glm4ForCausalLM`
| GLM-4-0414 |
`zai-org/GLM-4-32B-0414`
, etc. | ✅︎ | ✅︎ | ✅︎ |
|
`Glm4ForCausalLM`
| GLM-4-0414 |
`zai-org/GLM-4-32B-0414`
, etc. | ✅︎ | ✅︎ | ✅︎ |
|
`Glm4MoeForCausalLM`
| GLM-4.5 |
`zai-org/GLM-4.5`
, etc. | ✅︎ | ✅︎ | ✅︎ |
|
`Glm4MoeForCausalLM`
| GLM-4.5
, GLM-4.6
|
`zai-org/GLM-4.5`
, etc. | ✅︎ | ✅︎ | ✅︎ |
|
`GPT2LMHeadModel`
| GPT-2 |
`gpt2`
,
`gpt2-xl`
, etc. | | ✅︎ | ✅︎ |
|
`GPT2LMHeadModel`
| GPT-2 |
`gpt2`
,
`gpt2-xl`
, etc. | | ✅︎ | ✅︎ |
|
`GPTBigCodeForCausalLM`
| StarCoder, SantaCoder, WizardCoder |
`bigcode/starcoder`
,
`bigcode/gpt_bigcode-santacoder`
,
`WizardLM/WizardCoder-15B-V1.0`
, etc. | ✅︎ | ✅︎ | ✅︎ |
|
`GPTBigCodeForCausalLM`
| StarCoder, SantaCoder, WizardCoder |
`bigcode/starcoder`
,
`bigcode/gpt_bigcode-santacoder`
,
`WizardLM/WizardCoder-15B-V1.0`
, etc. | ✅︎ | ✅︎ | ✅︎ |
|
`GPTJForCausalLM`
| GPT-J |
`EleutherAI/gpt-j-6b`
,
`nomic-ai/gpt4all-j`
, etc. | | ✅︎ | ✅︎ |
|
`GPTJForCausalLM`
| GPT-J |
`EleutherAI/gpt-j-6b`
,
`nomic-ai/gpt4all-j`
, etc. | | ✅︎ | ✅︎ |
...
...
tests/models/registry.py
View file @
b1ded114
...
@@ -642,7 +642,7 @@ _SPECULATIVE_DECODING_EXAMPLE_MODELS = {
...
@@ -642,7 +642,7 @@ _SPECULATIVE_DECODING_EXAMPLE_MODELS = {
speculative_model
=
"baidu/ERNIE-4.5-21B-A3B-PT"
),
speculative_model
=
"baidu/ERNIE-4.5-21B-A3B-PT"
),
"Glm4MoeMTPModel"
:
_HfExamplesInfo
(
"zai-org/GLM-4.5"
,
"Glm4MoeMTPModel"
:
_HfExamplesInfo
(
"zai-org/GLM-4.5"
,
speculative_model
=
"zai-org/GLM-4.5"
,
speculative_model
=
"zai-org/GLM-4.5"
,
min_transformers_version
=
"4.5
4
"
,
min_transformers_version
=
"4.5
6
"
,
is_available_online
=
False
),
is_available_online
=
False
),
"LongCatFlashMTPModel"
:
_HfExamplesInfo
(
"LongCatFlashMTPModel"
:
_HfExamplesInfo
(
"meituan-longcat/LongCat-Flash-Chat"
,
"meituan-longcat/LongCat-Flash-Chat"
,
...
...
vllm/model_executor/models/glm4_moe.py
View file @
b1ded114
...
@@ -21,7 +21,7 @@
...
@@ -21,7 +21,7 @@
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# See the License for the specific language governing permissions and
# limitations under the License.
# limitations under the License.
"""Inference-only GLM-4.5 model compatible with HuggingFace weights."""
"""Inference-only GLM-4.5
, GLM-4.6
model compatible with HuggingFace weights."""
import
typing
import
typing
from
collections.abc
import
Callable
,
Iterable
from
collections.abc
import
Callable
,
Iterable
from
itertools
import
islice
from
itertools
import
islice
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment