OpenDAS
vllm_cscc
Commits
Commit a3154a60 (Unverified)
Authored Feb 02, 2026 by Paco Xu
Committed by GitHub, Feb 02, 2026

[Doc] add missing model entries in supported_models.md (#33220)

Signed-off-by: Paco Xu <paco.xu@daocloud.io>

Parent: 7c036432
Showing 1 changed file with 13 additions and 7 deletions

docs/models/supported_models.md (+13 -7)
@@ -365,6 +365,7 @@ th {
 | `BloomForCausalLM` | BLOOM, BLOOMZ, BLOOMChat | `bigscience/bloom`, `bigscience/bloomz`, etc. | | ✅︎ |
 | `ChatGLMModel`, `ChatGLMForConditionalGeneration` | ChatGLM | `zai-org/chatglm2-6b`, `zai-org/chatglm3-6b`, `thu-coai/ShieldLM-6B-chatglm3`, etc. | ✅︎ | ✅︎ |
 | `CohereForCausalLM`, `Cohere2ForCausalLM` | Command-R, Command-A | `CohereLabs/c4ai-command-r-v01`, `CohereLabs/c4ai-command-r7b-12-2024`, `CohereLabs/c4ai-command-a-03-2025`, `CohereLabs/command-a-reasoning-08-2025`, etc. | ✅︎ | ✅︎ |
+| `CwmForCausalLM` | CWM | `facebook/cwm`, etc. | ✅︎ | ✅︎ |
 | `DbrxForCausalLM` | DBRX | `databricks/dbrx-base`, `databricks/dbrx-instruct`, etc. | | ✅︎ |
 | `DeciLMForCausalLM` | DeciLM | `nvidia/Llama-3_3-Nemotron-Super-49B-v1`, etc. | ✅︎ | ✅︎ |
 | `DeepseekForCausalLM` | DeepSeek | `deepseek-ai/deepseek-llm-67b-base`, `deepseek-ai/deepseek-llm-7b-chat`, etc. | ✅︎ | ✅︎ |
@@ -375,7 +376,7 @@ th {
 | `Ernie4_5ForCausalLM` | Ernie4.5 | `baidu/ERNIE-4.5-0.3B-PT`, etc. | ✅︎ | ✅︎ |
 | `Ernie4_5_MoeForCausalLM` | Ernie4.5MoE | `baidu/ERNIE-4.5-21B-A3B-PT`, `baidu/ERNIE-4.5-300B-A47B-PT`, etc. |✅︎| ✅︎ |
 | `ExaoneForCausalLM` | EXAONE-3 | `LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct`, etc. | ✅︎ | ✅︎ |
-| `ExaoneMoeCausalLM` | K-EXAONE | `LGAI-EXAONE/K-EXAONE-236B-A23B`, etc. | | |
+| `ExaoneMoEForCausalLM` | K-EXAONE | `LGAI-EXAONE/K-EXAONE-236B-A23B`, etc. | | |
 | `Exaone4ForCausalLM` | EXAONE-4 | `LGAI-EXAONE/EXAONE-4.0-32B`, etc. | ✅︎ | ✅︎ |
 | `Fairseq2LlamaForCausalLM` | Llama (fairseq2 format) | `mgleize/fairseq2-dummy-Llama-3.2-1B`, etc. | ✅︎ | ✅︎ |
 | `FalconForCausalLM` | Falcon | `tiiuae/falcon-7b`, `tiiuae/falcon-40b`, `tiiuae/falcon-rw-7b`, etc. | | ✅︎ |
@@ -389,6 +390,7 @@ th {
 | `GlmForCausalLM` | GLM-4 | `zai-org/glm-4-9b-chat-hf`, etc. | ✅︎ | ✅︎ |
 | `Glm4ForCausalLM` | GLM-4-0414 | `zai-org/GLM-4-32B-0414`, etc. | ✅︎ | ✅︎ |
 | `Glm4MoeForCausalLM` | GLM-4.5, GLM-4.6, GLM-4.7 | `zai-org/GLM-4.5`, etc. | ✅︎ | ✅︎ |
+| `Glm4MoeLiteForCausalLM` | GLM-4.7-Flash | `zai-org/GLM-4.7-Flash`, etc. | ✅︎ | ✅︎ |
 | `GPT2LMHeadModel` | GPT-2 | `openai-community/gpt2`, `openai-community/gpt2-xl`, etc. | | ✅︎ |
 | `GPTBigCodeForCausalLM` | StarCoder, SantaCoder, WizardCoder | `bigcode/starcoder`, `bigcode/gpt_bigcode-santacoder`, `WizardLM/WizardCoder-15B-V1.0`, etc. | ✅︎ | ✅︎ |
 | `GPTJForCausalLM` | GPT-J | `EleutherAI/gpt-j-6b`, `nomic-ai/gpt4all-j`, etc. | | ✅︎ |
@@ -403,7 +405,6 @@ th {
 | `Grok1ForCausalLM` | Grok2 | `xai-org/grok-2` | ✅︎ | ✅︎ |
 | `HunYuanDenseV1ForCausalLM` | Hunyuan Dense | `tencent/Hunyuan-7B-Instruct` | ✅︎ | ✅︎ |
 | `HunYuanMoEV1ForCausalLM` | Hunyuan-A13B | `tencent/Hunyuan-A13B-Instruct`, `tencent/Hunyuan-A13B-Pretrain`, `tencent/Hunyuan-A13B-Instruct-FP8`, etc. | ✅︎ | ✅︎ |
-| `HCXVisionForCausalLM` | HyperCLOVAX-SEED-Vision-Instruct-3B | `naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B` | | |
 | `InternLMForCausalLM` | InternLM | `internlm/internlm-7b`, `internlm/internlm-chat-7b`, etc. | ✅︎ | ✅︎ |
 | `InternLM2ForCausalLM` | InternLM2 | `internlm/internlm2-7b`, `internlm/internlm2-chat-7b`, etc. | ✅︎ | ✅︎ |
 | `InternLM3ForCausalLM` | InternLM3 | `internlm/internlm3-8b-instruct`, etc. | ✅︎ | ✅︎ |
@@ -416,12 +417,14 @@ th {
 | `Lfm2ForCausalLM` | LFM2 | `LiquidAI/LFM2-1.2B`, `LiquidAI/LFM2-700M`, `LiquidAI/LFM2-350M`, etc. | ✅︎ | ✅︎ |
 | `Lfm2MoeForCausalLM` | LFM2MoE | `LiquidAI/LFM2-8B-A1B-preview`, etc. | ✅︎ | ✅︎ |
 | `LlamaForCausalLM` | Llama 3.1, Llama 3, Llama 2, LLaMA, Yi | `meta-llama/Meta-Llama-3.1-405B-Instruct`, `meta-llama/Meta-Llama-3.1-70B`, `meta-llama/Meta-Llama-3-70B-Instruct`, `meta-llama/Llama-2-70b-hf`, `01-ai/Yi-34B`, etc. | ✅︎ | ✅︎ |
+| `LongcatFlashForCausalLM` | LongCat-Flash | `meituan-longcat/LongCat-Flash-Chat`, `meituan-longcat/LongCat-Flash-Chat-FP8` | ✅︎ | ✅︎ |
 | `MambaForCausalLM` | Mamba | `state-spaces/mamba-130m-hf`, `state-spaces/mamba-790m-hf`, `state-spaces/mamba-2.8b-hf`, etc. | | ✅︎ |
 | `Mamba2ForCausalLM` | Mamba2 | `mistralai/Mamba-Codestral-7B-v0.1`, etc. | | ✅︎ |
 | `MiMoForCausalLM` | MiMo | `XiaomiMiMo/MiMo-7B-RL`, etc. | ✅︎ | ✅︎ |
 | `MiMoV2FlashForCausalLM` | MiMoV2Flash | `XiaomiMiMo/MiMo-V2-Flash`, etc. | ︎| ✅︎ |
 | `MiniCPMForCausalLM` | MiniCPM | `openbmb/MiniCPM-2B-sft-bf16`, `openbmb/MiniCPM-2B-dpo-bf16`, `openbmb/MiniCPM-S-1B-sft`, etc. | ✅︎ | ✅︎ |
 | `MiniCPM3ForCausalLM` | MiniCPM3 | `openbmb/MiniCPM3-4B`, etc. | ✅︎ | ✅︎ |
+| `MiniMaxForCausalLM` | MiniMax-Text | `MiniMaxAI/MiniMax-Text-01-hf`, etc. | | |
 | `MiniMaxM2ForCausalLM` | MiniMax-M2, MiniMax-M2.1 | `MiniMaxAI/MiniMax-M2`, etc. | ✅︎ | ✅︎ |
 | `MistralForCausalLM` | Ministral-3, Mistral, Mistral-Instruct | `mistralai/Ministral-3-3B-Instruct-2512`, `mistralai/Mistral-7B-v0.1`, `mistralai/Mistral-7B-Instruct-v0.1`, etc. | ✅︎ | ✅︎ |
 | `MistralLarge3ForCausalLM` | Mistral-Large-3-675B-Base-2512, Mistral-Large-3-675B-Instruct-2512 | `mistralai/Mistral-Large-3-675B-Base-2512`, `mistralai/Mistral-Large-3-675B-Instruct-2512`, etc. | ✅︎ | ✅︎ |
@@ -429,10 +432,10 @@ th {
 | `MPTForCausalLM` | MPT, MPT-Instruct, MPT-Chat, MPT-StoryWriter | `mosaicml/mpt-7b`, `mosaicml/mpt-7b-storywriter`, `mosaicml/mpt-30b`, etc. | | ✅︎ |
 | `NemotronForCausalLM` | Nemotron-3, Nemotron-4, Minitron | `nvidia/Minitron-8B-Base`, `mgoin/Nemotron-4-340B-Base-hf-FP8`, etc. | ✅︎ | ✅︎ |
 | `NemotronHForCausalLM` | Nemotron-H | `nvidia/Nemotron-H-8B-Base-8K`, `nvidia/Nemotron-H-47B-Base-8K`, `nvidia/Nemotron-H-56B-Base-8K`, etc. | ✅︎ | ✅︎ |
-| `OLMoForCausalLM` | OLMo | `allenai/OLMo-1B-hf`, `allenai/OLMo-7B-hf`, etc. | ✅︎ | ✅︎ |
+| `OlmoForCausalLM` | OLMo | `allenai/OLMo-1B-hf`, `allenai/OLMo-7B-hf`, etc. | ✅︎ | ✅︎ |
-| `OLMo2ForCausalLM` | OLMo2 | `allenai/OLMo-2-0425-1B`, etc. | ✅︎ | ✅︎ |
+| `Olmo2ForCausalLM` | OLMo2 | `allenai/OLMo-2-0425-1B`, etc. | ✅︎ | ✅︎ |
-| `OLMo3ForCausalLM` | OLMo3 | `allenai/Olmo-3-7B-Instruct`, `allenai/Olmo-3-32B-Think`, etc. | ✅︎ | ✅︎ |
+| `Olmo3ForCausalLM` | OLMo3 | `allenai/Olmo-3-7B-Instruct`, `allenai/Olmo-3-32B-Think`, etc. | ✅︎ | ✅︎ |
-| `OLMoEForCausalLM` | OLMoE | `allenai/OLMoE-1B-7B-0924`, `allenai/OLMoE-1B-7B-0924-Instruct`, etc. | | ✅︎ |
+| `OlmoeForCausalLM` | OLMoE | `allenai/OLMoE-1B-7B-0924`, `allenai/OLMoE-1B-7B-0924-Instruct`, etc. | | ✅︎ |
 | `OPTForCausalLM` | OPT, OPT-IML | `facebook/opt-66b`, `facebook/opt-iml-max-30b`, etc. | ✅︎ | ✅︎ |
 | `OrionForCausalLM` | Orion | `OrionStarAI/Orion-14B-Base`, `OrionStarAI/Orion-14B-Chat`, etc. | | ✅︎ |
 | `OuroForCausalLM` | ouro | `ByteDance/Ouro-1.4B`, `ByteDance/Ouro-2.6B`, etc. | ✅︎ | |
@@ -451,19 +454,21 @@ th {
 | `Qwen3ForCausalLM` | Qwen3 | `Qwen/Qwen3-8B`, etc. | ✅︎ | ✅︎ |
 | `Qwen3MoeForCausalLM` | Qwen3MoE | `Qwen/Qwen3-30B-A3B`, etc. | ✅︎ | ✅︎ |
 | `Qwen3NextForCausalLM` | Qwen3NextMoE | `Qwen/Qwen3-Next-80B-A3B-Instruct`, etc. | ✅︎ | ✅︎ |
+| `RWForCausalLM` | Falcon RW | `tiiuae/falcon-40b`, etc. | | ✅︎ |
 | `SeedOssForCausalLM` | SeedOss | `ByteDance-Seed/Seed-OSS-36B-Instruct`, etc. | ✅︎ | ✅︎ |
 | `SolarForCausalLM` | Solar Pro | `upstage/solar-pro-preview-instruct`, etc. | ✅︎ | ✅︎ |
 | `StableLmForCausalLM` | StableLM | `stabilityai/stablelm-3b-4e1t`, `stabilityai/stablelm-base-alpha-7b-v2`, etc. | | |
+| `StableLMEpochForCausalLM` | StableLM Epoch | `stabilityai/stablelm-zephyr-3b`, etc. | | ✅︎ |
 | `Starcoder2ForCausalLM` | Starcoder2 | `bigcode/starcoder2-3b`, `bigcode/starcoder2-7b`, `bigcode/starcoder2-15b`, etc. | | ✅︎ |
 | `Step1ForCausalLM` | Step-Audio | `stepfun-ai/Step-Audio-EditX`, etc. | ✅︎ | ✅︎ |
 | `Step3p5ForCausalLM` | Step-3.5-flash | `stepfun-ai/step-3.5-flash`, etc. | | ✅︎ |
+| `TeleChatForCausalLM` | TeleChat | `chuhac/TeleChat2-35B`, etc. | ✅︎ | ✅︎ |
 | `TeleChat2ForCausalLM` | TeleChat2 | `Tele-AI/TeleChat2-3B`, `Tele-AI/TeleChat2-7B`, `Tele-AI/TeleChat2-35B`, etc. | ✅︎ | ✅︎ |
 | `TeleFLMForCausalLM` | TeleFLM | `CofeAI/FLM-2-52B-Instruct-2407`, `CofeAI/Tele-FLM`, etc. | ✅︎ | ✅︎ |
 | `XverseForCausalLM` | XVERSE | `xverse/XVERSE-7B-Chat`, `xverse/XVERSE-13B-Chat`, `xverse/XVERSE-65B-Chat`, etc. | ✅︎ | ✅︎ |
 | `MiniMaxM1ForCausalLM` | MiniMax-Text | `MiniMaxAI/MiniMax-M1-40k`, `MiniMaxAI/MiniMax-M1-80k`, etc. | | |
 | `MiniMaxText01ForCausalLM` | MiniMax-Text | `MiniMaxAI/MiniMax-Text-01`, etc. | | |
 | `Zamba2ForCausalLM` | Zamba2 | `Zyphra/Zamba2-7B-instruct`, `Zyphra/Zamba2-2.7B-instruct`, `Zyphra/Zamba2-1.2B-instruct`, etc. | | |
-| `LongcatFlashForCausalLM` | LongCat-Flash | `meituan-longcat/LongCat-Flash-Chat`, `meituan-longcat/LongCat-Flash-Chat-FP8` | ✅︎ | ✅︎ |

 !!! note
     Grok2 requires `tokenizer.tok.json` with `tiktoken` installed. You can optionally override MoE router renormalization with `moe_router_renormalize`.
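The Grok2 note above names two prerequisites: a `tokenizer.tok.json` file and an installed `tiktoken` package. A minimal preflight sketch for those two conditions might look like the following. The function name `grok2_preflight` is hypothetical and not part of vLLM, and the sketch assumes the tokenizer file sits directly in a local model directory, which the note does not state.

```python
import importlib.util
from pathlib import Path


def grok2_preflight(model_dir):
    """Return a list of human-readable problems; empty means both checks passed.

    Hypothetical helper, not a vLLM API. Assumes tokenizer.tok.json is
    expected directly inside model_dir (an assumption, not stated in the note).
    """
    problems = []

    # Check that the tiktoken package is importable without importing it.
    if importlib.util.find_spec("tiktoken") is None:
        problems.append("tiktoken is not installed")

    # Check that the tokenizer file named in the note exists.
    if not (Path(model_dir) / "tokenizer.tok.json").is_file():
        problems.append(f"tokenizer.tok.json not found in {model_dir}")

    return problems


if __name__ == "__main__":
    for problem in grok2_preflight("."):
        print("missing:", problem)
```

Returning a list rather than raising keeps both findings visible in a single run.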
@@ -677,6 +682,7 @@ These models primarily accept the [`LLM.generate`](./generative_models.md#llmgen
 | `Glm4vMoeForConditionalGeneration` | GLM-4.5V | T + I<sup>E+</sup> + V<sup>E+</sup> | `zai-org/GLM-4.5V`, etc. | ✅︎ | ✅︎ |
 | `GlmOcrForConditionalGeneration` | GLM-OCR | T + I<sup>E+</sup> | `zai-org/GLM-OCR`, etc. | ✅︎ | ✅︎ |
 | `GraniteSpeechForConditionalGeneration` | Granite Speech | T + A | `ibm-granite/granite-speech-3.3-8b` | ✅︎ | ✅︎ |
+| `HCXVisionForCausalLM` | HyperCLOVAX-SEED-Vision-Instruct-3B | T + I<sup>+</sup> + V<sup>+</sup> | `naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B` | | |
 | `H2OVLChatModel` | H2OVL | T + I<sup>E+</sup> | `h2oai/h2ovl-mississippi-800m`, `h2oai/h2ovl-mississippi-2b`, etc. | | ✅︎ |
 | `HunYuanVLForConditionalGeneration` | HunyuanOCR | T + I<sup>E+</sup> | `tencent/HunyuanOCR`, etc. | ✅︎ | ✅︎ |
 | `Idefics3ForConditionalGeneration` | Idefics3 | T + I | `HuggingFaceM4/Idefics3-8B-Llama3`, etc. | ✅︎ | |
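Several rows in this commit are moved rather than newly written (for example `LongcatFlashForCausalLM` and `HCXVisionForCausalLM`), which makes accidental duplicates easy to introduce. One possible sanity check, sketched here with the standard library only, extracts the architecture names from the first column of each table row and reports any that appear more than once. The helper names and the `SAMPLE` text are illustrative assumptions, not part of the vLLM repository, and the check is meant to be run on one table at a time.

```python
import re
from collections import Counter

# Matches a table row whose first cell holds one or more backticked names.
ROW = re.compile(r"^\|\s*((?:`[^`]+`\s*,?\s*)+)\|")


def architectures(markdown):
    """Collect the backticked names from the first column of each table row."""
    names = []
    for line in markdown.splitlines():
        match = ROW.match(line.strip())
        if match:
            names.extend(re.findall(r"`([^`]+)`", match.group(1)))
    return names


def duplicates(markdown):
    """Return the sorted architecture names that occur in more than one row."""
    counts = Counter(architectures(markdown))
    return sorted(name for name, n in counts.items() if n > 1)


# Illustrative input: the LongCat row is repeated on purpose.
SAMPLE = """\
| `CwmForCausalLM` | CWM | `facebook/cwm`, etc. | ✅︎ | ✅︎ |
| `LongcatFlashForCausalLM` | LongCat-Flash | `meituan-longcat/LongCat-Flash-Chat` | ✅︎ | ✅︎ |
| `ChatGLMModel`, `ChatGLMForConditionalGeneration` | ChatGLM | `zai-org/chatglm2-6b`, etc. | ✅︎ | ✅︎ |
| `LongcatFlashForCausalLM` | LongCat-Flash | `meituan-longcat/LongCat-Flash-Chat` | ✅︎ | ✅︎ |
"""

if __name__ == "__main__":
    print(duplicates(SAMPLE))
```

Only the first column is inspected, so example model IDs in later columns (which may legitimately repeat) do not trigger a report.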