Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
Kimi-K2.5_vllm
Commits
0715629e
Commit
0715629e
authored
Mar 19, 2026
by
luopl
Browse files
fix_tp16
parent
3374aaf6
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
2 deletions
+3
-2
README.md
README.md
+3
-2
No files found.
README.md
View file @
0715629e
...
...
@@ -120,8 +120,9 @@ ray start --address='x.x.x.x:6379' --num-gpus=8 --num-cpus=32
## serve启动
vllm serve moonshotai/Kimi-K2.5
\
-tp
32
\
-tp
16
\
--distributed-executor-backend
ray
\
--gpu-memory-utilization
0.95
\
--mm-encoder-tp-mode
data
\
--trust-remote-code
\
--tool-call-parser
kimi_k2
\
...
...
@@ -150,7 +151,7 @@ DCU与GPU精度一致,推理框架:vllm。
## 预训练权重
| 模型名称 | 权重大小 | DCU型号 | 最低卡数需求 | 下载地址 |
|:------:|:----:|:----------:|:------:|:---------------------:|
| Kimi-K2.5 | 1.1T | BW1000 |
32
|
[
Hugging Face
](
https://huggingface.co/moonshotai/Kimi-K2.5
)
|
| Kimi-K2.5 | 1.1T | BW1000 |
16
|
[
Hugging Face
](
https://huggingface.co/moonshotai/Kimi-K2.5
)
|
## 源码仓库及问题反馈
-
https://developer.sourcefind.cn/codes/modelzoo/kimi-k2.5_vllm
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment