OpenDAS / dynamo
Commit ed939f08
Authored Mar 14, 2026 by Biswa Panda
Committed by GitHub, Mar 14, 2026
docs(recipes): add experimental WIP note to Kimi-K2.5 recipe (#7381)
Parent: 4ef8b8e6
Showing 2 changed files with 7 additions and 1 deletion
recipes/README.md: +1 -1
recipes/kimi-k2.5/README.md: +6 -0
recipes/README.md
...
@@ -42,7 +42,7 @@ These recipes demonstrate aggregated or disaggregated serving:
 | **[DeepSeek-R1](deepseek-r1/sglang/disagg-16gpu/)** | SGLang | Disagg WideEP | 32x H200 | ✅ | ❌ | TP=16, multi-node. Use `model-download-sglang.yaml` | ❌ |
 | **[DeepSeek-R1](deepseek-r1/trtllm/disagg/wide_ep/gb200/)** | TensorRT-LLM | Disagg WideEP (GB200) | 36x GB200 | ✅ | ✅ | Multi-node: 8 decode + 1 prefill nodes | ❌ |
 | **[DeepSeek-R1](deepseek-r1/)** | vLLM | Disagg DEP16 | 32x H200 | ✅ | ❌ | Multi-node, data-expert parallel | ❌ |
-| **[Kimi-K2.5](kimi-k2.5/)** | TensorRT-LLM | Aggregated | 8x B200 | ✅ | ❌ | MoE model, TP8×EP8, reasoning + tool calling | ❌ |
+| **[Kimi-K2.5](kimi-k2.5/)** 🚧 | TensorRT-LLM | Aggregated | 8x B200 | ✅ | ❌ | Experimental — MoE model, TP8×EP8, reasoning + tool calling | ❌ |
 **Legend:**
 - **Deployment**: ✅ = Complete `deploy.yaml` manifest available
...
recipes/kimi-k2.5/README.md
 # Kimi-K2.5 Recipes
+> 🚧 **Work-in-Progress — Experimental Recipe**
+>
+> The TensorRT-LLM Python package used for Dynamo's TRT-LLM integration does not yet include
+> native Kimi K2.5 support. This recipe is an **experimental** effort to bring
+> Kimi K2.5 to Dynamo ahead of upstream availability. It needs to patch the container image on top of released dynamo image.
 Deployment recipe for **Kimi-K2.5** using TensorRT-LLM with Dynamo's KV-aware routing.
 ## Available Configurations
...