Unverified Commit f4ae7c85 authored by wang jiahao's avatar wang jiahao Committed by GitHub
Browse files

Merge pull request #1069 from kvcache-ai/qiyuxinlin-patch-4

Update balance-serve.md
parents 77c6cc82 2fcdbee7
......@@ -112,7 +112,7 @@ Use our optimized configuration for constrained VRAM:
```bash
python ktransformers/server/main.py \
--port 10002
--port 10002 \
--model_path <path_to_safetensor_config> \
--gguf_path <path_to_gguf_files> \
--optimize_config_path ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-serve.yaml \
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment