Commit f4ae7c85 authored by wang jiahao, committed by GitHub

Merge pull request #1069 from kvcache-ai/qiyuxinlin-patch-4

Update balance-serve.md
parents 77c6cc82 2fcdbee7
@@ -112,7 +112,7 @@ Use our optimized configuration for constrained VRAM:
  ```bash
  python ktransformers/server/main.py \
- --port 10002
+ --port 10002 \
  --model_path <path_to_safetensor_config> \
  --gguf_path <path_to_gguf_files> \
  --optimize_config_path ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-serve.yaml \
......
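
The fix in this hunk is the trailing backslash after `--port 10002`. Without it, the shell ends the command at that line, so the remaining flags never reach `main.py` and are instead parsed as separate commands. A minimal sketch of the difference, where `count_args` is a hypothetical helper used only for illustration and is not part of ktransformers:

```shell
# Hypothetical helper: prints how many arguments it received.
count_args() { echo "$#"; }

# With the trailing backslash, the next line continues the same command,
# so all four words arrive as arguments.
with_backslash=$(count_args --port 10002 \
  --model_path demo)

# Without it, the command ends after "10002"; the next line runs as a
# separate command (here a harmless `true`).
without_backslash=$(count_args --port 10002
  true)

echo "with backslash: $with_backslash, without: $without_backslash"
```

In the documented server command, the second case would start the server with only `--port` set and then try to execute `--model_path` as a command of its own.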