Commit e6222c2f authored by raojy's avatar raojy 💬
Browse files

Update README.md

parent fd7c029d
......@@ -18,7 +18,7 @@ Qwen3-Omni 是一款原生的端到端全模态基座模型,具备对文本、
| DTK | 26.04 |
| python | 3.10.12 |
| transformers | 4.57.6 |
| vllm | 0.15.1+das.opt1.alpha.dtk2604.20260220.g2799735a |
| vllm | 0.15.1+das.opt1.alpha.dtk2604 |
| torch | 2.9.0+das.opt1.dtk2604.20260206.g275d08c2 |
推荐使用镜像: harbor.sourcefind.cn:5443/dcu/admin/base/custom:vllm0.15.1-ubuntu22.04-dtk26.04-0130-py3.10-20260220
......@@ -66,7 +66,7 @@ pip install numpy==1.26.1
vllm serve Qwen/Qwen3-Omni-30B-A3B-Instruct \
--trust-remote-code \
--tensor-parallel-size 2 \
--tensor-parallel-size 4 \
--dtype bfloat16
## client访问
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment