Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
Qwen3-Omni_vllm
Commits
3e1bb3cb
Commit
3e1bb3cb
authored
Apr 02, 2026
by
raojy
💬
Browse files
Update README.md
parent
3df6e522
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
4 deletions
+3
-4
README.md
README.md
+3
-4
No files found.
README.md
View file @
3e1bb3cb
...
...
@@ -64,7 +64,7 @@ pip install numpy==1.26.1
```
bash
## serve启动
vllm serve
/public/home/raojy/project/model_code
/Qwen3-Omni-30B-A3B-Instruct
\
vllm serve
Qwen
/Qwen3-Omni-30B-A3B-Instruct
\
--trust-remote-code
\
--tensor-parallel-size
4
\
--dtype
bfloat16
...
...
@@ -74,7 +74,7 @@ curl http://localhost:8000/v1/chat/completions \
-H
"Content-Type: application/json"
\
-H
"Authorization: Bearer EMPTY"
\
-d
'{
"model": "
/public/home/raojy/project/model_code
/Qwen3-Omni-30B-A3B-Instruct",
"model": "
Qwen
/Qwen3-Omni-30B-A3B-Instruct",
"messages": [
{
"role": "user",
...
...
@@ -110,8 +110,7 @@ DCU与GPU精度一致,推理框架:vllm。
## 预训练权重
| 模型名称 | 权重大小 | DCU型号 | 最低卡数需求 | 下载地址 |
|:------:|:----:|:----------:|:------:|:---------------------:|
|
Qwen3-Omni-30B-A3B-Instruct | 30B | BW1000 | 2 |
[
Hugging Face
](
https://huggingface.co/Qwen/Qwen3-Omni-30B-A3B-Instruct
)
|
| Qwen3-Omni-30B-A3B-Instruct | 30B | BW1000 | 2 |
[
Hugging Face
](
https://huggingface.co/Qwen/Qwen3-Omni-30B-A3B-Instruct
)
|
## 源码仓库及问题反馈
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment