Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
Qwen3-VL_pytorch
Commits
2f531453
Commit
2f531453
authored
Feb 09, 2026
by
raojy
Browse files
updata
parent
06f870c9
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
7 deletions
+2
-7
README.md
README.md
+2
-7
No files found.
README.md
View file @
2f531453
...
...
@@ -38,9 +38,6 @@ Visual Coding Boost:从图像/视频生成 Draw.io/HTML/CSS/JS。
| flash_attn | 2.6.1+das.opt1.dtk2504 |
| av | 16.0.1 |
| vllm | 0.11.0+das.opt1.alpha.dtk25042.20251225.gca4598a4 |
## 硬件需求
DCU型号:K100AI,节点数量:2台,卡数:16 张。
推荐使用镜像:harbor.sourcefind.cn:5443/dcu/admin/base/vllm:0.11.0-ubuntu22.04-dtk25.04.2-1226-das1.7-py3.10-20251226
...
...
@@ -123,7 +120,7 @@ curl http://localhost:8000/v1/chat/completions \
}'
```
### 多机
多卡
推理
### 多机推理
样例模型:
[
Qwen3-VL-235B-A22B-Thinking
](
https://huggingface.co/Qwen/Qwen3-VL-235B-A22B-Thinking
)
1.
加入环境变量
...
...
@@ -245,12 +242,10 @@ Output:
### 精度
`DCU与GPU精度一致,
支持
推理框架:transformers、vllm。`
`DCU与GPU精度一致,推理框架:transformers、vllm。`
## 预训练权重
## Qwen3-VL 全系列模型清单
|
**模型名称**
|
**权重大小**
|
**最低卡数需求 (K100AI)**
|
**下载地址 (Hugging Face)**
|
| ------------------------------- | ------------ | ------------------------- | ------------------------------------------------------------ |
|
**Qwen3-VL-2B-Instruct**
| 2B | 1 |
[
Qwen3-VL-2B-Instruct
](
https://huggingface.co/Qwen/Qwen3-VL-2B-Instruct
)
|
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment