Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
Qwen3.5_vllm
Commits
1f7b7ce3
Commit
1f7b7ce3
authored
Feb 16, 2026
by
luopl
Browse files
upload whl
parent
a0dcaa80
Changes
4
Hide whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
2 additions
and
5 deletions
+2
-5
README.md
README.md
+1
-4
flash_mla-1.0.0-cp310-cp310-manylinux_2_28_x86_64.whl
flash_mla-1.0.0-cp310-cp310-manylinux_2_28_x86_64.whl
+0
-0
model.properties
model.properties
+1
-1
vllm-0.15.1+das.opt1.alpha.dtk2604-cp310-cp310-linux_x86_64.whl
....15.1+das.opt1.alpha.dtk2604-cp310-cp310-linux_x86_64.whl
+0
-0
No files found.
README.md
View file @
1f7b7ce3
...
...
@@ -49,7 +49,7 @@ docker load -i pytorch2.9.1-ubuntu22.04-dtk26.04-0130-py3.10-20260204.tar.gz
-
挂载地址
`-v`
根据实际模型情况修改
```
bash
docker run
-it
\
--shm-size
6
0g
\
--shm-size
20
0g
\
--network
=
host
\
--name
qwen3.5
\
--privileged
\
...
...
@@ -110,9 +110,6 @@ export HIP_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
export
VLLM_SPEC_DECODE_EAGER
=
1
export
VLLM_MLA_DISABLE
=
0
export
VLLM_USE_FLASH_MLA
=
1
# K100_AI集群建议额外设置的环境变量:
export
VLLM_ENFORCE_EAGER_BS_THRESHOLD
=
44
export
VLLM_RPC_TIMEOUT
=
1800000
# 海光CPU绑定核
...
...
flash_mla-1.0.0-cp310-cp310-manylinux_2_28_x86_64.whl
0 → 100644
View file @
1f7b7ce3
File added
model.properties
View file @
1f7b7ce3
...
...
@@ -11,4 +11,4 @@ appCategory=对话问答
# 框架类型
frameType
=
vllm
# 加速卡类型
accelerateType
=
K100AI,BW1000
\ No newline at end of file
accelerateType
=
BW1000
\ No newline at end of file
vllm-0.15.1+das.opt1.alpha.dtk2604-cp310-cp310-linux_x86_64.whl
0 → 100644
View file @
1f7b7ce3
File added
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment