ModelZoo / Seed-OSS_vllm
Commit 1228ae0a
Authored Dec 05, 2025 by dengjb
update
Parent: 0a011e69
Showing 1 changed file with 5 additions and 6 deletions
README.md (+5 -6)
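@@ -35,7 +35,7 @@ Seed-OSS is a family of large language models open-sourced by ByteDance's Seed team in August 2025
 | flash_attn | 2.6.1+das.opt1.dtk2504 |
 | flash_mla | 1.0.0+das.opt1.dtk25042 |
-Currently only the following image is supported:
+Recommended image:
 - Adjust the `-v` mount path to match where your model is actually stored
 ```bash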
...
...
@@ -55,11 +55,10 @@ docker run -it --shm-size 60g --network=host --name seed_oss --privileged --devi
 ### vllm
 #### Single-node inference
-See the vllm_serve.sh script
 ```bash
-## Start the server
+## See the vllm_serve.sh script
 vllm serve /path/of/ByteDance-Seed/Seed-OSS-36B-Instruct/ \
     --trust-remote-code \
     --max-model-len 32768 \
...
...
@@ -68,7 +67,7 @@ vllm serve /path/of/ByteDance-Seed/Seed-OSS-36B-Instruct/ \
     -tp 2
-## Client access; see vllm_cilent.sh
+## See vllm_cilent.sh
 curl http://localhost:8000/v1/chat/completions \
     -H "Content-Type: application/json" \
     -d '{
...
...
@@ -93,7 +92,7 @@ Accuracy on DCU is consistent with GPU; inference framework: vllm.
 ## Pretrained weights
 | Model name | Weight size | DCU model | Minimum number of cards | Download link |
 |:-----:|:----------:|:----------:|:---------------------:|:----------:|
-| Seed-OSS-36B-Instruct | 32B | BW1000 | 2 | [Download link](https://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Instruct) |
+| Seed-OSS-36B-Instruct | 36B | BW1000 | 2 | [huggingface](https://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Instruct) |
 ## Source repository and issue feedback
 - https://developer.sourcefind.cn/codes/modelzoo/seed-oss_vllm
...
...
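The serve command shown in the diff can be wrapped in a small script that prints the command before running it and waits for the server to answer before any client call is made. The following is a minimal sketch under stated assumptions, not the repository's vllm_serve.sh: the variable names (`MODEL_PATH`, `MAX_MODEL_LEN`, `TP_SIZE`, `BASE_URL`, `DRY_RUN`) and the `wait_for_server` helper are hypothetical, and any flags elided (`...`) in the diff are deliberately left out. It uses only the flags visible above and polls `/v1/models`, which vLLM's OpenAI-compatible server exposes alongside `/v1/chat/completions`.

```bash
#!/usr/bin/env bash
# Sketch only: all variable and function names here are illustrative and
# are not taken from the repository. Flags elided in the diff are omitted.

MODEL_PATH=${MODEL_PATH:-/path/of/ByteDance-Seed/Seed-OSS-36B-Instruct/}
MAX_MODEL_LEN=${MAX_MODEL_LEN:-32768}
TP_SIZE=${TP_SIZE:-2}
BASE_URL=${BASE_URL:-http://localhost:8000}

# Keep the command in an array so a model path containing spaces stays one argument.
SERVE_CMD=(vllm serve "$MODEL_PATH"
  --trust-remote-code
  --max-model-len "$MAX_MODEL_LEN"
  -tp "$TP_SIZE")

# Poll the model listing endpoint until the server responds.
# $1 = maximum number of attempts, $2 = seconds to sleep between attempts.
wait_for_server() {
  local attempts=${1:-60} delay=${2:-5} i
  for ((i = 0; i < attempts; i++)); do
    if curl -sf -o /dev/null "$BASE_URL/v1/models"; then
      return 0
    fi
    sleep "$delay"
  done
  return 1
}

if [[ ${DRY_RUN:-1} == 1 ]]; then
  # Default: only print the command that would be run.
  printf '%s\n' "${SERVE_CMD[*]}"
else
  # Start the server in the background, wait until it answers, then stay attached.
  "${SERVE_CMD[@]}" &
  wait_for_server || { echo "server did not become ready" >&2; exit 1; }
  echo "server is ready at $BASE_URL"
  wait
fi
```

With the defaults, the script only prints the command. Setting `DRY_RUN=0` starts the server and blocks until `/v1/models` responds, at which point the curl request from the README can be sent. Note that unless `--served-model-name` is passed, vLLM serves the model under the path given to `vllm serve`, so the `model` field in the client request would normally be that same path.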