Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
Baichuan-M3_pytorch
Commits
a9ee04b7
Commit
a9ee04b7
authored
Mar 09, 2026
by
shihm
Browse files
updata readme
parent
26aebea5
Changes
3
Expand all
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
4 additions
and
5 deletions
+4
-5
README.md
README.md
+2
-2
model.properties
model.properties
+2
-2
vocab.json
vocab.json
+0
-1
No files found.
README.md
View file @
a9ee04b7
...
...
@@ -114,7 +114,7 @@ ray start --address='x.x.x.x:6379' --num-gpus=8 --num-cpus=32
```
启动vllm server
```
bash
vllm serve /
path/to
/Baichuan-M3-235B
vllm serve /
baichuan-inc
/Baichuan-M3-235B
--host
x.x.x.x
--port
8000
--distributed-executor-backend
ray
--tensor-parallel-size
16
...
...
@@ -183,7 +183,7 @@ print(response)
### 精度
`DCU与GPU精度一致,推理框架:vllm,transformer`
`DCU与GPU精度一致,推理框架:vllm,transformer
s
`
## 预训练权重
| 模型名称 | 权重大小 | DCU型号 | 最低卡数需求 |下载地址|
...
...
model.properties
View file @
a9ee04b7
# 模型唯一标识
modelCode
=
2152
# 模型名称
modelName
=
Baichuan-M3_
vllm
modelName
=
Baichuan-M3_
pytorch
# 模型描述
modelDescription
=
Baichuan-M3 是百川智能推出的全新一代医疗增强大语言模型,是继 Baichuan-M2 之后的重要里程碑。
# 运行过程
...
...
@@ -9,6 +9,6 @@ processType=推理
# 算法类别
appCategory
=
对话问答
# 框架类型
frameType
=
vllm
frameType
=
vllm
,transformers
# 加速卡类型
accelerateType
=
BW1000
\ No newline at end of file
vocab.json
deleted
100644 → 0
View file @
26aebea5
This diff is collapsed.
Click to expand it.
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment