Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
Baichuan-M3_pytorch
Commits
61f51a8e
Commit
61f51a8e
authored
Mar 13, 2026
by
shihm
Browse files
readme
parent
b80e3d36
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
8 additions
and
7 deletions
+8
-7
README.md
README.md
+8
-7
No files found.
README.md
View file @
61f51a8e
# Baichuan-M3
## 论文
[
Modeling Clinical Inquiry for Reliable Medical Decision-Making
](
https://arxiv.org/abs/2602.06570
)
[
Baichuan-M3:
Modeling Clinical Inquiry for Reliable Medical Decision-Making
](
https://arxiv.org/abs/2602.06570
)
## 模型简介
Baichuan-M3 是百川智能推出的全新一代医疗增强大语言模型,是继 Baichuan-M2 之后的重要里程碑。
...
...
@@ -61,6 +61,7 @@ docker run -it \
```
bash
python inference.py
```
Transformers推理不支持Baichuan-M3-235B-GPTQ-INT4模型
### vllm
...
...
@@ -68,12 +69,12 @@ python inference.py
启动vllm server
```
bash
vllm serve
/
baichuan-inc/Baichuan-M3-235B
--reasoning-parser
qwen3
--tensor-parallel-size
8
--trust-remote-code
--port
8000
--gpu-memory-utilization
0.95
vllm serve baichuan-inc/Baichuan-M3-235B
\
--reasoning-parser
qwen3
\
--tensor-parallel-size
8
\
--trust-remote-code
\
--port
8000
\
--gpu-memory-utilization
0.95
\
--served-model-name
baichuan-m3
```
启动完成后可通过以下方式访问:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment