updata readme

a9ee04b7 · shihm · 26aebea5 · a9ee04b7 · a9ee04b7 · 26aebea5
Commit a9ee04b7 authored Mar 09, 2026 by shihm
Expand all Hide whitespace changes
Inline Side-by-side

Showing with 4 additions and 5 deletions

README.md README.md +2 -2

model.properties model.properties +2 -2

vocab.json vocab.json +0 -1

No files found.
--- a/README.md
+++ b/README.md
@@ -114,7 +114,7 @@ ray start --address='x.x.x.x:6379' --num-gpus=8 --num-cpus=32
 ```
 启动vllm server
 ```bash
-vllm serve /path/to/Baichuan-M3-235B 
+vllm serve /baichuan-inc/Baichuan-M3-235B 
    --host x.x.x.x --port 8000  
    --distributed-executor-backend ray 
    --tensor-parallel-size 16 
@@ -183,7 +183,7 @@ print(response)


 ### 精度
-`DCU与GPU精度一致，推理框架：vllm,transformer`
+`DCU与GPU精度一致，推理框架：vllm,transformers`

 ## 预训练权重
 | 模型名称  | 权重大小  | DCU型号  | 最低卡数需求 |下载地址|

--- a/model.properties
+++ b/model.properties
 # 模型唯一标识
 modelCode=2152
 # 模型名称
-modelName=Baichuan-M3_vllm
+modelName=Baichuan-M3_pytorch
 # 模型描述
 modelDescription=Baichuan-M3 是百川智能推出的全新一代医疗增强大语言模型，是继 Baichuan-M2 之后的重要里程碑。
 # 运行过程
@@ -9,6 +9,6 @@ processType=推理
 # 算法类别
 appCategory=对话问答
 # 框架类型
-frameType=vllm
+frameType=vllm,transformers
 # 加速卡类型
 accelerateType=BW1000
\ No newline at end of file
--- a/vocab.json
+++ b/vocab.json