Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
ModelZoo
Gemma-4
Commits
d97afa9c
Commit
d97afa9c
authored
May 28, 2026
by
weishb
Browse files
更新readme模板
parent
ec571323
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
10 additions
and
7 deletions
+10
-7
README.md
README.md
+9
-6
model.properties
model.properties
+1
-1
No files found.
README.md
View file @
d97afa9c
...
...
@@ -56,6 +56,13 @@ docker run -it \
关于本项目DCU显卡所需的特殊深度学习库可从
[
光合
](
https://developer.sourcefind.cn/tool/
)
开发者社区下载安装。
## 预训练权重
**请根据`支持的DCU型号`选择对应模型下载,FP8模型仅在BW1100/BW1101上支持,其他型号请勿使用!**
| 模型名称 | 权重大小 | DCU型号 | 最低卡数需求 | 下载地址 |
|:------:|:----:|:------:|:------:|:---------------------:|
| gemma-4-31B-it | 31B | BW1000 | 2 |
[
Hugging Face
](
https://huggingface.co/google/gemma-4-31B-it
)
|
## 数据集
暂无
...
...
@@ -64,7 +71,7 @@ docker run -it \
暂无
## 推理
### v
llm
### v
LLM
#### 单机推理
```
bash
...
...
@@ -99,13 +106,9 @@ curl http://localhost:8000/v1/chat/completions \
### 精度
DCU与GPU精度一致,推理框架:vllm。
## 预训练权重
| 模型名称 | 权重大小 | DCU型号 | 最低卡数需求 | 下载地址 |
|:------:|:----:|:------:|:------:|:---------------------:|
| gemma-4-31B-it | 31B | BW1000 | 2 |
[
Hugging Face
](
https://huggingface.co/google/gemma-4-31B-it
)
|
## 源码仓库及问题反馈
-
https://developer.sourcefind.cn/codes/modelzoo/gemma-4
_vllm
-
https://developer.sourcefind.cn/codes/modelzoo/gemma-4
## 参考资料
-
https://github.com/google-gemma
model.properties
View file @
d97afa9c
# 模型唯一标识
modelCode
=
2384
# 模型名称
modelName
=
Gemma-4
_vllm
modelName
=
Gemma-4
# 模型描述
modelDescription
=
Gemma 4 模型旨在在每个尺寸级别上都达到前沿性能,覆盖从移动和边缘设备(E2B、E4B)到消费级 GPU 和工作站(26B A4B、31B)的多种部署场景。
# 运行过程
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment