Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
InternLM2-Math-7B_pytorch
Commits
9fb99bfd
Commit
9fb99bfd
authored
Sep 06, 2024
by
zhougaofeng
Browse files
Update README.md
parent
c3efd318
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
8 additions
and
0 deletions
+8
-0
README.md
README.md
+8
-0
No files found.
README.md
View file @
9fb99bfd
...
@@ -5,6 +5,14 @@
...
@@ -5,6 +5,14 @@
`InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning`
`InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning`
-
[https://arxiv.org/abs/2402.06332]
-
[https://arxiv.org/abs/2402.06332]
## 模型结构
Internlm2_math在Internlm2模型上继续用约100B的高质量数学相关令牌进行预训练,并用约200万的双语数学监督数据进行SFT。Internlm2采用LLama+GQA结构,将Internlm中Wqkv矩阵堆叠排放,改进为交错重排,大概能提高5%的训练效率。
<div
align=
center
>
<img
src=
"doc/struct.png"
/>
</div>
## 算法原理
## 算法原理
InternLM-Math是基于InternLM2-Base模型进行数学预训练得到的大型语言模型。融合了链式推理、奖励建模、数据增强和形式推理等多种能力,不仅可以解决数学问题,还可以验证推理过程的正确性。竞赛级别的MATH基准测试的准确率优于更大参数量的qwen-72B、Llemma-34B等模型
InternLM-Math是基于InternLM2-Base模型进行数学预训练得到的大型语言模型。融合了链式推理、奖励建模、数据增强和形式推理等多种能力,不仅可以解决数学问题,还可以验证推理过程的正确性。竞赛级别的MATH基准测试的准确率优于更大参数量的qwen-72B、Llemma-34B等模型
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment