Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
MLPerf_BERT_paddle
Commits
7e842664
Commit
7e842664
authored
Apr 15, 2025
by
chenzk
Browse files
Update url.md
parent
a8fedcdf
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
1 deletion
+1
-1
README.md
README.md
+1
-1
No files found.
README.md
View file @
7e842664
...
...
@@ -42,7 +42,7 @@ BERT用大量的无监督文本通过自监督训练的方式训练,把文本
## 数据集
模型训练的数据集来自
[
Wikipedia
](
http://
113.200.138.88:18080/aidatasets
/wikipedia
)
,即一种常用的自然语言处理数据集,它包含了维基百科上的文章和对应的摘要(即第一段内容),可用于各种文本相关的任务,例如文本分类、文本摘要、命名实体识别等。
模型训练的数据集来自
[
Wikipedia
](
http
s
://
huggingface.co/datasets/wikimedia
/wikipedia
)
,即一种常用的自然语言处理数据集,它包含了维基百科上的文章和对应的摘要(即第一段内容),可用于各种文本相关的任务,例如文本分类、文本摘要、命名实体识别等。
下载+预处理数据可按照下述进行,最终获得的输入数据如下图所示:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment