Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
Megatron-LM
29d10a3652de6ff6f25e04e09e04c65c7abd1277
29d10a3652de6ff6f25e04e09e04c65c7abd1277
Switch branch/tag
megatron-lm
megatron
data
__init__.py
Find file
Normal view
History
Permalink
__init__.py
97 Bytes
Edit
Web IDE
Newer
Older
Initial commit of multiprocess preprocess and extracted copy of fairseq's indexed_dataset.
Jared Casper
committed
Nov 07, 2019
1
2
from
.
import
indexed_dataset
from
.bert_tokenization
import
FullTokenizer
as
FullBertTokenizer
moved albert to bert
Mohammad Shoeybi
committed
Mar 16, 2020
3