Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
Megatron-LM
b1714c14ce24ed045d6b0e0d85a4e8744e2e24a7
b1714c14ce24ed045d6b0e0d85a4e8744e2e24a7
Switch branch/tag
megatron-lm
megatron
data
__init__.py
Find file
Normal view
History
Permalink
__init__.py
96 Bytes
Edit
Web IDE
Newer
Older
Initial commit of multiprocess preprocess and extracted copy of fairseq's indexed_dataset.
Jared Casper
committed
Nov 07, 2019
1
2
from
.
import
indexed_dataset
from
.bert_tokenization
import
FullTokenizer
as
FullBertTokenizer