Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
Megatron-LM
1237533eb5680b358589305aed68b2fd0d9982a8
1237533eb5680b358589305aed68b2fd0d9982a8
Switch branch/tag
megatron-lm
megatron
data
__init__.py
Find file
Normal view
History
Permalink
__init__.py
96 Bytes
Edit
Web IDE
Newer
Older
Initial commit of multiprocess preprocess and extracted copy of fairseq's indexed_dataset.
Jared Casper
committed
Nov 07, 2019
1
2
from
.
import
indexed_dataset
from
.bert_tokenization
import
FullTokenizer
as
FullBertTokenizer