OpenDAS / Megatron-LM, commit 9350ee08836cc7ddfa28206ecfb6110e0a1fcaf9
megatron/data/__init__.py
__init__.py
32 Bytes
Initial commit of multiprocess preprocess and extracted copy of fairseq's indexed_dataset.
Jared Casper committed Nov 07, 2019
from . import indexed_dataset
tokenizer moved to its own directory
Mohammad committed Mar 28, 2020
moved albert to bert
Mohammad Shoeybi committed Mar 16, 2020