Merge branch 'staging_gpt2_dataloader' into 'staging'
GPT2 dataloader using indexed dataset See merge request ADLR/megatron-lm!45
Showing
File moved
File moved
File moved
File moved
requirements.txt
deleted
100644 → 0
| nltk>=3.4 | ||
| numpy>=1.15.4 | ||
| pandas>=0.24.0 | ||
| sentencepiece>=0.1.8 | ||
| tensorflow>=1.12.0 |
tools/preprocess_data.py
0 → 100644
Please register or sign in to comment