large update including model parallelism and gpt2
Co-authored-by: shoeybi <shoeybim@gmail.com>
Co-authored-by: raulpuric <raulpuric@berkeley.edu>
Co-authored-by: jaredcasper <jaredcasper@gmail.com>
Co-authored-by: mpatwary <mostofa.patwary@gmail.com>
Co-authored-by: plegresl <plegresl@gmail.com>
mpu/tests/test_layers.py
0 → 100644
mpu/tests/test_random.py
0 → 100644
mpu/transformer.py
0 → 100644
mpu/utils.py
0 → 100644
openwebtext/README.md
0 → 100644
openwebtext/merge_jsons.py
0 → 100644
optim/__init__.py → openwebtext/tokenizer.py
100755 → 100644
optim/adam.py
deleted
100755 → 0
pretrain_gpt2.py
0 → 100755
scripts/generate_text.sh
0 → 100755