large update including model parallelism and gpt2
Co-authored-by: shoeybi <shoeybim@gmail.com>
Co-authored-by: raulpuric <raulpuric@berkeley.edu>
Co-authored-by: jaredcasper <jaredcasper@gmail.com>
Co-authored-by: mpatwary <mostofa.patwary@gmail.com>
Co-authored-by: plegresl <plegresl@gmail.com>
gpt2_data_loader.py
0 → 100644
model/gpt2_modeling.py
0 → 100644
mpu/__init__.py
0 → 100644
mpu/cross_entropy.py
0 → 100644
mpu/data.py
0 → 100644
mpu/grads.py
0 → 100644
mpu/initialize.py
0 → 100644
mpu/layers.py
0 → 100644
mpu/mappings.py
0 → 100644
mpu/random.py
0 → 100644
mpu/tests/__init__.py
0 → 100644
mpu/tests/commons.py
0 → 100644
mpu/tests/test_data.py
0 → 100644
mpu/tests/test_initialize.py
0 → 100644