Merge pull request #81 from laekov/realmp
Model parallelism of single experts
fmoe/linear.py
0 → 100644
fmoe/megatron/patch.py
0 → 100644