Merge pull request #1 from xptree/laekov/multigpu
Faster MoE implementation for both single GPU and multiple GPUs
Showing
This diff is collapsed.
pytorch/cuda/moe_function.py
0 → 100644
Please register or sign in to comment