Commits · 98584dd0dffa4214db72dd09f09b6dc03e30ed07 · OpenDAS / FastMoE

30 Jul, 2021 1 commit
- update benchmark accordingly · 98584dd0
  Rick Ho authored Jul 30, 2021
  
  98584dd0
20 Jul, 2021 2 commits
- Merge pull request #64 from laekov/fp16_bal_loss · bba5f289
  Rick Ho authored Jul 20, 2021
```
fix fp16 training with balance loss
```
  bba5f289
- fix fp16 training with balance loss · ebefe2b1
  Jiezhong Qiu authored Jul 20, 2021
```
https://github.com/laekov/fastmoe/issues/63
```
  ebefe2b1
08 Jul, 2021 4 commits
- Merge pull request #60 from laekov/remove-comm · 537679a8
  Rick Ho authored Jul 08, 2021
```
Remove unnecessary dependencies on comm
```
  537679a8
- remove unnecessary dependencies on comm · c8483d42
  Rick Ho authored Jul 08, 2021
  
  c8483d42
- Merge pull request #59 from laekov/cope-with-pipeline · 50a9aa94
  Rick Ho authored Jul 08, 2021
```
Use moe_group instead of world for MoE
```
  50a9aa94
- resolve loss reduction with customized gates · 59913cca
  Rick Ho authored Jul 08, 2021
  
  59913cca
07 Jul, 2021 1 commit
- add moe_comm · 18a4395c
  Rick Ho authored Jul 08, 2021
  
  18a4395c
30 Jun, 2021 2 commits
- Merge pull request #57 from ymjiang/patch-1 · 55f8ca7d
  Rick Ho authored Jun 30, 2021
```
Fix typo in readme
```
  55f8ca7d
- Fix typo in readme · a4119185
  Yimin Jiang authored Jun 30, 2021
  
  a4119185
29 Jun, 2021 1 commit
- Merge pull request #56 from laekov/older-pytorch-compatibility · 11369d67
  Jiezhong Qiu authored Jun 29, 2021
```
Fix pytorch compatibility issue !55
```
  11369d67
28 Jun, 2021 2 commits
- acro bug fix · 9170835c
  Rich Ho authored Jun 28, 2021
  
  9170835c
- fix pytorch compatibility issue !55 · 17e2a5c5
  Rick Ho authored Jun 28, 2021
  
  17e2a5c5
18 Jun, 2021 1 commit
- Added default weight initializations to FMoELinear and NoisyGate (#52) · 7d41fe88
  Tiago Antunes authored Jun 18, 2021
```
* Added default weight initializations to FMoELinear and NoisyGate

* Following torch's naming convention
```
  7d41fe88
17 Jun, 2021 5 commits
- Merge pull request #51 from laekov/fix-expert-exchange · ec2d458d
  Rick Ho authored Jun 17, 2021
```
use single variable for returned value
```
  ec2d458d
- use single variable for returned value · 318513ae
  Jiezhong Qiu authored Jun 17, 2021
```
the old impl raised error "too many values to unpack (expected 1)"
```
  318513ae
- Merge pull request #50 from laekov/fix-balance-loss · 28bfe689
  Jiezhong Qiu authored Jun 17, 2021
```
Fix grad of balance loss
```
  28bfe689
- fix concat shape · a12ad553
  Rick Ho authored Jun 17, 2021
  
  a12ad553
- use cat instead of creating new tensor · 913d7127
  Rick Ho authored Jun 17, 2021
  
  913d7127
16 Jun, 2021 1 commit

Improve efficiency of metadata exchange (#48) · 295a615a

Rick Ho authored Jun 16, 2021

* use single variable instead of vector in c functions

* expert count kernel

* remove all lists

* fix old tests

295a615a

09 Jun, 2021 2 commits
- Merge pull request #45 from laekov/fix_stream_column_reduce · b861e928
  Rick Ho authored Jun 09, 2021
```
Fixed asynchronous streams in column reduce kernel call
```
  b861e928
- Fixed asynchronous streams in column reduce kernel call · 2126b59a
  TiagoMAntunes authored Jun 09, 2021
  
  2126b59a
31 May, 2021 7 commits
- Merge pull request #42 from laekov/v0.2.0-pre-release · c96f8863
  Rick Ho authored May 31, 2021
```
Checkout version number to V0.2.0
```
  c96f8863
- update document for megatron · 411e57f5
  Rick Ho authored May 31, 2021
  
  411e57f5
- update version number · 7f15d11d
  Rick Ho authored May 31, 2021
  
  7f15d11d
- Merge pull request #41 from laekov/hidden-hidden-size-arg · d205aaeb
  Rick Ho authored May 31, 2021
```
Add hidden hidden size args
```
  d205aaeb
- Add hidden hidden size args · f9ce8e09
  Sengxian authored May 31, 2021
  
  f9ce8e09
- Merge pull request #40 from laekov/new-gate-patch · 4d59a9db
  Rick Ho authored May 31, 2021
```
Adapt balance loss for new gate interface & update patch
```
  4d59a9db
- Adapt balance loss for new gate interface & update patch · 4eec9807
  Sengxian authored May 31, 2021
  
  4eec9807
30 May, 2021 1 commit
- Merge pull request #39 from laekov/gshard-gate-bugfix · c77f676d
  Rick Ho authored May 30, 2021
```
Fix bugs to run megatron with gshard gate
```
  c77f676d
29 May, 2021 1 commit
- fix bugs to run megatron with gshard gate · fa5f45f0
  Rick Ho authored May 29, 2021
  
  fa5f45f0
24 May, 2021 6 commits
- Merge pull request #38 from GODVIX/gates · 7f6463f0
  Rick Ho authored May 24, 2021
```
Update test_gates.py
```
  7f6463f0
- Update test_gates.py · 26cc37cb
  GODVIX authored May 24, 2021
  
  26cc37cb
- Merge pull request #37 from laekov/gshard-random-routing · ddaac5eb
  Rick Ho authored May 24, 2021
```
Add random routing in gshard gate
```
  ddaac5eb
- add random routing in gshard gate · e58e7b3b
  Rich Ho authored May 24, 2021
  
  e58e7b3b
- Merge pull request #36 from Co1lin/master · ba2b7aa9
  Rick Ho authored May 24, 2021
```
mask and experts list
```
  ba2b7aa9
- mask and experts list (#2) · ff7333c7
  Colin authored May 24, 2021
```
- mask some tensors of tokens for fmoe forward
- pass a list of expert classes to specify what experts in what order want to use
```
  ff7333c7
23 May, 2021 1 commit
- mask and experts list · 28ba2d28
  Colin authored May 23, 2021
  
  28ba2d28
21 May, 2021 2 commits
- Merge pull request #31 from laekov/gate · baae8fb9
  Rick Ho authored May 22, 2021
```
Reconstruct gate and add gshard / switch
```
  baae8fb9
- update accoding to comments · 8d14dd29
  Rich Ho authored May 21, 2021
  
  8d14dd29