- 08 Nov, 2021 1 commit
-
-
Rick Ho authored
-
- 26 Oct, 2021 2 commits
- 10 Oct, 2021 2 commits
- 08 Jul, 2021 2 commits
- 07 Jul, 2021 1 commit
-
-
Rick Ho authored
-
- 18 Jun, 2021 1 commit
-
-
Tiago Antunes authored
* Added default weight initializations to FMoELinear and NoisyGate
* Following torch's naming convention
-
- 24 May, 2021 1 commit
-
-
Colin authored
- Mask some token tensors in the fmoe forward pass
- Pass a list of expert classes to specify which experts to use and in what order
-
- 23 May, 2021 1 commit
-
-
Colin authored
-
- 19 May, 2021 2 commits
- 13 May, 2021 1 commit
-
-
Rick Ho authored
-
- 27 Apr, 2021 1 commit
-
-
Rick Ho authored
-
- 26 Apr, 2021 1 commit
-
-
Rick Ho authored
-
- 23 Mar, 2021 2 commits
-
-
TiagoMAntunes authored
Bias is now calculated directly in the MOELinear layer. Added the corresponding CUDA changes and updated the forward and backward functions of MOELinear.
-
TiagoMAntunes authored
-
- 22 Mar, 2021 1 commit
-
-
Sengxian authored
-
- 13 Mar, 2021 1 commit
-
-
Rick Ho authored
-
- 09 Mar, 2021 1 commit
-
-
Sengxian authored
-
- 26 Feb, 2021 1 commit
-
-
Rick Ho authored
-
- 25 Feb, 2021 4 commits
-
-
Jiezhong Qiu authored
i.e., expand bias using torch.repeat_interleave directly
-
Jiezhong Qiu authored
-
Jiezhong Qiu authored
-
Jiezhong Qiu authored
-
- 24 Feb, 2021 1 commit
-
-
Rick Ho authored
-
- 23 Feb, 2021 4 commits
- 22 Feb, 2021 1 commit
-
-
Rick Ho authored
-
- 21 Feb, 2021 4 commits
-
-
Jiezhong Qiu authored
-
Rick Ho authored
-
Jiezhong Qiu authored
-
Jiezhong Qiu authored
-
- 20 Feb, 2021 3 commits
-
-
Jiezhong Qiu authored
-
Jiezhong Qiu authored
-
Jiezhong Qiu authored
-
- 08 Feb, 2021 1 commit
-
-
Sengxian authored
-