Added default weight initializations to FMoELinear and NoisyGate (#52)
* Added default weight initializations to FMoELinear and NoisyGate * Following torch's naming convention
Showing
Please register or sign in to comment
* Added default weight initializations to FMoELinear and NoisyGate * Following torch's naming convention