* Added default weight initializations to FMoELinear and NoisyGate * Following torch's naming convention
Attach a file by drag & drop or click to upload