- mask some tensors of tokens for fmoe forward - pass a list of expert classes to specify what experts in what order want to use
Attach a file by drag & drop or click to upload