[`NllbMoe`] Update code to properly support loss computation (#25429)
* update nllb_moe
* fix
* doc nits
* nits
* add a small test
* fixup
* remove adapted from