Commit c81fed46 authored by Myle Ott, committed by Facebook Github Bot


Back out "[fairseq][PR] Fix bug (the returned value has a dimension mismatch) in label-smoothed-cross-entropy for MoE" (#837)

Summary:
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/837

Original commit changeset: a73bc03d2280

Differential Revision: D16904372

fbshipit-source-id: b4c4047b2686ba47258cdf0783059726134c920a
parent 6ce55e4b
@@ -16,9 +16,9 @@ def label_smoothed_nll_loss(lprobs, target, epsilon, ignore_index=None, reduce=True):
     nll_loss = -lprobs.gather(dim=-1, index=target)
     smooth_loss = -lprobs.sum(dim=-1, keepdim=True)
     if ignore_index is not None:
-        non_pad_mask = target.ne(ignore_index)
-        nll_loss = nll_loss[non_pad_mask]
-        smooth_loss = smooth_loss[non_pad_mask]
+        pad_mask = target.eq(ignore_index)
+        nll_loss[pad_mask] = nll_loss[pad_mask] * 0.
+        smooth_loss[pad_mask] = smooth_loss[pad_mask] * 0.
     else:
         nll_loss = nll_loss.squeeze(-1)
         smooth_loss = smooth_loss.squeeze(-1)
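For context, the substance of the revert is how padding positions are masked out of the loss: zeroing the masked entries in place preserves the tensor's shape, whereas boolean indexing (`nll_loss[non_pad_mask]`) drops entries and changes the first dimension, which downstream code may not expect. The sketch below reconstructs the full `label_smoothed_nll_loss` function around the diff hunk; the reduction and smoothing tail are not shown in the hunk and follow fairseq's usual formulation, so treat them as an assumption rather than a verbatim copy.

```python
import torch


def label_smoothed_nll_loss(lprobs, target, epsilon, ignore_index=None, reduce=True):
    """Label-smoothed NLL loss, following the post-revert masking behavior.

    lprobs: (N, V) log-probabilities; target: (N,) gold class indices.
    The reduction/smoothing tail is reconstructed from fairseq's usual
    formulation and is an assumption, not part of the diff hunk.
    """
    if target.dim() == lprobs.dim() - 1:
        target = target.unsqueeze(-1)
    nll_loss = -lprobs.gather(dim=-1, index=target)
    smooth_loss = -lprobs.sum(dim=-1, keepdim=True)
    if ignore_index is not None:
        # Zero out padded positions in place: shapes stay (N, 1), so the
        # returned per-token losses line up with the input batch.
        pad_mask = target.eq(ignore_index)
        nll_loss[pad_mask] = nll_loss[pad_mask] * 0.
        smooth_loss[pad_mask] = smooth_loss[pad_mask] * 0.
    else:
        nll_loss = nll_loss.squeeze(-1)
        smooth_loss = smooth_loss.squeeze(-1)
    if reduce:
        nll_loss = nll_loss.sum()
        smooth_loss = smooth_loss.sum()
    # Mix the gold-label NLL with the uniform smoothing term.
    eps_i = epsilon / lprobs.size(-1)
    loss = (1. - epsilon) * nll_loss + eps_i * smooth_loss
    return loss, nll_loss


if __name__ == "__main__":
    # Tiny demo: the second token is padding (index 2) and must not
    # contribute to the reduced loss.
    lprobs = torch.log_softmax(torch.tensor([[2., 0., 0.], [0., 2., 0.]]), dim=-1)
    target = torch.tensor([0, 2])
    loss, nll = label_smoothed_nll_loss(lprobs, target, 0.1, ignore_index=2)
    loss_unpadded, _ = label_smoothed_nll_loss(lprobs[:1], target[:1], 0.1, ignore_index=2)
    assert torch.allclose(loss, loss_unpadded)
```

Because the zeroed entries survive with shape `(N, 1)`, a caller that later divides by the number of non-pad tokens or reshapes per-token losses back to the batch still works, which appears to be why the shape-dropping variant was backed out.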