Faster masking in MultiheadAttention
Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/612 Differential Revision: D15541377 Pulled By: myleott fbshipit-source-id: 4762516a3b545d03bc81d3660f47827e15466dce
Showing
Please register or sign in to comment