1. 09 Dec, 2019 1 commit
    • R茅mi Louf's avatar
      create encoder attention mask from shape of hidden states · 3520be78
      R茅mi Louf authored
      We currently create encoder attention masks (when they're not provided)
      based on the shape of the inputs to the encoder. This is obviously
      wrong; sequences can be of different lengths. We now create the encoder
      attention mask based on the batch_size and sequence_length of the
      encoder hidden states.
      3520be78
  2. 07 Dec, 2019 1 commit
  3. 06 Dec, 2019 7 commits
  4. 05 Dec, 2019 19 commits
  5. 04 Dec, 2019 9 commits
  6. 03 Dec, 2019 3 commits