1. 15 Oct, 2019 6 commits
  2. 14 Oct, 2019 8 commits
  3. 11 Oct, 2019 3 commits
  4. 10 Oct, 2019 13 commits
  5. 08 Oct, 2019 8 commits
  6. 07 Oct, 2019 2 commits
    • generalize BertSelfAttention to take separate query, key, value · a0dcefa3
      Rémi Louf authored
      There is currently no way to specify the query, key and value separately
      in the Attention module. However, the decoder's "encoder-decoder
      attention" layers take the decoder's last output as the query and the
      encoder's states as key and value. We thus modify the existing code so
      that query, key and value can be passed separately.
      
      This obviously raises some naming issues; `BertSelfAttention` is not
      a self-attention module anymore. The way the residual is forwarded is
      now awkward, etc. We will need to do some refactoring once the decoder
      is fully implemented.
    • add class wireframes for Bert decoder · 31adbb24
      Rémi Louf authored
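
The generalization described in commit a0dcefa3 can be illustrated with a minimal, dependency-free sketch. This is not the code from the commit: it is a single-head scaled dot-product attention with no learned projections, masking, or dropout, written in plain Python rather than PyTorch. The names `attention`, `self_attention`, `decoder_states` and `encoder_states` are hypothetical and chosen only for illustration. It shows that self-attention is the special case where query, key and value are the same sequence, while encoder-decoder attention takes the query from the decoder and the key and value from the encoder.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(query, key, value):
    """Single-head scaled dot-product attention on lists of vectors.

    query: n_q vectors of size d
    key:   n_k vectors of size d
    value: n_k vectors of size d_v
    Returns n_q vectors of size d_v.
    """
    d = len(query[0])
    d_v = len(value[0])
    output = []
    for q in query:
        # Similarity of this query position with every key position.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in key]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        output.append(
            [sum(w * v[j] for w, v in zip(weights, value)) for j in range(d_v)]
        )
    return output


def self_attention(hidden_states):
    """Self-attention: query, key and value all come from one sequence."""
    return attention(hidden_states, hidden_states, hidden_states)


# Toy inputs (assumed values, for illustration only).
decoder_states = [[1.0, 0.0], [0.0, 1.0]]
encoder_states = [[1.0, 1.0], [0.5, -0.5], [0.0, 2.0]]

# Decoder self-attention.
self_out = self_attention(decoder_states)

# Encoder-decoder attention: query from the decoder, key/value from the encoder.
cross_out = attention(decoder_states, encoder_states, encoder_states)
```

The output has one vector per query position, so `cross_out` has the length of `decoder_states` even though it attends over three encoder positions.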