1. 14 May, 2021 1 commit
    • Update arguments checks. · 8044c7b4
      Jared Casper authored
      hidden_size % attention_heads == 0 is handled above when dealing with kv_channels.
      
      Adding check for decoder sequence length.
  2. 21 Apr, 2021 3 commits
  3. 19 Apr, 2021 1 commit
  4. 16 Apr, 2021 2 commits
  5. 08 Apr, 2021 1 commit
  6. 03 Apr, 2021 1 commit
  7. 02 Apr, 2021 1 commit
  8. 31 Mar, 2021 1 commit
  9. 24 Mar, 2021 3 commits
  10. 20 Mar, 2021 1 commit
  11. 19 Mar, 2021 2 commits
  12. 18 Mar, 2021 1 commit
  13. 17 Mar, 2021 1 commit
  14. 16 Mar, 2021 2 commits
  15. 08 Mar, 2021 1 commit
  16. 04 Mar, 2021 3 commits
  17. 03 Mar, 2021 3 commits
  18. 26 Feb, 2021 2 commits
  19. 25 Feb, 2021 1 commit
  20. 23 Feb, 2021 3 commits
  21. 22 Feb, 2021 1 commit
  22. 18 Feb, 2021 4 commits
  23. 17 Feb, 2021 1 commit