1. 14 Sep, 2023 1 commit
  2. 17 Aug, 2023 1 commit
    • Arthur's avatar
      [`resize_embedding`] Introduce `pad_to_multiple_of` and guidance (#25088) · d6bf08f7
      Arthur authored
      * fix
      
      * revert cahnges and update resizing of embedding layer
      
      * use wraning
      
      * fixup
      
      * more styling nits
      
      * fix all tests that overload the embedding tests
      
      * 👀👀 remove breakpoint
      
      * remove useless overload + overload correctly where needed
      
      * resize lm head with new vocab size
      
      * reverse not necessary changes
      
      * style
      
      * fix CIs!
      
      * fix last CI tests, adapt bark and Marian
      
      * fixup
      d6bf08f7
  3. 08 Aug, 2023 1 commit
  4. 07 Aug, 2023 1 commit
  5. 04 Aug, 2023 1 commit
  6. 02 Aug, 2023 1 commit
  7. 25 Jul, 2023 2 commits
  8. 24 Jul, 2023 2 commits
  9. 14 Jul, 2023 1 commit
  10. 28 Jun, 2023 1 commit
  11. 27 Jun, 2023 1 commit
    • Sylvain Gugger's avatar
      Clean load keys (#24505) · 8e5d1619
      Sylvain Gugger authored
      * Preliminary work on some models
      
      * Fix test load missing and make sure nonpersistent buffers are tested
      
      * Always ignore nonpersistent buffers if in state_dict
      
      * Treat models
      
      * More models
      
      * Treat remaining models
      
      * Fix quality
      
      * Fix tests
      
      * Remove draft
      
      * This test is not needed anymore
      
      * Fix copies
      
      * Fix last test
      
      * Newly added models
      
      * Fix last tests
      
      * Address review comments
      8e5d1619
  12. 22 Jun, 2023 1 commit
  13. 21 Jun, 2023 1 commit
  14. 15 Jun, 2023 1 commit
  15. 13 Jun, 2023 1 commit
    • Sylvain Gugger's avatar
      Tied params cleanup (#24211) · 695928e1
      Sylvain Gugger authored
      * First test
      
      * Add info for all models
      
      * style
      
      * Repo consistency
      
      * Fix last model and cleanup prints
      
      * Repo consistency
      
      * Use consistent function for detecting tied weights
      695928e1
  16. 16 May, 2023 1 commit
  17. 24 Apr, 2023 1 commit
  18. 04 Apr, 2023 2 commits
  19. 31 Mar, 2023 1 commit
  20. 13 Mar, 2023 1 commit
  21. 09 Mar, 2023 1 commit
  22. 06 Mar, 2023 1 commit
  23. 27 Feb, 2023 2 commits
  24. 22 Feb, 2023 1 commit
  25. 14 Feb, 2023 1 commit
  26. 10 Feb, 2023 2 commits
  27. 09 Feb, 2023 3 commits
  28. 06 Feb, 2023 1 commit
    • Sylvain Gugger's avatar
      Update quality tooling for formatting (#21480) · 6f79d264
      Sylvain Gugger authored
      * Result of black 23.1
      
      * Update target to Python 3.7
      
      * Switch flake8 to ruff
      
      * Configure isort
      
      * Configure isort
      
      * Apply isort with line limit
      
      * Put the right black version
      
      * adapt black in check copies
      
      * Fix copies
      6f79d264
  29. 01 Feb, 2023 1 commit
  30. 26 Jan, 2023 1 commit
  31. 23 Jan, 2023 1 commit
  32. 12 Jan, 2023 1 commit
  33. 27 Dec, 2022 1 commit