1. 26 Apr, 2024 1 commit
  2. 25 Apr, 2024 1 commit
  3. 23 Apr, 2024 1 commit
  4. 19 Apr, 2024 2 commits
  5. 12 Apr, 2024 1 commit
  6. 10 Apr, 2024 1 commit
  7. 09 Apr, 2024 1 commit
  8. 02 Apr, 2024 1 commit
    • Hard error when ignoring tensors. (#27484) (#29906) · 9b0a8ea7
      Nicolas Patry authored
      
      
      * Hard error when ignoring tensors. (#27484)
      
      * [WIP] Hard error when ignoring tensors.
      
      * Better selection/error when saving a checkpoint.
      
      - Find all names we should normally drop (those are in the transformers
        config).
      - Find all disjoint tensors (for those, we can safely trigger a copy to
        get rid of the sharing before saving).
      - Clone those disjoint tensors, which gets rid of the issue.
      - Find all identical names (those should be declared in the config,
        but we try to find them all anyway).
      - For all identical names:
        - If they are in the config, just ignore them; everything is fine.
        - If they are not, warn about them.
      - For all remaining tensors that are shared yet neither identical nor
        disjoint, raise a hard error.
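
The three categories in the list above (identical, disjoint, neither) can be illustrated with a small sketch. This is not the code from the commit: it assumes each tensor in a sharing group can be modeled as a hypothetical `(start, stop)` byte range inside one common storage, and the function name `classify_shared_group` is invented for illustration.

```python
from itertools import combinations
from typing import Dict, Tuple

# Hypothetical stand-in for a tensor view: a half-open (start, stop) byte
# range inside one shared storage. Real tensors would need their data
# pointer and size inspected instead.
Span = Tuple[int, int]


def classify_shared_group(spans: Dict[str, Span]) -> str:
    """Classify tensors that share one storage.

    Returns "identical" if every name covers exactly the same range,
    "disjoint" if no two ranges overlap, and "overlapping" otherwise.
    """
    ranges = list(spans.values())
    if all(r == ranges[0] for r in ranges):
        return "identical"
    # Two half-open ranges are disjoint when one ends before the other starts.
    if all(a[1] <= b[0] or b[1] <= a[0] for a, b in combinations(ranges, 2)):
        return "disjoint"
    return "overlapping"


print(classify_shared_group({"embed": (0, 8), "lm_head": (0, 8)}))
print(classify_shared_group({"q_proj": (0, 4), "k_proj": (4, 8)}))
print(classify_shared_group({"x": (0, 6), "y": (4, 8)}))
```

Under this model, only the "overlapping" case has no safe automatic fix, which matches the hard error described in the last bullet.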
      
      * Adding a failing test on `main` that passes here.
      
      * We don't need to keep the subfolder logic in this test.
      
      * Apply suggestions from code review
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      ---------
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Add small tests.
      
      * Dead variable.
      
      * Fixup.
      
      * Fixing tied_weights_keys on generic models.
      
      * Fixup + T5 encoder/decoder tying (with different layers)
      
      * Code quality.
      
      * Dynamic member.
      
      * trigger
      
      * Fixing encoder name for other types of encoder/decoder combos.
      
      * Fix scoping.
      
      * Update .github/workflows/self-scheduled.yml
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Fixing the tied_weights after the call.
      
      ---------
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
  9. 27 Mar, 2024 1 commit
  10. 25 Mar, 2024 2 commits
  11. 18 Mar, 2024 1 commit
  12. 15 Mar, 2024 1 commit
  13. 13 Mar, 2024 2 commits
  14. 11 Mar, 2024 2 commits
  15. 07 Mar, 2024 2 commits
  16. 06 Mar, 2024 1 commit
  17. 05 Mar, 2024 1 commit
  18. 01 Mar, 2024 1 commit
  19. 27 Feb, 2024 1 commit
  20. 20 Feb, 2024 1 commit
  21. 16 Feb, 2024 1 commit
  22. 15 Feb, 2024 1 commit
  23. 14 Feb, 2024 1 commit
  24. 12 Feb, 2024 1 commit
  25. 06 Feb, 2024 1 commit
  26. 05 Feb, 2024 1 commit
    • [WIP] Hard error when ignoring tensors. (#27484) · 2da28c4b
      Nicolas Patry authored
      
      
      * [WIP] Hard error when ignoring tensors.
      
      * Better selection/error when saving a checkpoint.
      
      - Find all names we should normally drop (those are in the transformers
        config).
      - Find all disjoint tensors (for those, we can safely trigger a copy to
        get rid of the sharing before saving).
      - Clone those disjoint tensors, which gets rid of the issue.
      - Find all identical names (those should be declared in the config,
        but we try to find them all anyway).
      - For all identical names:
        - If they are in the config, just ignore them; everything is fine.
        - If they are not, warn about them.
      - For all remaining tensors that are shared yet neither identical nor
        disjoint, raise a hard error.
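
The decision flow in the list above (clone, drop, warn, or raise) might look roughly like the following sketch. It is an illustration under stated assumptions, not the commit's implementation: `plan_save`, the `(kind, names)` group format, and the rule for choosing which identical name to keep are all hypothetical.

```python
from typing import List, Set, Tuple


def plan_save(
    groups: List[Tuple[str, List[str]]], declared_tied: Set[str]
) -> Tuple[List[str], List[str], List[str]]:
    """Decide what to do with each group of tensors that share memory.

    groups: (kind, names) pairs, kind being "identical", "disjoint",
        or anything else for partially overlapping tensors.
    declared_tied: names the config already declares as tied.
    Returns (to_clone, to_drop, warnings); raises on overlapping groups.
    """
    to_clone: List[str] = []
    to_drop: List[str] = []
    warnings: List[str] = []
    for kind, names in groups:
        names = sorted(names)
        if kind == "disjoint":
            # A copy removes the sharing, so these are safe to save.
            to_clone.extend(names)
        elif kind == "identical":
            # Assumption: keep the first undeclared name, drop the others.
            undeclared = [n for n in names if n not in declared_tied]
            keep = undeclared[0] if undeclared else names[0]
            for name in names:
                if name == keep:
                    continue
                to_drop.append(name)
                if name not in declared_tied:
                    warnings.append(
                        f"{name} shares memory with {keep} but is not declared as tied"
                    )
        else:
            raise RuntimeError(
                f"Tensors {names} share memory but are neither identical nor disjoint"
            )
    return to_clone, to_drop, warnings


print(plan_save([("identical", ["embed", "lm_head"])], {"lm_head"}))
```

Raising only in the final branch keeps the failure limited to the one case that cannot be resolved by cloning or dropping a name.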
      
      * Adding a failing test on `main` that passes here.
      
      * We don't need to keep the subfolder logic in this test.
      
      * Apply suggestions from code review
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      ---------
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
  27. 02 Feb, 2024 2 commits
  28. 31 Jan, 2024 1 commit
  29. 30 Jan, 2024 1 commit
  30. 26 Jan, 2024 4 commits
  31. 18 Jan, 2024 1 commit