1. 24 Aug, 2022 2 commits
  2. 01 Aug, 2022 1 commit
  3. 26 Jul, 2022 1 commit
  4. 20 Jun, 2022 1 commit
  5. 31 May, 2022 1 commit
  6. 25 May, 2022 1 commit
  7. 12 May, 2022 1 commit
  8. 09 May, 2022 1 commit
  9. 25 Apr, 2022 1 commit
  10. 19 Apr, 2022 1 commit
    • [Flax] improve large model init and loading (#16148) · d3bd9ac7
      Suraj Patil authored
      
      
      * begin do_init
      
      * add params_shape_tree
      
      * raise error if params are accessed when do_init is False
      
      * don't allow do_init=False when keys are missing
      
      * make shape tree a property
      
      * assign self._params at the end
      
      * add test for do_init
      
      * add do_init arg to all flax models
      
      * fix param setting
      
      * disable do_init for composite models
      
      * update test
      
      * add do_init in FlaxBigBirdForMultipleChoice
      
      * better names and errors
      
      * improve test
      
      * style
      
      * add a warning when do_init=False
      
      * remove extra if
      
      * set params after _required_params
      
      * add test for from_pretrained
      
      * do_init => _do_init
      
      * change warning to info
      
      * fix typo
      
      * add params in init_weights
      
      * add params to gpt neo init
      
      * add params to init_weights
      
      * update do_init test
      
      * Trigger CI
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * update template
      
      * trigger CI
      
      * style
      
      * style
      
      * fix template
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
  11. 12 Apr, 2022 1 commit
  12. 04 Apr, 2022 1 commit
  13. 01 Apr, 2022 1 commit
    • Adding missing type hints for mBART model (PyTorch) (#16429) · 5fe06b9b
      Rishav Chandra Varma authored
      
      
      * added type hints for mbart tensorflow tf implementation
      
      * Adding missing type hints for mBART model 
      
      Tensorflow Implementation model added with missing type hints
      
      * Missing Type hints - correction
      
      For TF model
      
      * Code fixup using make quality tests
      
      * Hint types - typo error
      
      * make fix-copies and make fixup
      
      * type hints
      
      * updated files
      
      * type hints update
      
      * making dependent models coherent
      Co-authored-by: matt <rocketknight1@gmail.com>
  14. 31 Mar, 2022 1 commit
  15. 25 Mar, 2022 1 commit
  16. 23 Mar, 2022 1 commit
    • Reorganize file utils (#16264) · 4975002d
      Sylvain Gugger authored
      * Split file_utils in several submodules
      
      * Fixes
      
      * Add back more objects
      
      * More fixes
      
      * Who exactly decided to import that from there?
      
      * Second suggestion to code with code review
      
      * Revert wrong move
      
      * Fix imports
      
      * Adapt all imports
      
      * Adapt all imports everywhere
      
      * Revert this import, will fix in a separate commit
  17. 22 Mar, 2022 1 commit
  18. 21 Mar, 2022 2 commits
  19. 16 Mar, 2022 1 commit
  20. 23 Feb, 2022 1 commit
  21. 09 Feb, 2022 1 commit
  22. 01 Feb, 2022 1 commit
  23. 31 Jan, 2022 2 commits
  24. 30 Jan, 2022 1 commit
  25. 28 Jan, 2022 1 commit
    • Add XGLM models (#14876) · d25e25ee
      Suraj Patil authored
      
      
      * add xglm
      
      * update vocab size
      
      * fix model name
      
      * style and tokenizer
      
      * typo
      
      * no mask token
      
      * fix pos embed compute
      
      * fix args
      
      * fix tokenizer
      
      * fix positions
      
      * fix tokenization
      
      * style and dic fixes
      
      * fix imports
      
      * add fast tokenizer
      
      * update names
      
      * add pt tests
      
      * fix tokenizer
      
      * fix typo
      
      * fix tokenizer import
      
      * fix fast tokenizer
      
      * fix tokenizer
      
      * fix converter
      
      * add tokenizer test
      
      * update checkpoint names
      
      * fix tokenizer tests
      
      * fix slow tests
      
      * add copied from comments
      
      * rst -> mdx
      
      * flax model
      
      * update flax tests
      
      * quality
      
      * style
      
      * doc
      
      * update index and readme
      
      * fix copies
      
      * fix doc
      
      * update toctree
      
      * fix indent
      
      * minor fixes
      
      * fix config doc
      
      * don't save embed_pos weights
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * address Sylvain's comments, few doc fixes
      
      * fix check_repo
      
      * align order of arguments
      
      * fix copies
      
      * fix labels
      
      * remove unnecessary mapping
      
      * fix saving tokenizer
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
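
The behaviour listed in commit d3bd9ac7 (skip weight initialization, expose a shape tree, raise when params are accessed without init, refuse lazy loading when keys are missing) can be pictured with a small pure-Python toy. This is only a sketch of the pattern, not the library's code: the class `LazyInitModel`, its integer "shapes", and the zero-filled weights are all hypothetical stand-ins chosen for illustration.

```python
from typing import Dict, List


class LazyInitModel:
    """Hypothetical toy model; shapes are plain ints, weights are lists of floats."""

    def __init__(self, shapes: Dict[str, int], do_init: bool = True):
        self._shapes = dict(shapes)
        self._do_init = do_init
        # Weights are only materialized when do_init is True.
        self._params = self.init_weights() if do_init else None

    def init_weights(self) -> Dict[str, List[float]]:
        # Stand-in for random initialization: zero-filled lists.
        return {name: [0.0] * size for name, size in self._shapes.items()}

    @property
    def params_shape_tree(self) -> Dict[str, int]:
        # Shapes are always available, even when no weights were allocated.
        return dict(self._shapes)

    @property
    def params(self) -> Dict[str, List[float]]:
        if not self._do_init:
            raise ValueError(
                "params are not stored on the model when do_init=False; "
                "keep them in a separate variable instead"
            )
        return self._params

    @classmethod
    def from_pretrained(cls, shapes, loaded, do_init: bool = True):
        missing = sorted(set(shapes) - set(loaded))
        if missing and not do_init:
            # Missing keys would need initialization, which was disabled.
            raise ValueError(f"cannot use do_init=False, missing keys: {missing}")
        model = cls(shapes, do_init=do_init)
        if not do_init:
            # In this toy, the loaded weights are handed back separately.
            return model, dict(loaded)
        model._params.update(loaded)  # loaded weights override the initialized ones
        return model


if __name__ == "__main__":
    model, params = LazyInitModel.from_pretrained(
        {"w": 2}, {"w": [1.0, 2.0]}, do_init=False
    )
    print(model.params_shape_tree, params)
```

The toy returns the weights alongside the model in the lazy case so that nothing is stored on the instance; whether the real models follow exactly this shape is not stated in the commit message.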