1. 12 Oct, 2023 1 commit
  2. 10 Oct, 2023 1 commit
  3. 04 Oct, 2023 1 commit
    • Phuc Van Phan's avatar
      refactor: change default block_size (#26229) · 6015f91a
      Phuc Van Phan authored
      * refactor: change default block_size
      
      * fix: return tf to origin
      
      * fix: change files to origin
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * refactor: add min block_size to files
      
      * reformat: add min block_size for run_clm tf
      6015f91a
  4. 11 Sep, 2023 2 commits
  5. 27 Jun, 2023 1 commit
  6. 22 Feb, 2023 1 commit
  7. 06 Feb, 2023 1 commit
    • Sylvain Gugger's avatar
      Update quality tooling for formatting (#21480) · 6f79d264
      Sylvain Gugger authored
      * Result of black 23.1
      
      * Update target to Python 3.7
      
      * Switch flake8 to ruff
      
      * Configure isort
      
      * Configure isort
      
      * Apply isort with line limit
      
      * Put the right black version
      
      * adapt black in check copies
      
      * Fix copies
      6f79d264
  8. 29 Dec, 2022 1 commit
  9. 20 Dec, 2022 1 commit
    • fzyzcjy's avatar
      Fix tiny typo (#20841) · ae3cbbca
      fzyzcjy authored
      * Fix typo
      
      * Update README.md
      
      * Update run_mlm_flax_stream.py
      
      * Update README.md
      ae3cbbca
  10. 06 Oct, 2022 1 commit
  11. 09 Sep, 2022 1 commit
  12. 11 Jul, 2022 2 commits
  13. 16 May, 2022 1 commit
    • Kenneth Enevoldsen's avatar
      fixed bug in run_mlm_flax_stream.py (#17203) · 71d18d08
      Kenneth Enevoldsen authored
      
      
      * fixed bug run_mlm_flax_stream.py
      
      Fixed bug caused by an update to tokenizer keys introduced in recent transformers versions (between `4.6.2` and `4.18.0`) where additional keys were introduced to the tokenizer output.
      
      * Update run_mlm_flax_stream.py
      
      * adding missing paranthesis
      
      * formatted to black
      
      * remove cols from dataset instead
      
      * reformat to black
      
      * moved rem. columns to map
      
      * formatted to black
      Co-authored-by: default avatarKennethEnevoldsen <kennethcenevolsen@gmail.com>
      71d18d08
  14. 12 May, 2022 1 commit
  15. 19 Apr, 2022 1 commit
    • Suraj Patil's avatar
      [Flax] improve large model init and loading (#16148) · d3bd9ac7
      Suraj Patil authored
      
      
      * begin do_init
      
      * add params_shape_tree
      
      * raise error if params are accessed when do_init is False
      
      * don't allow do_init=False when keys are missing
      
      * make shape tree a property
      
      * assign self._params at the end
      
      * add test for do_init
      
      * add do_init arg to all flax models
      
      * fix param setting
      
      * disbale do_init for composite models
      
      * update test
      
      * add do_init in FlaxBigBirdForMultipleChoice
      
      * better names and errors
      
      * improve test
      
      * style
      
      * add a warning when do_init=False
      
      * remove extra if
      
      * set params after _required_params
      
      * add test for from_pretrained
      
      * do_init => _do_init
      
      * chage warning to info
      
      * fix typo
      
      * add params in init_weights
      
      * add params to gpt neo init
      
      * add params to init_weights
      
      * update do_init test
      
      * Trigger CI
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * update template
      
      * trigger CI
      
      * style
      
      * style
      
      * fix template
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      d3bd9ac7
  16. 30 Mar, 2022 1 commit
  17. 23 Mar, 2022 1 commit
  18. 17 Mar, 2022 1 commit
  19. 16 Mar, 2022 1 commit
  20. 10 Mar, 2022 1 commit
  21. 04 Mar, 2022 1 commit
  22. 09 Feb, 2022 1 commit
  23. 06 Dec, 2021 1 commit
  24. 22 Nov, 2021 1 commit
  25. 17 Nov, 2021 1 commit
  26. 11 Nov, 2021 1 commit
    • Suraj Patil's avatar
      Fix Flax params dtype (#13098) · e92190c0
      Suraj Patil authored
      
      
      * fix inits
      
      * fix embed dtype
      
      * fix embed dtype
      
      * add test to check default dtype
      
      * quality
      
      * add type conversion methods for flax models
      
      * more robust casting
      
      * cast sinusoidal positions
      
      * update pegasus
      
      * update albert
      
      * update test
      
      * make sure dtype is passed to every module
      
      * style
      
      * fix electra dense
      
      * fix t5
      
      * quality
      
      * add more tests
      
      * better name
      
      * use the dtype for lm head computation
      
      * fix albert
      
      * style
      
      * fix albert embed dtype
      
      * more tests
      
      * fix vision enc-dec
      
      * cleanup
      
      * fix embed dtype pegasus
      
      * fix default param test
      
      * doc
      
      * update template
      
      * fix final_logits_bias dtype
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * fix doc
      
      * fix doc
      
      * add detailed docstring for dtype parameter
      
      * remove un-necessary import
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      e92190c0
  27. 09 Nov, 2021 1 commit
  28. 02 Sep, 2021 1 commit
  29. 27 Aug, 2021 1 commit
  30. 12 Aug, 2021 1 commit
  31. 04 Aug, 2021 1 commit
  32. 20 Jul, 2021 2 commits
  33. 17 Jul, 2021 1 commit
  34. 16 Jul, 2021 2 commits
  35. 15 Jul, 2021 2 commits