1. 28 Jul, 2023 2 commits
  2. 17 Jul, 2023 1 commit
  3. 12 Jul, 2023 1 commit
  4. 07 Jun, 2023 1 commit
  5. 22 May, 2023 1 commit
  6. 09 May, 2023 1 commit
  7. 02 May, 2023 1 commit
  8. 13 Apr, 2023 1 commit
  9. 23 Mar, 2023 1 commit
  10. 22 Mar, 2023 1 commit
  11. 14 Mar, 2023 1 commit
  12. 22 Feb, 2023 1 commit
  13. 06 Feb, 2023 1 commit
    • Update quality tooling for formatting (#21480) · 6f79d264
      Sylvain Gugger authored
      * Result of black 23.1
      
      * Update target to Python 3.7
      
      * Switch flake8 to ruff
      
      * Configure isort
      
      * Configure isort
      
      * Apply isort with line limit
      
      * Put the right black version
      
      * adapt black in check copies
      
      * Fix copies
  14. 23 Jan, 2023 1 commit
  15. 19 Jan, 2023 1 commit
  16. 18 Jan, 2023 1 commit
  17. 04 Jan, 2023 1 commit
  18. 20 Dec, 2022 1 commit
    • Fix tiny typo (#20841) · ae3cbbca
      fzyzcjy authored
      * Fix typo
      
      * Update README.md
      
      * Update run_mlm_flax_stream.py
      
      * Update README.md
  19. 01 Dec, 2022 1 commit
  20. 28 Nov, 2022 1 commit
  21. 01 Nov, 2022 1 commit
  22. 13 Oct, 2022 1 commit
  23. 10 Oct, 2022 2 commits
  24. 14 Sep, 2022 1 commit
  25. 09 Sep, 2022 1 commit
  26. 14 Aug, 2022 1 commit
    • Flax Remat for LongT5 (#17994) · d6eeb871
      Karim Foda authored
      
      
      * [Flax] Add remat (gradient checkpointing)
      
      * fix variable naming in test
      
      * flip: checkpoint using a method
      
      * fix naming
      
      * fix class naming
      
      * apply PVP's suggestions from code review
      
      * add gradient_checkpointing to examples
      
      * Add gradient_checkpointing to run_mlm_flax
      
      * Add remat to longt5
      
      * Add gradient checkpointing test longt5
      
      * Fix args errors
      
      * Fix remaining tests
      
      * Make fixup & quality fixes
      
      * replace kwargs
      
      * remove unnecessary kwargs
      
      * Make fixup changes
      
      * revert long_t5_flax changes
      
      * Remove return_dict and copy to LongT5
      
      * Remove test_gradient_checkpointing
      Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
  27. 06 Aug, 2022 1 commit
  28. 01 Aug, 2022 2 commits
  29. 29 Jul, 2022 1 commit
  30. 28 Jul, 2022 1 commit
  31. 27 Jul, 2022 3 commits
    • Dev version · c89a592e
      Lysandre authored
    • [Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c
      Sanchit Gandhi authored
      * [Flax] Fix incomplete batches in example scripts
      
      * fix dataloader batching
      
      * convert jnp batch idxs to np array
      
      * add missing `pad_shard_unpad` to final prediction generate step
      
      * only `pad_shard_unpad` at inference time
      
      * merge conflicts
      
      * remove incomplete batch step from eval
      
      * fix run_qa.py
      
      * add `pad_shard_unpad` to run_flax_ner.py
      
      * add `pad_shard_unpad` to run_flax_glue.py
      
      * add `pad_shard_unpad` to run_image_classification.py
      
      * make style
      
      * fix mlm flax eval batches
      
      * remove redundant imports
    • Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) · 170fcaa6
      Duong A. Nguyen authored
      * generalize decay_mask_fn to find all layernorm params
      
      * fixup
      
      * generalising decay_mask_fn
  32. 19 Jul, 2022 1 commit
  33. 11 Jul, 2022 1 commit
  34. 16 Jun, 2022 1 commit
  35. 07 Jun, 2022 1 commit
    • Add examples telemetry (#17552) · 3cab9027
      Sylvain Gugger authored
      * Add examples telemetry
      
      * Alternative approach
      
      * Add to all other examples
      
      * Add to templates as well
      
      * Put framework separately
      
      * Same for TensorFlow