"tests/models/vit_mae/__init__.py" did not exist on "29c10a41d04f855c433a6cde7797b325651417d2"
  1. 10 Oct, 2022 2 commits
  2. 14 Sep, 2022 1 commit
  3. 09 Sep, 2022 1 commit
  4. 14 Aug, 2022 1 commit
    • Karim Foda's avatar
      Flax Remat for LongT5 (#17994) · d6eeb871
      Karim Foda authored
      
      
      * [Flax] Add remat (gradient checkpointing)
      
      * fix variable naming in test
      
      * flip: checkpoint using a method
      
      * fix naming
      
      * fix class naming
      
      * apply PVP's suggestions from code review
      
      * add gradient_checkpointing to examples
      
      * Add gradient_checkpointing to run_mlm_flax
      
      * Add remat to longt5
      
      * Add gradient checkpointing test longt5
      
      * Fix args errors
      
      * Fix remaining tests
      
      * Make fixup & quality fixes
      
      * replace kwargs
      
      * remove unecessary kwargs
      
      * Make fixup changes
      
      * revert long_t5_flax changes
      
      * Remove return_dict and copy to LongT5
      
      * Remove test_gradient_checkpointing
      Co-authored-by: default avatarsanchit-gandhi <sanchit@huggingface.co>
      d6eeb871
  5. 06 Aug, 2022 1 commit
  6. 01 Aug, 2022 2 commits
  7. 29 Jul, 2022 1 commit
  8. 28 Jul, 2022 1 commit
  9. 27 Jul, 2022 3 commits
    • Lysandre's avatar
      Dev version · c89a592e
      Lysandre authored
      c89a592e
    • Sanchit Gandhi's avatar
      [Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c
      Sanchit Gandhi authored
      * [Flax] Fix incomplete batches in example scripts
      
      * fix dataloader batching
      
      * convert jnp batch idxs to np array
      
      * add missing `pad_shard_unpad` to final prediction generate step
      
      * only `pad_shard_unpad` at inference time
      
      * merge conflicts
      
      * remove incomplete batch step from eval
      
      * fix run_qa.py
      
      * add `pad_shard_unpad` to run_flax_ner.py
      
      * add `pad_shard_unpad` to run_flax_glue.py
      
      * add `pad_shard_unpad` to run_image_classification.py
      
      * make style
      
      * fix mlm flax eval batches
      
      * remove redundant imports
      7490a97c
    • Duong A. Nguyen's avatar
      Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) · 170fcaa6
      Duong A. Nguyen authored
      * generalize decay_mask_fn to find all layernorm params
      
      * fixup
      
      * generalising decay_mask_fn
      170fcaa6
  10. 19 Jul, 2022 1 commit
  11. 11 Jul, 2022 1 commit
  12. 16 Jun, 2022 1 commit
  13. 07 Jun, 2022 1 commit
    • Sylvain Gugger's avatar
      Add examples telemetry (#17552) · 3cab9027
      Sylvain Gugger authored
      * Add examples telemetry
      
      * Alternative approach
      
      * Add to all other examples
      
      * Add to templates as well
      
      * Put framework separately
      
      * Same for TensorFlow
      3cab9027
  14. 12 May, 2022 2 commits
  15. 27 Apr, 2022 1 commit
    • Leonid Boytsov's avatar
      Misc. fixes for Pytorch QA examples: (#16958) · c82e017a
      Leonid Boytsov authored
      1. Fixes evaluation errors popping up when you train/eval on squad v2 (one was newly encountered and one that was previously reported Running SQuAD 1.0 sample command raises IndexError #15401 but not completely fixed).
      2. Removes boolean arguments that don't use store_true. Please, don't use these: *ANY non-empty string is being converted to True in this case and this clearly is not the desired behavior (and it creates a LOT of confusion).
      3. All no-trainer test scripts are now saving metric values in the same way (with the right prefix eval_), which is consistent with the trainer-based versions.
      4. Adds forgotten model.eval() in the no-trainer versions. This improved some results, but not everything (see the discussion in the end). Please, see the F1 scores and the discussion below.
      c82e017a
  16. 19 Apr, 2022 1 commit
  17. 11 Apr, 2022 2 commits
  18. 06 Apr, 2022 1 commit
  19. 04 Apr, 2022 1 commit
  20. 30 Mar, 2022 1 commit
  21. 28 Mar, 2022 1 commit
  22. 25 Mar, 2022 1 commit
  23. 23 Mar, 2022 2 commits
  24. 08 Mar, 2022 1 commit
  25. 04 Mar, 2022 1 commit
  26. 03 Mar, 2022 1 commit
  27. 23 Feb, 2022 1 commit
  28. 01 Feb, 2022 2 commits
  29. 31 Jan, 2022 1 commit
  30. 27 Jan, 2022 2 commits
  31. 19 Jan, 2022 1 commit
    • Kamal Raj's avatar
      [FLAX] glue training example refactor (#13815) · d1f5ca1a
      Kamal Raj authored
      * refactor run_flax_glue.py
      
      * updated readme
      
      * rm unused import and args typo fix
      
      * refactor
      
      * make consistent arg name across task
      
      * has_tensorboard check
      
      * argparse -> argument dataclasses
      
      * refactor according to review
      
      * fix
      d1f5ca1a