1. 01 Nov, 2022 1 commit
  2. 17 Oct, 2022 1 commit
  3. 12 Oct, 2022 2 commits
  4. 10 Oct, 2022 1 commit
  5. 28 Sep, 2022 2 commits
  6. 14 Sep, 2022 1 commit
  7. 13 Sep, 2022 1 commit
  8. 25 Aug, 2022 1 commit
  9. 22 Aug, 2022 1 commit
  10. 18 Aug, 2022 1 commit
  11. 08 Aug, 2022 1 commit
    • Update no_trainer.py scripts to include accelerate gradient accumulation wrapper (#18473) · a765b68a
      Rasmus Arpe Fogh Jensen authored
      * Added accelerate gradient accumulation wrapper to run_image_classification_no_trainer.py example script
      
      * make fixup changes
      
      * PR comments
      
      * changed input to Accelerator based on PR comment, ran make fixup
      
      * Added comment explaining the sync_gradients statement
      
      * Fixed lr scheduler max steps
      
      * Changed run_clm_no_trainer.py script to use accelerate gradient accum wrapper
      
      * Fixed all scripts except wav2vec2 pretraining to use accelerate gradient accum wrapper
      
      * Added accelerate gradient accum wrapper for wav2vec2_pretraining_no_trainer.py script
      
      * make fixup and lr_scheduler step inserted back into run_qa_beam_search_no_trainer.py
      
      * removed changes to run_wav2vec2_pretraining_no_trainer.py script and fixed using wrong constant in qa_beam_search_no_trainer.py script
      a765b68a
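The gradient accumulation idea behind this commit can be illustrated with a minimal, library-free sketch. It deliberately does not use the accelerate API; `train_epoch` and the toy data are hypothetical names chosen for illustration. It shows two things the commit messages touch on: the optimizer only steps on "sync" iterations, and the number of optimizer updates (what an lr scheduler should count) is the number of batches divided by the accumulation steps, rounded up.

```python
import math


def train_epoch(batches, weight, lr, accumulation_steps):
    """One epoch of SGD on loss = mean((w * x - y) ** 2) for a single scalar weight.

    Gradients are accumulated over `accumulation_steps` micro-batches before
    each weight update (hypothetical helper, not part of any library).
    """
    grad_buffer = 0.0
    optimizer_steps = 0
    for index, batch in enumerate(batches, start=1):
        # Mean gradient of the squared error over this micro-batch.
        grad = sum(2 * (weight * x - y) * x for x, y in batch) / len(batch)
        # Scale so the buffer holds an average over micro-batches, not a sum.
        grad_buffer += grad / accumulation_steps
        # Update on every N-th micro-batch, and on the last one so nothing is dropped.
        is_sync_step = index % accumulation_steps == 0 or index == len(batches)
        if is_sync_step:
            weight -= lr * grad_buffer
            grad_buffer = 0.0
            optimizer_steps += 1
    return weight, optimizer_steps


# Toy data for y = 2 * x, one example per micro-batch.
batches = [[(1.0, 2.0)], [(2.0, 4.0)], [(3.0, 6.0)], [(4.0, 8.0)], [(5.0, 10.0)]]
new_weight, steps = train_epoch(batches, weight=0.0, lr=0.01, accumulation_steps=2)

# Five micro-batches with accumulation over two gives three optimizer updates.
expected_steps = math.ceil(len(batches) / 2)
print(steps, expected_steps)
```

In this sketch the final, shorter group is still divided by `accumulation_steps`, which is one common simplification; a real training script may handle the remainder differently.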
  12. 06 Aug, 2022 1 commit
  13. 04 Aug, 2022 1 commit
  14. 01 Aug, 2022 1 commit
  15. 29 Jul, 2022 1 commit
  16. 27 Jul, 2022 1 commit
  17. 29 Jun, 2022 1 commit
  18. 23 Jun, 2022 1 commit
  19. 22 Jun, 2022 1 commit
    • Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer`... · 13570381
      Eran Hirsch authored
      Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict` (#17805)
      
      * Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict`
      
      * Add all generate parameters to `Seq2SeqTrainer`, and also to `QuestionAnsweringSeq2SeqTrainer` which overrides it
      
      * Remove `self._num_beams` from trainer classes
      
      * - Run fixup
      - Fix "Constraint" not exposed
      - Fix synced_gpus to actually read from param
      
      * Use kwargs
      
      * Copy kwargs before making changes to it
      
      * Fix style issues unused imports
      13570381
  20. 16 Jun, 2022 1 commit
  21. 07 Jun, 2022 1 commit
    • Add examples telemetry (#17552) · 3cab9027
      Sylvain Gugger authored
      * Add examples telemetry
      
      * Alternative approach
      
      * Add to all other examples
      
      * Add to templates as well
      
      * Put framework separately
      
      * Same for TensorFlow
      3cab9027
  22. 27 May, 2022 1 commit
  23. 12 May, 2022 2 commits
  24. 09 May, 2022 1 commit
  25. 02 May, 2022 2 commits
  26. 27 Apr, 2022 3 commits
    • Fixup no_trainer save logic (#16968) · 60e1d883
      Zachary Mueller authored
      * Fixup all examples
      60e1d883
    • Fix multiple deletions of the same files in save_pretrained (#16947) · c79bbc3b
      Sylvain Gugger authored
      * Fix multiple deletions of the same files in save_pretrained
      
      * Add is_main_process argument
      c79bbc3b
    • Misc. fixes for Pytorch QA examples: (#16958) · c82e017a
      Leonid Boytsov authored
      1. Fixes evaluation errors that appear when training or evaluating on SQuAD v2: one newly encountered, and one previously reported in #15401 ("Running SQuAD 1.0 sample command raises IndexError") but not completely fixed.
      2. Removes boolean arguments that don't use store_true. Please don't use these: ANY non-empty string is converted to True in this case, which is clearly not the desired behavior and creates a LOT of confusion.
      3. All no-trainer test scripts now save metric values in the same way (with the right prefix, eval_), which is consistent with the trainer-based versions.
      4. Adds the forgotten model.eval() in the no-trainer versions. This improved some results, but not all of them (see the discussion at the end). Please see the F1 scores and the discussion below.
      c82e017a
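The boolean-argument pitfall described in item 2 can be reproduced with the standard library alone. The sketch below is illustrative only: the flag names (`--naive_flag`, `--store_true_flag`, `--parsed_flag`) and the `str_to_bool` helper are hypothetical and are not taken from the example scripts.

```python
import argparse


def str_to_bool(value):
    """Parse common true/false spellings explicitly (hypothetical helper)."""
    lowered = value.lower()
    if lowered in {"1", "true", "yes", "y"}:
        return True
    if lowered in {"0", "false", "no", "n"}:
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean value, got {value!r}")


parser = argparse.ArgumentParser()
# Pitfall: type=bool calls bool() on the raw string, and any non-empty string is truthy.
parser.add_argument("--naive_flag", type=bool, default=False)
# Preferred for simple switches: present means True, absent means False.
parser.add_argument("--store_true_flag", action="store_true")
# Alternative when an explicit value is wanted on the command line.
parser.add_argument("--parsed_flag", type=str_to_bool, default=False)

args = parser.parse_args(["--naive_flag", "False", "--parsed_flag", "False"])

print(args.naive_flag)        # True, because bool("False") is True
print(args.store_true_flag)   # False, the switch was not passed
print(args.parsed_flag)       # False, parsed explicitly
```

With `action="store_true"` the user cannot accidentally enable an option by writing `--flag False`, which is the confusion the commit message warns about.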
  27. 13 Apr, 2022 1 commit
  28. 11 Apr, 2022 1 commit
  29. 08 Apr, 2022 1 commit
  30. 06 Apr, 2022 2 commits
  31. 04 Apr, 2022 1 commit
  32. 31 Mar, 2022 1 commit
  33. 30 Mar, 2022 1 commit