1. 19 May, 2022 1 commit
  2. 18 May, 2022 3 commits
  3. 17 May, 2022 1 commit
  4. 16 May, 2022 3 commits
  5. 12 May, 2022 2 commits
  6. 09 May, 2022 1 commit
  7. 04 May, 2022 4 commits
  8. 03 May, 2022 1 commit
  9. 02 May, 2022 3 commits
  10. 28 Apr, 2022 2 commits
  11. 27 Apr, 2022 5 commits
    • Zachary Mueller's avatar
      Fixup no_trainer save logic (#16968) · 60e1d883
      Zachary Mueller authored
      * Fixup all examples
      60e1d883
    • Sylvain Gugger's avatar
      Fix multiple deletions of the same files in save_pretrained (#16947) · c79bbc3b
      Sylvain Gugger authored
      * Fix multiple deletions of the same files in save_pretrained
      
      * Add is_main_process argument
      c79bbc3b
    • Leonid Boytsov's avatar
      Misc. fixes for Pytorch QA examples: (#16958) · c82e017a
      Leonid Boytsov authored
      1. Fixes evaluation errors popping up when you train/eval on squad v2 (one was newly encountered and one that was previously reported Running SQuAD 1.0 sample command raises IndexError #15401 but not completely fixed).
      2. Removes boolean arguments that don't use store_true. Please, don't use these: *ANY non-empty string is being converted to True in this case and this clearly is not the desired behavior (and it creates a LOT of confusion).
      3. All no-trainer test scripts are now saving metric values in the same way (with the right prefix eval_), which is consistent with the trainer-based versions.
      4. Adds forgotten model.eval() in the no-trainer versions. This improved some results, but not everything (see the discussion in the end). Please, see the F1 scores and the discussion below.
      c82e017a
    • NielsRogge's avatar
      Add semantic script, trainer (#16834) · 479fdc49
      NielsRogge authored
      * Add first draft
      
      * Improve script and README
      
      * Improve README
      
      * Apply suggestions from code review
      
      * Improve script, add link to resulting model
      
      * Add corresponding test
      
      * Adjust learning rate
      479fdc49
    • Anton Lozhkov's avatar
      [Research] Speed up evaluation for XTREME-S (#16785) · a4a88fa0
      Anton Lozhkov authored
      * Avoid repeated per-lang filtering
      
      * Language groups and logits preprocessing
      
      * Style
      a4a88fa0
  12. 25 Apr, 2022 2 commits
  13. 21 Apr, 2022 1 commit
  14. 20 Apr, 2022 1 commit
  15. 19 Apr, 2022 5 commits
    • Jeevesh Juneja's avatar
      Correct Logging of Eval metric to Tensorboard (#16825) · b5c6a63e
      Jeevesh Juneja authored
      * Correct Logging of Eval metric to Tensorboard
      
      An empty dictionary ``eval_metrics`` was being logged, is replaced by ``eval_metric`` which is the output dictionary of ``metric.compute()``.
      
      * Remove unused variable
      b5c6a63e
    • NielsRogge's avatar
      Add image classification script, no trainer (#16727) · b96e82c8
      NielsRogge authored
      * Add first draft
      
      * Improve README and run fixup
      
      * Make script aligned with other scripts, improve README
      
      * Improve script and add test
      
      * Remove print statement
      
      * Apply suggestions from code review
      
      * Add num_labels to make test pass
      
      * Improve README
      b96e82c8
    • Wonjae Kim's avatar
      b74a9553
    • Suraj Patil's avatar
      [Flax] improve large model init and loading (#16148) · d3bd9ac7
      Suraj Patil authored
      
      
      * begin do_init
      
      * add params_shape_tree
      
      * raise error if params are accessed when do_init is False
      
      * don't allow do_init=False when keys are missing
      
      * make shape tree a property
      
      * assign self._params at the end
      
      * add test for do_init
      
      * add do_init arg to all flax models
      
      * fix param setting
      
      * disbale do_init for composite models
      
      * update test
      
      * add do_init in FlaxBigBirdForMultipleChoice
      
      * better names and errors
      
      * improve test
      
      * style
      
      * add a warning when do_init=False
      
      * remove extra if
      
      * set params after _required_params
      
      * add test for from_pretrained
      
      * do_init => _do_init
      
      * chage warning to info
      
      * fix typo
      
      * add params in init_weights
      
      * add params to gpt neo init
      
      * add params to init_weights
      
      * update do_init test
      
      * Trigger CI
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * update template
      
      * trigger CI
      
      * style
      
      * style
      
      * fix template
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      d3bd9ac7
    • NielsRogge's avatar
      Add semantic script no trainer, v2 (#16788) · 7db7aab4
      NielsRogge authored
      * Add first draft from previous PR
      
      * First draft
      
      * Improve README and remove num_labels
      
      * Make script more aligned with other scripts
      
      * Improve README and apply suggestion from code review
      7db7aab4
  16. 15 Apr, 2022 1 commit
  17. 14 Apr, 2022 1 commit
  18. 13 Apr, 2022 2 commits
    • Zachary Mueller's avatar
      Fixup no_trainer examples scripts and add more tests (#16765) · be752d12
      Zachary Mueller authored
      * Change tracking to store_true
      
      * Remove step param and use it in the log dictionary directly
      
      * use vars(args) when passing args to init_trackers
      
      * Include tracking tests since tensorboard is already a dep
      be752d12
    • Tu Vu's avatar
      Add self training code for text classification (#16738) · 34ef029d
      Tu Vu authored
      * Add self-training code for text-classification
      
      * Add self-training code for text-classification
      
      * Add self-training code for text-classification
      
      * Add self-training code for text-classification
      
      * Add self-training code for text-classification
      
      * Delete strata
      34ef029d
  19. 12 Apr, 2022 1 commit