1. 04 May, 2022 3 commits
  2. 03 May, 2022 1 commit
  3. 02 May, 2022 3 commits
  4. 28 Apr, 2022 2 commits
  5. 27 Apr, 2022 5 commits
    • Zachary Mueller's avatar
      Fixup no_trainer save logic (#16968) · 60e1d883
      Zachary Mueller authored
      * Fixup all examples
      60e1d883
    • Sylvain Gugger's avatar
      Fix multiple deletions of the same files in save_pretrained (#16947) · c79bbc3b
      Sylvain Gugger authored
      * Fix multiple deletions of the same files in save_pretrained
      
      * Add is_main_process argument
      c79bbc3b
    • Leonid Boytsov's avatar
      Misc. fixes for Pytorch QA examples: (#16958) · c82e017a
      Leonid Boytsov authored
      1. Fixes evaluation errors popping up when you train/eval on squad v2 (one was newly encountered and one that was previously reported Running SQuAD 1.0 sample command raises IndexError #15401 but not completely fixed).
      2. Removes boolean arguments that don't use store_true. Please, don't use these: *ANY non-empty string is being converted to True in this case and this clearly is not the desired behavior (and it creates a LOT of confusion).
      3. All no-trainer test scripts are now saving metric values in the same way (with the right prefix eval_), which is consistent with the trainer-based versions.
      4. Adds forgotten model.eval() in the no-trainer versions. This improved some results, but not everything (see the discussion in the end). Please, see the F1 scores and the discussion below.
      c82e017a
    • NielsRogge's avatar
      Add semantic script, trainer (#16834) · 479fdc49
      NielsRogge authored
      * Add first draft
      
      * Improve script and README
      
      * Improve README
      
      * Apply suggestions from code review
      
      * Improve script, add link to resulting model
      
      * Add corresponding test
      
      * Adjust learning rate
      479fdc49
    • Anton Lozhkov's avatar
      [Research] Speed up evaluation for XTREME-S (#16785) · a4a88fa0
      Anton Lozhkov authored
      * Avoid repeated per-lang filtering
      
      * Language groups and logits preprocessing
      
      * Style
      a4a88fa0
  6. 25 Apr, 2022 2 commits
  7. 21 Apr, 2022 1 commit
  8. 20 Apr, 2022 1 commit
  9. 19 Apr, 2022 5 commits
    • Jeevesh Juneja's avatar
      Correct Logging of Eval metric to Tensorboard (#16825) · b5c6a63e
      Jeevesh Juneja authored
      * Correct Logging of Eval metric to Tensorboard
      
      An empty dictionary ``eval_metrics`` was being logged, is replaced by ``eval_metric`` which is the output dictionary of ``metric.compute()``.
      
      * Remove unused variable
      b5c6a63e
    • NielsRogge's avatar
      Add image classification script, no trainer (#16727) · b96e82c8
      NielsRogge authored
      * Add first draft
      
      * Improve README and run fixup
      
      * Make script aligned with other scripts, improve README
      
      * Improve script and add test
      
      * Remove print statement
      
      * Apply suggestions from code review
      
      * Add num_labels to make test pass
      
      * Improve README
      b96e82c8
    • Wonjae Kim's avatar
      b74a9553
    • Suraj Patil's avatar
      [Flax] improve large model init and loading (#16148) · d3bd9ac7
      Suraj Patil authored
      
      
      * begin do_init
      
      * add params_shape_tree
      
      * raise error if params are accessed when do_init is False
      
      * don't allow do_init=False when keys are missing
      
      * make shape tree a property
      
      * assign self._params at the end
      
      * add test for do_init
      
      * add do_init arg to all flax models
      
      * fix param setting
      
      * disbale do_init for composite models
      
      * update test
      
      * add do_init in FlaxBigBirdForMultipleChoice
      
      * better names and errors
      
      * improve test
      
      * style
      
      * add a warning when do_init=False
      
      * remove extra if
      
      * set params after _required_params
      
      * add test for from_pretrained
      
      * do_init => _do_init
      
      * chage warning to info
      
      * fix typo
      
      * add params in init_weights
      
      * add params to gpt neo init
      
      * add params to init_weights
      
      * update do_init test
      
      * Trigger CI
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * update template
      
      * trigger CI
      
      * style
      
      * style
      
      * fix template
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      d3bd9ac7
    • NielsRogge's avatar
      Add semantic script no trainer, v2 (#16788) · 7db7aab4
      NielsRogge authored
      * Add first draft from previous PR
      
      * First draft
      
      * Improve README and remove num_labels
      
      * Make script more aligned with other scripts
      
      * Improve README and apply suggestion from code review
      7db7aab4
  10. 15 Apr, 2022 1 commit
  11. 14 Apr, 2022 1 commit
  12. 13 Apr, 2022 2 commits
    • Zachary Mueller's avatar
      Fixup no_trainer examples scripts and add more tests (#16765) · be752d12
      Zachary Mueller authored
      * Change tracking to store_true
      
      * Remove step param and use it in the log dictionary directly
      
      * use vars(args) when passing args to init_trackers
      
      * Include tracking tests since tensorboard is already a dep
      be752d12
    • Tu Vu's avatar
      Add self training code for text classification (#16738) · 34ef029d
      Tu Vu authored
      * Add self-training code for text-classification
      
      * Add self-training code for text-classification
      
      * Add self-training code for text-classification
      
      * Add self-training code for text-classification
      
      * Add self-training code for text-classification
      
      * Delete strata
      34ef029d
  13. 12 Apr, 2022 2 commits
  14. 11 Apr, 2022 4 commits
    • Zachary Mueller's avatar
      Fix example logs repeating themselves (#16669) · 69233cf0
      Zachary Mueller authored
      Move declaration of log streams to before tests, so that results won't get compounded on top of each other
      69233cf0
    • Zachary Mueller's avatar
      Don't push checkpoints to hub in `no_trainer` scripts (#16703) · d4b3e359
      Zachary Mueller authored
      Adds checkpoint prefixes to the gitignore if `push_to_hub` is used along with `checkpointint_steps`
      d4b3e359
    • Ahmed Elnaggar's avatar
      Fix t5 shard on TPU Pods (#16527) · 5e686757
      Ahmed Elnaggar authored
      
      
      * Fix t5 shard on TPU Pods
      
      The current script doesn't work properly on a TPU pod because the global batch is not divided correctly per host.
      This pull request fixes this issue by dividing the global batch to each host before it is shared on each host.
      
      * fix style
      Co-authored-by: default avatarahmed-elnaggar <ahmed.elnaggar@allianz.com>
      5e686757
    • Jia LI's avatar
      Jia multi gpu eval (#16428) · 4868a830
      Jia LI authored
      
      
      * add simple multi gpu complet
      
      * add human_eval_multi_gpu
      
      * use copy strategy to distribute across gpu, to avoid padding
      
      * add doc string
      
      * update code style
      
      * use task id to arrange output
      
      * truncate input to avoid zero pad
      
      * Stop the copy mechanism
      
      * update style
      
      * restore copies to scale better in distributed mode
      
      * update style
      
      * replace human eval
      
      * Apply suggestions from code review
      
      1. Tokenize all input at the same time
      2. use attention_mask to get the input length
      3. other small fixes
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      
      * correct typo and update docstring
      
      * update code style
      
      * remove num sample division constraint
      
      * remove max len calculation
      
      * use accelerator.gather once to speed up
      
      * use accelerate set_seed; update accelerate version
      
      * correct gather bug
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      4868a830
  15. 08 Apr, 2022 2 commits
    • Zachary Mueller's avatar
      Add tests for no_trainer and fix existing examples (#16656) · d57da992
      Zachary Mueller authored
      * Fixed some bugs involving saving during epochs
      * Added tests mimicking the existing examples tests
      * Added in json exporting to all `no_trainer` examples for consistency 
      d57da992
    • NielsRogge's avatar
      Add TAPEX (#16473) · 4ef0abb7
      NielsRogge authored
      
      
      * Add TapexTokenizer
      
      * Improve docstrings and provide option to provide answer
      
      * Remove option for pretokenized inputs
      
      * Add TAPEX to README
      
      * Fix copies
      
      * Remove option for pretokenized inputs
      
      * Initial commit: add tapex fine-tuning examples on both table-based question answering and table-based fact verification.
      
      * - Draft a README file for running the script and introducing some background.
      - Remove unused code lines in tabfact script.
      - Disable the deafult `pad_to_max_length` option which is memory-consuming.
      
      * * Support `as_target_tokenizer` function for TapexTokenizer.
      * Fix the do_lower_case behaviour of TapexTokenizer.
      * Add unit tests for target scenarios and cased/uncased scenarios for both source and target.
      
      * * Replace the label BartTokenizer with TapexTokenizer's as_target_tokenizer function.
      * Fix typos in tapex example README.
      
      * * fix the evaluation script - remove the property `task_name`
      
      * * Make the label space more clear for tabfact tasks
      
      * * Using a new fine-tuning script for tapex-base on tabfact.
      
      * * Remove the lowercase code outside the tokenizer - we use the tokenizer to control whether do_lower_case
      * Guarantee the hyper-parameter can be run without out-of-memory on 16GB card and report the new reproduced number on wikisql
      
      * * Remove the default tokenizer_name option.
      * Provide evaluation command.
      
      * * Support for WikiTableQuestion dataset.
      
      * Fix a typo in README.
      
      * * Fix the datasets's key name in WikiTableQuestions
      
      * Run make fixup and move test to folder
      
      * Fix quality
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Apply some more suggestions from code review
      
      * Improve docstrings
      
      * Overwrite failing test
      
      * Improve comment in example scripts
      
      * Fix rebase
      
      * Add TAPEX to Auto mapping
      
      * Add TAPEX to auto config mappings
      
      * Put TAPEX higher than BART in auto mapping
      
      * Add TAPEX to doc tests
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MBP.localdomain>
      Co-authored-by: default avatarSivilTaram <qianlxc@outlook.com>
      Co-authored-by: default avatarNiels Rogge <nielsrogge@nielss-mbp.home>
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      4ef0abb7
  16. 06 Apr, 2022 2 commits
  17. 04 Apr, 2022 1 commit
  18. 01 Apr, 2022 1 commit
  19. 31 Mar, 2022 1 commit