1. 18 Mar, 2021 9 commits
    • Sylvain Gugger's avatar
      Fix distributed evaluation (#10795) · 008672e6
      Sylvain Gugger authored
      * Fix distributed evaluation
      
      * Use logger
      008672e6
    • Stas Bekman's avatar
      [examples/seq2seq/README.md] fix t5 examples (#10734) · 9352b515
      Stas Bekman authored
      * [examples/seq2seq] fix t5 examples
      
      This PR:
      * fixes T5 examples to include `--source_prefix` - it's **not** optional. If you give it a try you will see that you get 10x worse bleu scores w/o it. w/ `27.6849`, w/ `2.374`
      * added a normal translation example w/o the peculiarities of MBart and T5
      * reduces the default max samples to 50 so it's much faster to test quickly
      
      summarization seems to be broken for t5 score-wise: https://github.com/huggingface/transformers/issues/10733
      
      @sgugger
      
      * specify explicitly the t5 models requiring the special handling
      
      * one more
      
      * update the t5 summarization example to use cnn_dailymail
      
      * move max*samples into the top level README.md
      
      * better wording
      
      * better wording
      9352b515
    • Vimarsh Chaturvedi's avatar
      from_pretrained: check that the pretrained model is for the right model architecture (#10586) · 094afa51
      Vimarsh Chaturvedi authored
      
      
      * Added check to ensure model name passed to from_pretrained and model are the same
      
      * Added test to check from_pretrained throws assert error when passed an incompatiable model name
      
      * Modified assert in from_pretrained with f-strings. Modified test to ensure desired assert message is being generated
      
      * Added check to ensure config and model has model_type
      
      * Fix FlauBERT heads
      
      Co-authored-by: vimarsh chaturvedi <vimarsh chaturvedi>
      Co-authored-by: default avatarStas Bekman <stas@stason.org>
      Co-authored-by: default avatarLysandre <lysandre.debut@reseau.eseo.fr>
      094afa51
    • Julien Chaumond's avatar
      [file_utils] do not gobble certain kinds of requests.ConnectionError (#10235) · 4f3e93cf
      Julien Chaumond authored
      
      
      * do not gobble certain kinds of requests.ConnectionError
      
      * Apply review comments
      Co-authored-by: default avatarLysandre <lysandre.debut@reseau.eseo.fr>
      4f3e93cf
    • James Thomin's avatar
      Fix bug in input check for LengthGroupSampler (#10783) · ce9724e1
      James Thomin authored
      This commit fixes a bug in the LengthGroupSampler where if
      model_input_name is not set, the default value is None instead of
      "input_ids"
      ce9724e1
    • Suraj Patil's avatar
      add run_common_voice script (#10767) · 5f19c07a
      Suraj Patil authored
      * add initial script
      
      * finish script
      
      * add shell script example
      
      * accept chars_to_ignor as cl arg
      
      * align the script with other example scripts
      
      * add torchaudio dep
      5f19c07a
    • Mohamed El-Geish's avatar
      wav2vec2: support datasets other than LibriSpeech (#10581) · af8afdc8
      Mohamed El-Geish authored
      * wav2vec2: support datasets other than LibriSpeech
      
      * Formatting run_asr.py to pass code quality test
      
      * bundled orthography options and added verbose logs
      
      * fixing a typo in timit fine-tuning script
      
      * update comment for clarity
      
      * resize_lm_head and load custom vocab from file
      
      * adding a max_duration_in_seconds filter
      
      * do not assign `duration_filter` lambda, use a def
      
      * log untransliterated text as well
      
      * fix base model for arabic
      
      * fix duration filter when target_sr is not set
      
      * drop duration_in_seconds when unneeded
      
      * script for wav2vec2-large-lv60-timit-asr
      
      * fix for "tha" in arabic corpus (huggingface#10581)
      
      * adding more options to work with common_voice
      
      * PR feedback (huggingface#10581)
      
      * small README change
      af8afdc8
    • Patrick von Platen's avatar
      [Flax] Adapt Flax models to new structure (#9484) · 0b98ca36
      Patrick von Platen authored
      
      
      * Create modeling_flax_eletra with code copied from modeling_flax_bert
      
      * Add ElectraForMaskedLM and ElectraForPretraining
      
      * Add modeling test for Flax electra and fix naming and arg in Flax Electra model
      
      * Add documentation
      
      * Fix code style
      
      * Create modeling_flax_eletra with code copied from modeling_flax_bert
      
      * Add ElectraForMaskedLM and ElectraForPretraining
      
      * Add modeling test for Flax electra and fix naming and arg in Flax Electra model
      
      * Add documentation
      
      * Fix code style
      
      * Fix code quality
      
      * Adjust tol in assert_almost_equal due to very small difference between model output, ranging 0.0010 - 0.0016
      
      * Remove redundant ElectraPooler
      
      * save intermediate
      
      * adapt
      
      * correct bert flax design
      
      * adapt roberta as well
      
      * finish roberta flax
      
      * finish
      
      * apply suggestions
      
      * apply suggestions
      Co-authored-by: default avatarChris Nguyen <anhtu2687@gmail.com>
      0b98ca36
    • Funtowicz Morgan's avatar
  2. 17 Mar, 2021 11 commits
  3. 16 Mar, 2021 14 commits
  4. 15 Mar, 2021 6 commits