1. 17 Feb, 2021 1 commit
  2. 16 Feb, 2021 5 commits
  3. 15 Feb, 2021 12 commits
  4. 13 Feb, 2021 6 commits
  5. 12 Feb, 2021 4 commits
  6. 11 Feb, 2021 8 commits
    • Sylvain Gugger's avatar
      Add SageMakerTrainer for model paralellism (#10122) · 31245775
      Sylvain Gugger authored
      * Refactor things out of main train
      
      * Store signature
      
      * Add SageMakerTrainer
      
      * Init + Copyright
      
      * Address review comments
      31245775
    • Stas Bekman's avatar
      [DeepSpeed in notebooks] Jupyter + Colab (#10130) · b54cb0bd
      Stas Bekman authored
      * init devices/setup explicitly
      
      * docs + test
      
      * simplify
      
      * cleanup
      
      * cleanup
      
      * cleanup
      
      * correct the required dist setup
      
      * derive local_rank from env LOCAL_RANK
      b54cb0bd
    • Sylvain Gugger's avatar
      Typo fix · 6710d1d5
      Sylvain Gugger authored
      6710d1d5
    • Patrick von Platen's avatar
      Update README.md · 8e13b735
      Patrick von Platen authored
      8e13b735
    • Patrick von Platen's avatar
      Update ADD_BIG_BIRD.md · d6b4f48e
      Patrick von Platen authored
      d6b4f48e
    • Patrick von Platen's avatar
      [Wav2Vec2] Improve Tokenizer & Model for batched inference (#10117) · 495c157d
      Patrick von Platen authored
      * save intermediate
      
      * finish batch the same as fairseq
      
      * add normalization
      
      * fix batched input
      
      * add better comment
      
      * Update src/transformers/models/wav2vec2/modeling_wav2vec2.py
      
      * add nice docstring
      
      * add tokenizer tests
      
      * make all slow tests pass
      
      * finish PR
      
      * correct import
      495c157d
    • Tanmay Thakur's avatar
      Add new community notebook - Blenderbot (#10126) · 2f3b5f4d
      Tanmay Thakur authored
      * Update:community.md, new nb add
      
      * feat: updated grammar on  nb description
      
      * Update: Train summarizer for BlenderBotSmall
      2f3b5f4d
    • Qbiwan's avatar
      Update run_xnli.py to use Datasets library (#9829) · 8dcfaea0
      Qbiwan authored
      * remove xnli_compute_metrics, add load_dataset, load_metric, set_seed,metric.compute,load_metric
      
      * fix
      
      * fix
      
      * fix
      
      * push
      
      * fix
      
      * everything works
      
      * fix init
      
      * fix
      
      * special treatment for sepconv1d
      
      * style
      
      * 馃檹馃徑
      
      * add doc and cleanup
      
      
      * fix doc
      
      * fix doc again
      
      * fix doc again
      
      * Apply suggestions from code review
      
      * make style
      
      * Proposal that should work
      
      * Remove needless code
      
      * Fix test
      
      * Apply suggestions from code review
      
      * remove xnli_compute_metrics, add load_dataset, load_metric, set_seed,metric.compute,load_metric
      
      * amend README
      
      * removed data_args.task_name and replaced with task_name = "xnli"; use split function to load train and validation dataset separately; remove __post_init__; remove flag --task_name from README.
      
      * removed dict task_to_keys, use str "xnli" instead of variable task_name, change preprocess_function to use examples["premise"], examples["hypothesis"] directly, remove sentence1_key and sentence2_key, change compute_metrics function to cater only to accuracy metric, add condition for train_langauge is None when using dataset.load_dataset()
      
      * removed `torch.distributed.barrier()` and `import torch` as `from_pretrained` is able to do the work; amend README
      8dcfaea0
  7. 10 Feb, 2021 4 commits
    • Stas Bekman's avatar
      [DeepSpeed] restore memory for evaluation (#10114) · 77b86284
      Stas Bekman authored
      * free up memory at the end of train
      
      * rework tests
      
      * consistent formatting
      
      * correction
      77b86284
    • Suraj Patil's avatar
      remove adjust_logits_during_generation method (#10087) · c130e67d
      Suraj Patil authored
      * add forced logits processors
      
      * delete adjust_logits method
      
      * add forced_eos_token_id argument in config
      
      * add tests for forced logits processors
      
      * update gen utils tests
      
      * add forced option to tf generate
      
      * remove adjust_logits method from tf models
      
      * update adjust_logits for marian
      
      * delete _force_token_id_to_be_generated method
      
      * style
      
      * import warnings
      
      * pass max_length to _get_logits_processor
      
      * set forced_eos_token_id to None
      
      * set forced attributes in conf utils
      
      * typo
      
      * fix rag generate
      
      * add forced_eos_token_id in rag config
      
      * remove force_bos_token_to_be_generated from BartConfig
      
      * remove _force_token_ids_generation from FSMT
      
      * nit
      
      * fix negative constant
      
      * apply suggestions from code review
      c130e67d
    • Julien Plu's avatar
      Fix TF LED/Longformer attentions computation (#10007) · 22a32cf4
      Julien Plu authored
      * Fix test
      
      * Remove commented test
      
      * Fix name
      
      * Apply style
      
      * Fix check copies
      
      * Remove prints
      
      * Restore boolean
      
      * Fix reshape
      22a32cf4
    • Lysandre Debut's avatar