1. 10 Mar, 2021 7 commits
    • Extend trainer logging for sm (#10633) · 49c61a4a
      Philipp Schmid authored
      * renamed logging to hf_logging
      
      * changed logging from hf_logging to logging and logging to native_logging
      
      * removed everything trying to fix import Trainer error
      
      * adding imports again
      
      * added custom add_handler function to logging.py
      
      * make style
      
      * added remove_handler
      
      * added another conditional to assert
    • Fix GPU tests with speech · 1aa9c13f
      Sylvain Gugger authored
    • Copy tokenizer files in each of their repo (#10624) · 2295d783
      Sylvain Gugger authored
      * Move tokenizer files in each repo
      
      * Fix mBART50 tests
      
      * Fix mBART tests
      
      * Fix Marian tests
      
      * Update templates
    • Speech2TextTransformer (#10175) · d26b37e7
      Suraj Patil authored
      * s2t
      
      * fix config
      
      * conversion script
      
      * fix import
      
      * add tokenizer
      
      * fix tok init
      
      * fix tokenizer
      
      * first version working
      
      * fix embeds
      
      * fix lm head
      
      * remove extra heads
      
      * fix convert script
      
      * handle encoder attn mask
      
      * style
      
      * better enc attn mask
      
      * override _prepare_attention_mask_for_generation
      
      * handle attn_mask in encoder and decoder
      
      * input_ids => input_features
      
      * enable use_cache
      
      * remove old code
      
      * expand embeddings if needed
      
      * remove logits bias
      
      * masked_lm_loss => loss
      
      * hack tokenizer to support feature processing
      
      * fix model_input_names
      
      * style
      
      * fix error message
      
      * doc
      
      * remove inputs_embeds
      
      * remove input_embeds
      
      * remove unnecessary docstring
      
      * quality
      
      * SpeechToText => Speech2Text
      
      * style
      
      * remove shared_embeds
      
      * subsample => conv
      
      * remove Speech2TextTransformerDecoderWrapper
      
      * update output_lengths formula
      
      * fix table
      
      * remove max_position_em...
    • Add new GLUE example with no Trainer. (#10555) · efb5c0a4
      Sylvain Gugger authored
      * Add new GLUE example with no Trainer.
      
      * Style
      
      * Address review comments
    • remove final_logits_bias (#10606) · 44f64132
      Suraj Patil authored
    • Fixes an issue in `text-classification` where MNLI eval/test datasets are not being preprocessed. (#10621) · 6f52fce6
      Allen Wang authored
      * Fix MNLI tests
      
      * Linter fix
  2. 09 Mar, 2021 9 commits
  3. 08 Mar, 2021 20 commits
  4. 06 Mar, 2021 3 commits
    • Add m2m100 (#10236) · f6e74a63
      Suraj Patil authored
      * m2m_100
      
      * no layernorm_embedding
      
      * sinusoidal positional embeddings
      
      * update pos embeddings
      
      * add default config values
      
      * tokenizer
      
      * add conversion script
      
      * fix config
      
      * fix pos embed
      
      * remove _float_tensor
      
      * update tokenizer
      
      * update lang codes
      
      * handle lang codes
      
      * fix pos embeds
      
      * fix spm key
      
      * put embedding weights on device
      
      * remove qa and seq classification heads
      
      * fix convert script
      
      * lang codes on one line
      
      * fix embeds
      
      * fix tokenizer
      
      * fix tokenizer
      
      * add fast tokenizer
      
      * style
      
      * M2M100MT => M2M100
      
      * fix copyright, style
      
      * tokenizer converter
      
      * vocab file
      
      * remove fast tokenizer
      
      * fix embeds
      
      * fix tokenizer
      
      * fix tests
      
      * add tokenizer tests
      
      * add integration test
      
      * quality
      
      * fix model name
      
      * fix test
      
      * doc
      
      * doc
      
      * fix doc
      
      * add copied from statements
      
      * fix tokenizer tests
      
      * apply review suggestions
      
      * fix urls
      
      * fix shift_tokens_right
      
      * apply review suggestions
      
      * fix
      
      * fix doc
      
      * add lang code to id
      
      * remove unused function
      
      * update checkpoint names
      
      * fix copy
      
      * fix tokenizer
      
      * fix checkpoint names
      
      * fix merge issue
      
      * style
    • Temporarily disable stale bot · fd011044
      Lysandre authored
    • offline mode for firewalled envs (#10407) · 88a951e3
      Stas Bekman authored
      * offline mode start
      
      * add specific values
      
      * fix fallback
      
      * add test
      
      * better values check and range
      
      * test that actually works
      
      * document the offline mode
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * more strict check
      
      * cleaner test
      
      * pt-only test
      
      * style
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  5. 05 Mar, 2021 1 commit
    • Refactoring checkpoint names for multiple models (#10527) · 90ecc296
      Daniel Hug authored
      * Refactor checkpoint name in ALBERT and ALBERT_tf
      
      * Refactor checkpoint name in BART and BART_tf
      
      * Refactor checkpoint name in BERT generation
      
      * Refactor checkpoint name in Blenderbot_tf
      
      * Refactor checkpoint name in Blenderbot_small_tf
      
      * Refactor checkpoint name in ConvBERT AND CONVBERT_TF
      
      * Refactor checkpoint name in CTRL AND CTRL_TF
      
      * Refactor checkpoint name in DistilBERT AND DistilBERT_TF
      
      * Refactor checkpoint name in DistilBERT redo
      
      * Refactor checkpoint name in Electra and Electra_tf
      
      * Refactor checkpoint name in FlauBERT and FlauBERT_tf
      
      * Refactor checkpoint name in FSMT
      
      * Refactor checkpoint name in GPT2 and GPT2_tf
      
      * Refactor checkpoint name in IBERT
      
      * Refactor checkpoint name in LED and LED_tf
      
      * Refactor checkpoint name in Longformer and Longformer_tf
      
      * Refactor checkpoint name in Lxmert and Lxmert_tf
      
      * Refactor checkpoint name in Marian_tf
      
      * Refactor checkpoint name in MBART and MBART_tf
      
      * Refactor checkpoint name in MobileBERT and MobileBERT_tf
      
      * Refactor checkpoint name in mpnet and mpnet_tf
      
      * Refactor checkpoint name in openai and openai_tf
      
      * Refactor checkpoint name in pegasus_tf
      
      * Refactor checkpoint name in reformer
      
      * Refactor checkpoint name in Roberta and Roberta_tf
      
      * Refactor checkpoint name in SqueezeBert
      
      * Refactor checkpoint name in Transformer_xl and Transformer_xl_tf
      
      * Refactor checkpoint name in XLM and XLM_tf
      
      * Refactor checkpoint name in XLNET and XLNET_tf
      
      * Refactor checkpoint name in BERT_tf
      
      * run make tests, style, quality, fixup