1. 23 Mar, 2021 1 commit
  2. 10 Mar, 2021 1 commit
  3. 09 Mar, 2021 1 commit
  4. 05 Mar, 2021 1 commit
  5. 03 Mar, 2021 1 commit
  6. 25 Feb, 2021 1 commit
    • Bugfix: Removal of padding_idx in BartLearnedPositionalEmbedding (#10200) · 894db670
      mingruimingrui authored
      
      
      * Assumption of padding_idx <2 might not stand
      
      * Use offset instead of 2
      
      * Fix with black
      
      * Change behavior to warning instead for backward compatibility.
      
      * Fix with black
      
      * Remove warning
      
      * Make padding_idx non-required
      
      * padding_idx fix for blenderbot
      
      * padding_idx fix for blenderbot_small
      
      * padding_idx fix for led
      
      * padding_idx fix for mbart
      
      * Remove extra whitespaces
      
      * padding_idx fix for template
      
      * Fix padding_idx passed to nn.Embedding mistake
      
      * Fixed padding_idx passed to positional embedding in template
      
      * Remove padding_idx from pytorch learned positional embeddings
      
      * Remove accidentally added quotes
      
      * Remove padding_idx from tf learned positional embeddings
      
      * Remove zeroing of weights in __init__
      Co-authored-by: Wang Ming Rui <mingrui.wang@C02CJTUYMD6M.local>
  7. 17 Feb, 2021 1 commit
    • Making TF BART-like models XLA and AMP compliant (#10191) · 83d803ba
      Julien Plu authored
      * Update BART
      
      * Update Blenderbot
      
      * Update BlenderbotSmall
      
      * Update Marian
      
      * Update MBart
      
      * Update MBart
      
      * Update Pegasus
      
      * Update template
      
      * Fix Marian and Pegasus
      
      * Apply style
      
      * Default initializer
      
      * Default initializer
      
      * Default initializer
      
      * Remove int32 casts
      
      * Fix template
      
      * Remove more cast
  8. 15 Feb, 2021 2 commits
  9. 11 Feb, 2021 2 commits
  10. 09 Feb, 2021 2 commits
  11. 08 Feb, 2021 4 commits
  12. 05 Feb, 2021 2 commits
  13. 04 Feb, 2021 2 commits
    • Fix model templates (#9999) · e89c959a
      Lysandre Debut authored
    • BartForCausalLM analogs to `ProphetNetForCausalLM` (#9128) · 00031785
      demSd authored
      
      
      * initialize bart4causalLM
      
      * create BartDecoderWrapper, setters/getters
      
      * delete spaces
      
      * forward and additional methods
      
      * update cache function, loss function, remove ngram* params in data class.
      
      * add bartcausallm, bartdecoder testing
      
      * correct bart for causal lm
      
      * remove at
      
      * add mbart as well
      
      * up
      
      * fix typo
      
      * up
      
      * correct
      
      * add pegasusforcausallm
      
      * add blenderbotforcausallm
      
      * add blenderbotsmallforcausallm
      
      * add marianforcausallm
      
      * add test for MarianForCausalLM
      
      * add Pegasus test
      
      * add BlenderbotSmall test
      
      * add blenderbot test
      
      * fix a fail
      
      * fix an import fail
      
      * a fix
      
      * fix
      
      * Update modeling_pegasus.py
      
      * fix models
      
      * fix inputs_embeds setting getter
      
      * adapt tests
      
      * correct repo utils check
      
      * finish test improvement
      
      * fix tf models as well
      
      * make style
      
      * make fix-copies
      
      * fix copies
      
      * run all tests
      
      * last changes
      
      * fix all tests
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
  14. 02 Feb, 2021 2 commits
  15. 01 Feb, 2021 2 commits
  16. 28 Jan, 2021 2 commits
  17. 27 Jan, 2021 5 commits
  18. 26 Jan, 2021 2 commits
  19. 25 Jan, 2021 1 commit
  20. 21 Jan, 2021 2 commits
    • Fix TF s2s models (#9478) · a7dabfb3
      Julien Plu authored
      * Fix Seq2Seq models for serving
      
      * Apply style
      
      * Fix Longformer
      
      * Fix mBart/Pegasus/Blenderbot
      
      * Apply style
      
      * Add a main intermediate layer
      
      * Apply style
      
      * Remove import
      
      * Apply tf.function to Longformer
      
      * Fix utils check_copy
      
      * Update S2S template
      
      * Fix BART + Blenderbot
      
      * Fix BlenderbotSmall
      
      * Fix BlenderbotSmall
      
      * Fix BlenderbotSmall
      
      * Fix MBart
      
      * Fix Marian
      
      * Fix Pegasus + template
      
      * Apply style
      
      * Fix common attributes test
      
      * Forgot to fix the LED test
      
      * Apply Patrick's comment on LED Decoder
    • Fix mixed precision in TF models (#9163) · 3f290e6c
      Julien Plu authored
      * Fix Gelu precision
      
      * Fix gelu_fast
      
      * Naming
      
      * Fix usage and apply style
      
      * add TF gelu approximate version
      
      * add TF gelu approximate version
      
      * add TF gelu approximate version
      
      * Apply style
      
      * Fix albert
      
      * Remove the usage of the Activation layer
  21. 20 Jan, 2021 2 commits
  22. 19 Jan, 2021 1 commit