1. 30 Nov, 2021 1 commit
    • Daniel Stancl's avatar
      [Flax] Add FlaxBlenderbot (#13633) · faacd747
      Daniel Stancl authored
      
      
      * Init Flax implementation for Blenderbot
      
      * Add a majority of stuff except for tests
      
      * make style quality
      
      * Add tests and fix some bugs
      
      * Add tests
      
      * Clean source code and fix some bugs
      
      * Fix copies and docs
      
      * Fix jax device condition for tests
      
      * Fix layer norm in the encoder
      
      * Fix a few typos in the test file
      
      * make fix-copies
      
      * make fix-copies
      
      * fix layer norm
      
      * Fix Flax params dtype (#13090)
      
      * Fix PR reference (#13098)
      
      * make fix-copies
      
      * Update tests/test_modeling_flax_blenderbot.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      faacd747
  2. 07 Jul, 2021 1 commit
    • Daniel Stancl's avatar
      [Flax] Add FlaxMBart (#12236) · 61400e1e
      Daniel Stancl authored
      
      
      * Copy BART to MBart and rename some stuff
      
      * Add copy statements pointing to FlaxBart
      
      * Update/add some common files
      
      * Update shift_tokens_rigth + fix imports
      
      * Fix shift_tokens_right method according to MBart implementation
      
      * Update shift_tokens_right in tests accordingly
      
      * Fix the import issue and update docs file
      * make style quality
      
      * Do some minor changes according to patil-suraj suggestions
      
      * Change the order of normalization layer and attention
      
      * Add some copu statementes
      
      * Update generate method and add integration test for mBart
      
      * Make a few updates after a review
      
      Besides, add `lang_code_to_id` to MBartTokenizeFast
      
      * fix-copies; make style quality
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      
      * fix output type, style
      
      * add copied from
      
      * resolve conflicts
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      61400e1e