1. 17 Mar, 2023 1 commit
  2. 21 Feb, 2023 1 commit
  3. 15 Feb, 2023 1 commit
    • Steven Liu's avatar
      Refactor model summary (#21408) · 7a5533b2
      Steven Liu authored
      * first draft of model summary
      
      * restructure docs
      
      * finish first draft
      
      * minor reviews and edits
      
      * apply feedbacks
      
      * save important info, create new page for attention
      
      * add attention doc to toctree
      
      *  few more minor fixes
      7a5533b2
  4. 03 May, 2022 1 commit
    • Sanchit Gandhi's avatar
      [FlaxBert] Add ForCausalLM (#16995) · cd9274d0
      Sanchit Gandhi authored
      * [FlaxBert] Add ForCausalLM
      
      * make style
      
      * fix output attentions
      
      * Add RobertaForCausalLM
      
      * remove comment
      
      * fix fx-to-pt model loading
      
      * remove comment
      
      * add modeling tests
      
      * add enc-dec model tests
      
      * add big_bird
      
      * add electra
      
      * make style
      
      * make repo-consitency
      
      * add to docs
      
      * remove roberta test
      
      * quality
      
      * amend cookiecutter
      
      * fix attention_mask bug in flax bert model tester
      
      * tighten pt-fx thresholds to 1e-5
      
      * add 'copied from' statements
      
      * amend 'copied from' statements
      
      * amend 'copied from' statements
      
      * quality
      cd9274d0
  5. 04 Apr, 2022 1 commit
  6. 27 Dec, 2021 1 commit
  7. 22 Dec, 2021 1 commit