1. 26 Apr, 2024 1 commit
      [`BERT`] Add support for sdpa (#28802) · dfa7b580
      JB (Don) authored
      * Adding SDPA support for BERT
      
      * Using the proper input name for testing model input in inference()
      
      * Adding documentation for SDPA in BERT model page
      
      * Use the stable link for the documentation
      
      * Adding a gate to only call .contiguous() for torch < 2.2.0
      
      * Additions and fixes to the documentation
      
      * Minor updates to documentation
      
      * Adding extra requirements needed for the contiguous() bug
      
      * Adding "Adapted from" in place of the "Copied from"
      
      * Add benchmark speedup tables to the documentation
      
      * Minor fixes to the documentation
      
      * Use ClapText as a replacement for Bert in the Copied-From
      
      * Some more fixes for the fix-copies references
      
      * Overriding the test_eager_matches_sdpa_generate in bert tests to not load with low_cpu_mem_usage
      
      [test all]
      
      * Undo changes to separate test
      
      * Refactored SDPA self attention code for KV projections
      
      * Change use_sdpa to attn_implementation
      
      * Fix test_sdpa_can_dispatch_on_flash by preparing input (required for MultipleChoice models)
  2. 11 Mar, 2024 1 commit
  3. 03 Nov, 2023 1 commit
      [Docs] Model_doc structure/clarity improvements (#26876) · 5964f820
      Maria Khalusova authored
      * first batch of structure improvements for model_docs
      
      * second batch of structure improvements for model_docs
      
      * more structure improvements for model_docs
      
      * more structure improvements for model_docs
      
      * structure improvements for cv model_docs
      
      * more structural refactoring
      
      * addressed feedback about image processors
  4. 20 Jun, 2023 1 commit
  5. 17 Mar, 2023 1 commit
  6. 21 Feb, 2023 1 commit
  7. 15 Feb, 2023 1 commit
      Refactor model summary (#21408) · 7a5533b2
      Steven Liu authored
      * first draft of model summary
      
      * restructure docs
      
      * finish first draft
      
      * minor reviews and edits
      
      * apply feedback
      
      * save important info, create new page for attention
      
      * add attention doc to toctree
      
      * few more minor fixes
  8. 01 Nov, 2022 1 commit
      Add BERT resources (#19852) · dec8578e
      Steven Liu authored
      * add resources for bert
      
      * add course chapters
      
      * apply reviews
      
      * add pipeline icons and community resource
      
      * fix buttons
  9. 27 Jun, 2022 1 commit
      Add a TF in-graph tokenizer for BERT (#17701) · ee0d001d
      Matt authored
      * Add a TF in-graph tokenizer for BERT
      
      * Add from_pretrained
      
      * Add proper truncation, option handling to match other tokenizers
      
      * Add proper imports and guards
      
      * Add test, fix all the bugs exposed by said test
      
      * Fix truncation of paired texts in graph mode, more test updates
      
      * Small fixes, add a (very careful) test for savedmodel
      
      * Add tensorflow-text dependency, make fixup
      
      * Update documentation
      
      * Update documentation
      
      * make fixup
      
      * Slight changes to tests
      
      * Add some docstring examples
      
      * Update tests
      
      * Update tests and add proper lowercasing/normalization
      
      * make fixup
      
      * Add docstring for padding!
      
      * Mark slow tests
      
      * make fixup
      
      * Fall back to BertTokenizerFast if BertTokenizer is unavailable
      
      * Fall back to BertTokenizerFast if BertTokenizer is unavailable
      
      * make fixup
      
      * Properly handle tensorflow-text dummies
  10. 03 May, 2022 1 commit
      [FlaxBert] Add ForCausalLM (#16995) · cd9274d0
      Sanchit Gandhi authored
      * [FlaxBert] Add ForCausalLM
      
      * make style
      
      * fix output attentions
      
      * Add RobertaForCausalLM
      
      * remove comment
      
      * fix fx-to-pt model loading
      
      * remove comment
      
      * add modeling tests
      
      * add enc-dec model tests
      
      * add big_bird
      
      * add electra
      
      * make style
      
      * make repo-consistency
      
      * add to docs
      
      * remove roberta test
      
      * quality
      
      * amend cookiecutter
      
      * fix attention_mask bug in flax bert model tester
      
      * tighten pt-fx thresholds to 1e-5
      
      * add 'copied from' statements
      
      * amend 'copied from' statements
      
      * amend 'copied from' statements
      
      * quality
  11. 04 Apr, 2022 1 commit
  12. 17 Dec, 2021 1 commit