1. 19 Jun, 2024 1 commit
    • [`GPT2`] Add SDPA support (#31172) · b275a410
      Anton Vlasjuk authored
      * `gpt2` sdpa support
      
      * fix (at least) one test, style, repo consistency
      
      * fix sdpa mask in forward --> fixes generation
      
      * test
      
      * test2
      
      * test3
      
      * test4
      
      * simplify shapes for attn mask creation and small comments
      
      * hub fail test
      
      * benchmarks
      
      * flash attn 2 mask should not be inverted on enc-dec setup
      
      * fix comment
      
      * apply some suggestion from code review
      
      - only save _attn_implementation once
      - remove unnecessary comment
      
      * change elif logic
      
      * [run-slow] gpt2
      
      * modify `test_gpt2_sample_max_time` to follow previous assertion patterns
  2. 28 Mar, 2024 1 commit
  3. 03 Nov, 2023 1 commit
    • [Docs] Model_doc structure/clarity improvements (#26876) · 5964f820
      Maria Khalusova authored
      * first batch of structure improvements for model_docs
      
      * second batch of structure improvements for model_docs
      
      * more structure improvements for model_docs
      
      * more structure improvements for model_docs
      
      * structure improvements for cv model_docs
      
      * more structural refactoring
      
      * addressed feedback about image processors
  4. 20 Jun, 2023 1 commit
  5. 03 May, 2023 1 commit
  6. 02 May, 2023 1 commit
  7. 17 Mar, 2023 1 commit
  8. 21 Feb, 2023 1 commit
  9. 15 Feb, 2023 1 commit
    • Refactor model summary (#21408) · 7a5533b2
      Steven Liu authored
      * first draft of model summary
      
      * restructure docs
      
      * finish first draft
      
      * minor reviews and edits
      
      * apply feedback
      
      * save important info, create new page for attention
      
      * add attention doc to toctree
      
      * few more minor fixes
  10. 07 Feb, 2023 1 commit
  11. 29 Nov, 2022 1 commit
    • add in layer gpt2 tokenizer (#20421) · fb2b45e5
      Pi Esposito authored
      * add minimal working gpt2 tokenizer
      
      * graph mode and output equivalence tests working
      
      * not today tensorflow. serialization test passing!
      
      * fix style, documentation, docstrings and all that jazz
      
      * passing consistency checks
      
      * move keras nlp to tf dependencies
      
      * fix tf modeling utils and gpt2 attention to enable compiling
      
      * fix (I hope) keras nlp dependencies
      
      * revert changes on generation
      
      * remove debug prints
      
      * remove redundant tf dummy objects
      
      * add from config, get config and max length settings to address review
      
      * let flake ignore the error on distillation you are welcome
      
      * test from config
      
      * add padding test
      
      * address sgugger review
  12. 07 Nov, 2022 1 commit
  13. 27 Oct, 2022 1 commit
  14. 04 Apr, 2022 1 commit
  15. 22 Dec, 2021 1 commit