1. 26 Jun, 2024 3 commits
    • Anton Vlasjuk's avatar
      [`GPT-NeoX`] Add SDPA support (#31031) · b07770c5
      Anton Vlasjuk authored
      * starting support for sdpa in `gptneox` models
      
      * small comment on tests
      
      * fix dropout
      
      * documentation and style
      
      * clarify concrete paths for reference
      
      * generalise attn projections and rope application
      
      added head mask check to sdpa mask creation
      
      handle sdpa memory backend bug via own version flag
      
      * update docs and style
      
      * move dtype casting outside of general attn_projection_and_rope function
      
      fix flash_attn_2 stuff
      
      * more generic attn warning if output_attns or head_mask
      
      * simplify head mask check by moving head mask creation to a later point
      
      * remove copied llama artifact
      
      * remove padding_mask from attention function signature
      
      * removing unnecessary comments, only "save" attn implementation once
      
      * [run_slow] gpt_neox
      b07770c5
    • Vladimir Iashin's avatar
      Removed unnecessary `self.projection` call in `VivitTubeletEmbeddings` (#31632) · 1218e439
      Vladimir Iashin authored
      removes unnecessary second projection call
      1218e439
    • Saurav Maheshkar's avatar
      docs: move translations to `i18n` (#31584) · 2daf2c3e
      Saurav Maheshkar authored
      docs: move translations to i18n
      2daf2c3e
  2. 25 Jun, 2024 6 commits
  3. 24 Jun, 2024 10 commits
  4. 23 Jun, 2024 1 commit
  5. 21 Jun, 2024 4 commits
  6. 20 Jun, 2024 7 commits
  7. 19 Jun, 2024 9 commits