1. 26 Mar, 2024 6 commits
  2. 25 Mar, 2024 6 commits
  3. 24 Mar, 2024 1 commit
    • gamepad_coder's avatar
      model_summary.md - Restore link to Harvard's Annotated Transformer. (#29702) · 76a33a10
      gamepad_coder authored
      * model_summary.md - Add link to Harvard's Annotated Transformer.
      
      * model_summary.md - slight wording change + capitalize name of the paper
      
      * model_summary.md - moves the Annotated Transformer link in a praenthesis next to the link to the original paper (great idea, stevhliu!)
      
      * model_summary.md - moves the Annotated Transformer link in a praenthesis next to the link to the original paper (commit pt. 2, accidentally removed "has" in pt. 1)
      76a33a10
  4. 23 Mar, 2024 1 commit
  5. 22 Mar, 2024 10 commits
  6. 21 Mar, 2024 15 commits
  7. 20 Mar, 2024 1 commit
    • Arthur's avatar
      [`BC 4.37 -> 4.38`] for Llama family, memory and speed (#29753) · ff841900
      Arthur authored
      * attempt to fix
      
      * the actual fix that works with compilation!
      
      * this?
      
      * temporary update
      
      * nit?
      
      * dispatcg to memory efficient?
      
      * update both models that have static cache support
      
      * fix copies fix compile
      
      * make sure fix
      
      * fix cohere and gemma
      
      * fix beams?
      
      * nit
      
      * slipped through the cracks
      
      * nit
      
      * nits
      
      * update
      
      * fix-copies
      
      * skip failing tests
      
      * nits
      ff841900