1. 06 Aug, 2024 1 commit
  2. 01 Aug, 2024 1 commit
    • Nikos Karampatziakis's avatar
      Offloaded KV Cache (#31325) · ca59d6f7
      Nikos Karampatziakis authored
      * Initial implementation of OffloadedCache
      
      * enable usage via cache_implementation
      
      * Address feedback, add tests, remove legacy methods.
      
      * Remove flash-attn, discover synchronization bugs, fix bugs
      
      * Prevent usage in CPU only mode
      
      * Add a section about offloaded KV cache to the docs
      
      * Fix typos in docs
      
      * Clarifications and better explanation of streams
      ca59d6f7
  3. 09 Jul, 2024 1 commit
  4. 23 May, 2024 1 commit
  5. 14 May, 2024 1 commit
  6. 02 May, 2024 1 commit
  7. 15 Apr, 2024 1 commit
  8. 28 Mar, 2024 1 commit
  9. 05 Mar, 2024 1 commit
  10. 16 Feb, 2024 1 commit
  11. 20 Dec, 2023 1 commit
  12. 29 Sep, 2023 1 commit
  13. 12 Sep, 2023 1 commit
  14. 04 Sep, 2023 1 commit
    • omahs's avatar
      Fix typos (#25936) · 0f0e1a2c
      omahs authored
      * fix typo
      
      * fix typo
      
      * fix typo
      
      * fix typos
      
      * fix typos
      
      * fix typo
      
      * fix typo
      
      * fix typo
      
      * fix typos
      
      * fix typo
      
      * fix typo
      
      * fix typo
      
      * fix typos
      
      * fix typos
      0f0e1a2c
  15. 30 Aug, 2023 1 commit
  16. 29 Aug, 2023 1 commit
  17. 27 Jun, 2023 1 commit
  18. 20 Jun, 2023 1 commit
  19. 16 May, 2023 1 commit
  20. 24 Apr, 2023 1 commit
  21. 18 Apr, 2023 1 commit
    • Joao Gante's avatar
      Generate: Add assisted generation (#22211) · 78cda46f
      Joao Gante authored
      * working mvp
      
      * remove breakpoint
      
      * fix commit
      
      * standardize outputs
      
      * tmp commit
      
      * tests almost ready
      
      * tmp commit
      
      * skip a few models
      
      * Add streaming; Docs and examples
      
      * document limitations
      
      * PR commits
      
      * Amy PR comments
      78cda46f
  22. 07 Apr, 2023 1 commit
  23. 30 Mar, 2023 2 commits
  24. 24 Mar, 2023 1 commit
  25. 17 Jan, 2023 1 commit