1. 09 Jun, 2024 1 commit
  2. 04 Jun, 2024 2 commits
  3. 01 Jun, 2024 1 commit
  4. 30 May, 2024 1 commit
  5. 29 May, 2024 1 commit
  6. 28 May, 2024 2 commits
  7. 25 May, 2024 1 commit
  8. 24 May, 2024 1 commit
  9. 23 May, 2024 2 commits
  10. 20 May, 2024 1 commit
    • Sam's avatar
      feat: add support for flash_attn (#4120) · e15307fd
      Sam authored
      * feat: enable flash attention if supported
      
      * feat: enable flash attention if supported
      
      * feat: enable flash attention if supported
      
      * feat: add flash_attn support
      e15307fd
  11. 15 May, 2024 2 commits
  12. 14 May, 2024 1 commit
  13. 11 May, 2024 1 commit
  14. 10 May, 2024 2 commits
  15. 09 May, 2024 5 commits
  16. 08 May, 2024 1 commit
  17. 07 May, 2024 1 commit
  18. 06 May, 2024 3 commits
  19. 05 May, 2024 1 commit
    • Daniel Hiltgen's avatar
      Centralize server config handling · f56aa200
      Daniel Hiltgen authored
      This moves all the env var reading into one central module
      and logs the loaded config once at startup which should
      help in troubleshooting user server logs
      f56aa200
  20. 01 May, 2024 4 commits
  21. 29 Apr, 2024 1 commit
  22. 26 Apr, 2024 1 commit
  23. 25 Apr, 2024 1 commit
  24. 23 Apr, 2024 3 commits