1. 11 Mar, 2024 1 commit
  2. 09 Mar, 2024 1 commit
  3. 08 Mar, 2024 1 commit
  4. 01 Mar, 2024 1 commit
  5. 20 Feb, 2024 2 commits
  6. 14 Feb, 2024 1 commit
  7. 09 Feb, 2024 1 commit
    • Daniel Hiltgen's avatar
      Shutdown faster · 66807615
      Daniel Hiltgen authored
      Make sure that when a shutdown signal comes, we shutdown quickly instead
      of waiting for a potentially long exchange to wrap up.
      66807615
  8. 31 Jan, 2024 1 commit
  9. 22 Jan, 2024 1 commit
  10. 21 Jan, 2024 1 commit
  11. 17 Jan, 2024 1 commit
  12. 14 Jan, 2024 1 commit
  13. 11 Jan, 2024 1 commit
    • Daniel Hiltgen's avatar
      Support multiple variants for a given llm lib type · 8da7bef0
      Daniel Hiltgen authored
      In some cases we may want multiple variants for a given GPU type or CPU.
      This adds logic to have an optional Variant which we can use to select
      an optimal library, but also allows us to try multiple variants in case
      some fail to load.
      
      This can be useful for scenarios such as ROCm v5 vs v6 incompatibility
      or potentially CPU features.
      8da7bef0
  14. 10 Jan, 2024 1 commit
  15. 07 Jan, 2024 1 commit
  16. 04 Jan, 2024 1 commit