1. 08 Jul, 2025 3 commits
    • Daniel Hiltgen's avatar
      doc: add MacOS docs (#11334) · 66fb8575
      Daniel Hiltgen authored
      also removes stale model dir instructions for windows
      66fb8575
    • Daniel Hiltgen's avatar
      Reduce default parallelism to 1 (#11330) · 20c3266e
      Daniel Hiltgen authored
      The current scheduler algorithm of picking the paralellism based on available
      VRAM complicates the upcoming dynamic layer memory allocation algorithm.  This
      changes the default to 1, with the intent going forward that parallelism is
      explicit and will no longer be dynamically determined.  Removal of the dynamic
      logic will come in a follow up.
      20c3266e
    • Daniel Hiltgen's avatar
      API/CLI context enhancements (#11331) · 34088dbc
      Daniel Hiltgen authored
      * API: expose context size of loaded models
      
      * CLI: add context UX
      
      This adds a column in the ps output to show the models context size.
      34088dbc
  2. 07 Jul, 2025 4 commits
  3. 06 Jul, 2025 1 commit
  4. 05 Jul, 2025 3 commits
  5. 03 Jul, 2025 1 commit
  6. 02 Jul, 2025 1 commit
  7. 01 Jul, 2025 1 commit
  8. 30 Jun, 2025 1 commit
  9. 29 Jun, 2025 1 commit
  10. 27 Jun, 2025 3 commits
  11. 26 Jun, 2025 4 commits
  12. 25 Jun, 2025 5 commits
  13. 24 Jun, 2025 3 commits
  14. 23 Jun, 2025 4 commits
  15. 20 Jun, 2025 4 commits
  16. 19 Jun, 2025 1 commit