1. 22 Jul, 2025 1 commit
  2. 20 Jul, 2025 2 commits
  3. 19 Jul, 2025 1 commit
  4. 17 Jul, 2025 5 commits
  5. 16 Jul, 2025 3 commits
  6. 11 Jul, 2025 4 commits
  7. 09 Jul, 2025 1 commit
    • Jesse Gross's avatar
      ggml: Report ordinal IDs for AMD GPUs on Windows · 35fda7b4
      Jesse Gross authored
      We don't get valid UUIDs for AMD GPUs on Windows, so the best option
      is to use the ordinal IDs. This brings us in line with what we currently
      do on the Ollama server - the only exception is AMD GPUs on Linux, which
      falls back to using ordinal IDs. The GGML implementation has no fallback
      but it doesn't appear to occur for any of the GPUs that we support.
      
      It's also possible that there are collisions between ordinal IDs for
      different libraries - however the only places where we use them are
      AMD on Windows and Metal on Mac, which can never occur on the same
      system.
      35fda7b4
  8. 08 Jul, 2025 3 commits
    • Daniel Hiltgen's avatar
      doc: add MacOS docs (#11334) · 66fb8575
      Daniel Hiltgen authored
      also removes stale model dir instructions for windows
      66fb8575
    • Daniel Hiltgen's avatar
      Reduce default parallelism to 1 (#11330) · 20c3266e
      Daniel Hiltgen authored
      The current scheduler algorithm of picking the paralellism based on available
      VRAM complicates the upcoming dynamic layer memory allocation algorithm.  This
      changes the default to 1, with the intent going forward that parallelism is
      explicit and will no longer be dynamically determined.  Removal of the dynamic
      logic will come in a follow up.
      20c3266e
    • Daniel Hiltgen's avatar
      API/CLI context enhancements (#11331) · 34088dbc
      Daniel Hiltgen authored
      * API: expose context size of loaded models
      
      * CLI: add context UX
      
      This adds a column in the ps output to show the models context size.
      34088dbc
  9. 07 Jul, 2025 4 commits
  10. 06 Jul, 2025 1 commit
  11. 05 Jul, 2025 3 commits
  12. 03 Jul, 2025 1 commit
  13. 02 Jul, 2025 1 commit
  14. 01 Jul, 2025 1 commit
  15. 30 Jun, 2025 1 commit
  16. 29 Jun, 2025 1 commit
  17. 27 Jun, 2025 3 commits
  18. 26 Jun, 2025 4 commits