1. 16 Sep, 2025 3 commits
  2. 15 Sep, 2025 3 commits
  3. 12 Sep, 2025 5 commits
  4. 11 Sep, 2025 6 commits
  5. 10 Sep, 2025 5 commits
  6. 09 Sep, 2025 4 commits
  7. 08 Sep, 2025 4 commits
  8. 05 Sep, 2025 1 commit
  9. 04 Sep, 2025 2 commits
  10. 02 Sep, 2025 3 commits
  11. 31 Aug, 2025 2 commits
  12. 29 Aug, 2025 2 commits
    • perf: build graph for next batch async to keep GPU busy (#11863) · 517807cd
      Daniel Hiltgen authored
      * perf: build graph for next batch in parallel to keep GPU busy
      
      This refactors the main run loop of the ollama runner to perform the main
      GPU-intensive tasks (Compute+Floats) in a goroutine so we can prepare the next
      batch in parallel, reducing the amount of time the GPU stalls waiting for the
      next batch of work.
      
      * tests: tune integration tests for ollama engine
      
      This tunes the integration tests to focus more on models supported
      by the new engine.
    • Always filter devices (#12108) · ead4a9a1
      Daniel Hiltgen authored
      * Always filter devices
      
      Avoid crashing on unsupported AMD iGPUs
      
      * Remove cuda device filtering
      
      CUDA device filtering interferes with mixed setups