"src/libtorio/ffmpeg/ffmpeg.h" did not exist on "f663cb2874f918e7663c0daa1d3a3ae8636349ad"
  1. 26 Mar, 2025 1 commit
    • Jesse Gross's avatar
      kvcache: Sliding window cache only needs a single batch total · 1feff619
      Jesse Gross authored
      When computing the size of the cache for sliding window attention,
      we don't need to multiple the batch size by the number of parallel
      sequences - the batch size is constant.
      
      This also simplifies the check for whether to allocate the cache
      size based on capacity or window size as the batch size is already
      incorporated into the capacity when handled by the runner.
      1feff619
  2. 25 Mar, 2025 1 commit
  3. 24 Mar, 2025 1 commit
  4. 21 Mar, 2025 12 commits
  5. 20 Mar, 2025 6 commits
  6. 19 Mar, 2025 2 commits
  7. 18 Mar, 2025 2 commits
  8. 17 Mar, 2025 9 commits
  9. 15 Mar, 2025 3 commits
    • Patrick Devine's avatar
      fix: correctly save in interactive mode (#9788) · 2c8b4846
      Patrick Devine authored
      This fixes the case where a FROM line in previous modelfile points to a
      file which may/may not be present in a different ollama instance. We
      shouldn't be relying on the filename though and instead just check if
      the FROM line was instead a valid model name and point to that instead.
      2c8b4846
    • Blake Mizerany's avatar
      server/internal/client/ollama: set User-Agent for registry client (#9775) · 82946761
      Blake Mizerany authored
      This sets the agent header in DefaultRegistry to include the version of
      the client, OS, and architecture in the previous format, with a minor
      twist.
      
      Note: The version is obtained from the build info, instead of the
      version in version.Version, which should not longer be necessary, but we
      can remove in a future commit. Using the build info is more accurate and
      also provides extra build information if the build is not tagged, and if
      it is "dirty". Previously, the version was just "0.0.0" with no other
      helpful information. The ollama.com registry and others handle this
      swimmingly.
      82946761
    • Patrick Devine's avatar
      gemma3 quantization (#9776) · ef378ad6
      Patrick Devine authored
      ef378ad6
  10. 14 Mar, 2025 3 commits
    • Daniel Hiltgen's avatar
      Align versions for local builds (#9635) · 2d2247e5
      Daniel Hiltgen authored
      Darwin was using a different pattern for the version string
      than linux or windows.
      2d2247e5
    • Jesse Gross's avatar
      gemma3: Allow multiple image in a single input · 7bf793a6
      Jesse Gross authored
      Previously processing multiple images in a batch would trigger
      segfaults so sending images together was disabled as a way to
      mitigate this. The trigger was processing one image on the CPU
      and one on the GPU.
      
      This can no longer happen:
       - The vision encoder is now on the GPU so both images would be
         processed on the GPU.
       - We require images to be fully contained in a batch and each
         image including its special tokens is over half the batch size.
         As a result, we will never get two images in the same batch.
      
      Fixes #9731
      7bf793a6
    • Jesse Gross's avatar
      ollamarunner: Use a separate context per multimodal input · 282bfaaa
      Jesse Gross authored
      Currently there is a single context per sequence, shared all by
      all multimodal inputs. Since we build a vision encoder graph per
      image, with a large number of inputs we can eventually hit the
      maximum number of graph nodes per context.
      
      This changes to use a separate context for each image, ensuring
      that available resource limits are consistent.
      282bfaaa