1. 08 May, 2025 1 commit
  2. 09 Dec, 2024 1 commit
    • prompt: Don't trim whitespace from prompts · 900f64e6
      Jesse Gross authored
      Newlines can be an important part of a user's prompt, and trimming
      them can alter the results. We previously only trimmed prompts with
      images, but refactoring brought this behavior to all prompts, where
      it became more noticeable.
      
      The /generate endpoint adds less whitespace and therefore doesn't
      need to trim it out - this brings the same behavior to /chat.
      
      Thanks to @gabe-l-hart for spotting the issue!
      
      Fixes #7795
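A minimal sketch of why the trimming described in the commit above matters. The function names `renderTrimmed` and `renderVerbatim` are hypothetical and are not taken from the repository; they only contrast a prompt passed through `strings.TrimSpace` with one passed through unchanged.

```go
package main

import (
	"fmt"
	"strings"
)

// renderTrimmed is a hypothetical stand-in for the old behavior:
// leading and trailing whitespace is stripped from the prompt.
func renderTrimmed(prompt string) string {
	return strings.TrimSpace(prompt)
}

// renderVerbatim is a hypothetical stand-in for the new behavior:
// the prompt is passed through exactly as the user wrote it.
func renderVerbatim(prompt string) string {
	return prompt
}

func main() {
	// The trailing newline tells a completion model to continue on a new line.
	prompt := "def add(a, b):\n"

	fmt.Printf("trimmed:  %q\n", renderTrimmed(prompt))
	fmt.Printf("verbatim: %q\n", renderVerbatim(prompt))
}
```

With the trimmed form the model sees a prompt that ends at the colon, so it may continue on the same line. The verbatim form keeps the newline the user typed.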
  3. 17 Nov, 2024 1 commit
  4. 30 Oct, 2024 1 commit
    • runner.go: Better abstract vision model integration · c826e574
      Jesse Gross authored
      - Update mllama to take the cross attention state as embeddings in
        a batch, more similar to how Llava handles it. This improves
        integration with the input cache.
      - Pass locations in a prompt for embeddings using tags similar to Llava.
      - Abstract interface to vision models so the main runner accesses Clip
        and Mllama similarly.

      Co-authored-by: Michael Yang <mxyng@pm.me>
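The ideas in the commit above (a shared interface for vision models, and tags that mark where embeddings belong in a prompt) could be sketched roughly as follows. Everything here is an assumption made for illustration: the `VisionEncoder` interface, the stub types, the `[img-N]` tag syntax, and the helper names are hypothetical and do not reflect the actual code in `runner.go`.

```go
package main

import (
	"fmt"
	"regexp"
	"strconv"
)

// VisionEncoder is a hypothetical common interface, so that a caller
// can treat different vision models the same way.
type VisionEncoder interface {
	Name() string
	Embed(image []byte) []float32
}

// clipStub and mllamaStub are placeholders, not real model bindings.
type clipStub struct{}

func (clipStub) Name() string                 { return "clip" }
func (clipStub) Embed(image []byte) []float32 { return []float32{float32(len(image))} }

type mllamaStub struct{}

func (mllamaStub) Name() string                 { return "mllama" }
func (mllamaStub) Embed(image []byte) []float32 { return []float32{float32(len(image)), 0} }

// imageTag matches an assumed tag syntax such as "[img-0]".
var imageTag = regexp.MustCompile(`\[img-(\d+)\]`)

// imageIDs returns the image IDs referenced by tags, in prompt order.
func imageIDs(prompt string) []int {
	var ids []int
	for _, m := range imageTag.FindAllStringSubmatch(prompt, -1) {
		id, err := strconv.Atoi(m[1])
		if err != nil {
			continue // skip IDs that do not fit in an int
		}
		ids = append(ids, id)
	}
	return ids
}

// embedAll produces one embedding per tagged image that is present in images.
func embedAll(enc VisionEncoder, prompt string, images map[int][]byte) [][]float32 {
	var out [][]float32
	for _, id := range imageIDs(prompt) {
		if img, ok := images[id]; ok {
			out = append(out, enc.Embed(img))
		}
	}
	return out
}

func main() {
	images := map[int][]byte{0: {1, 2, 3}}
	prompt := "Describe [img-0] briefly."
	for _, enc := range []VisionEncoder{clipStub{}, mllamaStub{}} {
		fmt.Println(enc.Name(), len(embedAll(enc, prompt, images)))
	}
}
```

The point of the sketch is only that the calling loop is identical for both encoders. How the real runner batches embeddings or interacts with the input cache is not shown.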
  5. 18 Oct, 2024 1 commit
  6. 02 Aug, 2024 1 commit
  7. 16 Jul, 2024 1 commit
  8. 15 Jul, 2024 1 commit
  9. 13 Jul, 2024 1 commit
  10. 11 Jul, 2024 1 commit
  11. 05 Jul, 2024 2 commits
  12. 01 Jul, 2024 1 commit
  13. 26 Mar, 2024 1 commit
  14. 29 Feb, 2024 1 commit
  15. 16 Feb, 2024 1 commit
  16. 12 Feb, 2024 1 commit