1. 17 Dec, 2024 5 commits
    • Blake Mizerany's avatar
      llm: do not error on "null" format (#8139) · 2ddc32d5
      Blake Mizerany authored
      This fixes another regression in the previous commit that fixed other
      known bugs.
      2ddc32d5
    • Jascha Beste's avatar
    • Blake Mizerany's avatar
      llm: do not silently fail for supplied, but invalid formats (#8130) · 87f0a49f
      Blake Mizerany authored
      Changes in #8002 introduced fixes for bugs with mangling JSON Schemas.
      It also fixed a bug where the server would silently fail when clients
      requested invalid formats. It also, unfortunately, introduced a bug
      where the server would reject requests with an empty format, which
      should be allowed.
      
      The change in #8127 updated the code to allow the empty format, but also
      reintroduced the regression where the server would silently fail when
      the format was set, but invalid.
      
      This commit fixes both regressions. The server does not reject the empty
      format, but it does reject invalid formats. It also adds tests to help
      us catch regressions in the future.
      
      Also, the updated code provides a more detailed error message when a
      client sends a non-empty, but invalid format, echoing the invalid format
      in the response.
      
      This commits also takes the opportunity to remove superfluous linter
      checks.
      87f0a49f
    • Jeffrey Morgan's avatar
    • Daniel Hiltgen's avatar
      darwin: restore multiple runners for x86 (#8125) · 8f805dd7
      Daniel Hiltgen authored
      In 0.5.2 we simplified packaging to have avx only for macos x86.  It looks like
      there may still be some non-AVX systems out there, so this puts back the prior
      logic of building no-AVX for the primary binary, and now 2 runners for avx and avx2.
      These will be packaged in the App bundle only, so the stand-alone binary will now be
      without AVX support on macos.  On arm, we'll also see these runners reported
      as available in the log, but they're dormant and will never be used at runtime.
      8f805dd7
  2. 16 Dec, 2024 2 commits
  3. 15 Dec, 2024 1 commit
  4. 14 Dec, 2024 2 commits
  5. 13 Dec, 2024 2 commits
  6. 12 Dec, 2024 2 commits
  7. 11 Dec, 2024 10 commits
  8. 10 Dec, 2024 8 commits
  9. 09 Dec, 2024 1 commit
    • Jesse Gross's avatar
      prompt: Don't trim whitespace from prompts · 900f64e6
      Jesse Gross authored
      New lines can be an important part of a user's prompt and trimming
      it can alter the results. We previously only trimmed prompts with
      images but refactoring brought this behavior to all prompts, where
      it became more noticable.
      
      The /generate endpoint adds less whitespace and therefore doesn't
      need to trim it out - this brings the same behavior to /chat.
      
      Thanks to @gabe-l-hart for spotting the issue!
      
      Fixes #7795
      900f64e6
  10. 08 Dec, 2024 2 commits
  11. 06 Dec, 2024 3 commits
  12. 05 Dec, 2024 2 commits