1. 13 Feb, 2025 1 commit
  2. 13 Dec, 2024 1 commit
  3. 11 Dec, 2024 1 commit
    • Blake Mizerany's avatar
      llama: preserve field order in user-defined JSON schemas (#8002) · 9039c821
      Blake Mizerany authored
      Previously we decoded and re-encoded JSON schemas during validation,
      which served no purpose since json.RawMessage already validates JSON
      syntax. Worse, the re-encoding lost field ordering from the original
      schema, which affects inference quality during step-by-step reasoning.
      
      While fixing this ordering issue by using json.RawMessage directly,
      testing revealed that schema_to_grammar (from llama.cpp) also fails to
      preserve field order during grammar generation. This appears to be the
      root cause of inference degradation.
      
      This change prevents us from mangling the user's original schema order,
      but we still need to address the ordering issue in schema_to_grammar.
      That will be a separate change.
      
      Updates #7978
      9039c821
  4. 05 Dec, 2024 1 commit
  5. 30 Nov, 2024 1 commit
  6. 27 Nov, 2024 2 commits
  7. 07 Sep, 2024 2 commits
  8. 06 Sep, 2024 1 commit
  9. 02 Aug, 2024 1 commit
  10. 01 Aug, 2024 1 commit
  11. 29 Jul, 2024 1 commit
  12. 19 Jul, 2024 2 commits
  13. 17 Jul, 2024 2 commits
  14. 16 Jul, 2024 1 commit
    • royjhan's avatar
      OpenAI: /v1/embeddings compatibility (#5285) · 987dbab0
      royjhan authored
      
      
      * OpenAI v1 models
      
      * Empty List Testing
      
      * Add back envconfig
      
      * v1/models docs
      
      * Remove Docs
      
      * OpenAI batch embed compatibility
      
      * merge conflicts
      
      * integrate with api/embed
      
      * ep
      
      * merge conflicts
      
      * request tests
      
      * rm resp test
      
      * merge conflict
      
      * merge conflict
      
      * test fixes
      
      * test fn renaming
      
      * input validation for empty string
      
      ---------
      Co-authored-by: default avatarjmorganca <jmorganca@gmail.com>
      987dbab0
  15. 14 Jul, 2024 1 commit
  16. 09 Jul, 2024 1 commit
  17. 02 Jul, 2024 2 commits
  18. 14 Jun, 2024 1 commit
  19. 04 Jun, 2024 1 commit
  20. 11 May, 2024 1 commit
  21. 09 May, 2024 1 commit
  22. 26 Mar, 2024 1 commit
  23. 07 Feb, 2024 1 commit