1. 09 Dec, 2024 5 commits
  2. 06 Dec, 2024 6 commits
  3. 04 Dec, 2024 1 commit
  4. 03 Dec, 2024 2 commits
    • Nicolas Patry's avatar
      Saving some VRAM. (#2790) · b57f3703
      Nicolas Patry authored
      * Saving some VRAM.
      
      - 8B on 4xL4 attention=flashdecoding . Before 4.28GB left, After 4.32GB
        left, so 400MB saved.
      
      - Effect not as visible on attention=flashinfer and n_shard=1. I suspect
        it's linked to the torch allocator.
      
      * Adding assertion.
      b57f3703
    • Daniël de Kok's avatar
      Sync (most) server dependencies with Nix (#2782) · 2003d8be
      Daniël de Kok authored
      
      
      * Sync (most) server dependencies with Nix
      
      Skipped most grpcio packages, because of protobuf version
      incompatibility with the opentelemetry packages.
      
      * Add a primitive script to generate Poetry commands to sync with Nix
      
      This is not fully automated, since getting the Nix versions may be
      unresolvable. However, it does take most of the work out of doing
      this manually.
      
      * Upgrade eetq ?
      
      * Fmt.
      
      ---------
      Co-authored-by: default avatarNicolas Patry <patry.nicolas@protonmail.com>
      2003d8be
  5. 02 Dec, 2024 4 commits
  6. 28 Nov, 2024 1 commit
    • drbh's avatar
      Support continue final message (#2733) · d4718051
      drbh authored
      * feat: support continue_final_message param in chat request
      
      * feat: add test for continue final message
      
      * fix: bump openapi docs
      
      * fix: remove continue_final_message chat request param
      
      * fix: remove unneeded launcher args in continue test
      
      * fix: bump test output
      
      * fix: remove accidentally included guideline from rebase
      
      * fix: remove guideline tests
      
      * fix: adjust continuation tests expected text
      
      * fix: replace expected output for continue test
      d4718051
  7. 26 Nov, 2024 3 commits
  8. 25 Nov, 2024 2 commits
  9. 22 Nov, 2024 2 commits
  10. 21 Nov, 2024 7 commits
  11. 20 Nov, 2024 5 commits
  12. 19 Nov, 2024 2 commits
    • drbh's avatar
    • drbh's avatar
      PR 2634 CI - Fix the tool_choice format for named choice by adapting OpenAIs scheme (#2645) · 5489406c
      drbh authored
      
      
      * add OpenAI like tool_choice for named choice
      
      * add tests
      
      * fix: run linter and bump api docs
      
      * fix: consolidate changes and remove old tool type
      
      * feat: improve, simplify and rename tool choice struct add required support and refactor
      
      * fix: simplify tool choice logic, improve tests, openapi and rust docs
      
      * fix: refactor away prepare_chat_input and improve tool grammar apply control flow
      
      * feat: update docs and add tool choice configuration section
      
      * fix: simplify naming, tool choice default and improve test
      
      * fix: adjust tool choice none logic, add test and small refactors
      
      * fix: add missing snapshot file
      
      * fix: adjust tool choice type in test
      
      * fix: adjust default when json tool choice is
      
      * fix: remove trailing space lint after rebase
      
      * fix: remove mostly mocked unit test
      
      ---------
      Co-authored-by: default avatarLinus Bierhoff <linus.bierhoff@icloud.com>
      5489406c