1. 24 Jan, 2025 1 commit
  2. 20 Jan, 2025 1 commit
  3. 27 Dec, 2024 1 commit
  4. 24 Dec, 2024 1 commit
  5. 23 Dec, 2024 1 commit
  6. 09 Dec, 2024 6 commits
  7. 06 Dec, 2024 6 commits
  8. 04 Dec, 2024 1 commit
  9. 03 Dec, 2024 2 commits
    • Nicolas Patry's avatar
      Saving some VRAM. (#2790) · b57f3703
      Nicolas Patry authored
      * Saving some VRAM.
      
      - 8B on 4xL4 attention=flashdecoding . Before 4.28GB left, After 4.32GB
        left, so 400MB saved.
      
      - Effect not as visible on attention=flashinfer and n_shard=1. I suspect
        it's linked to the torch allocator.
      
      * Adding assertion.
      b57f3703
    • Daniël de Kok's avatar
      Sync (most) server dependencies with Nix (#2782) · 2003d8be
      Daniël de Kok authored
      
      
      * Sync (most) server dependencies with Nix
      
      Skipped most grpcio packages, because of protobuf version
      incompatibility with the opentelemetry packages.
      
      * Add a primitive script to generate Poetry commands to sync with Nix
      
      This is not fully automated, since getting the Nix versions may be
      unresolvable. However, it does take most of the work out of doing
      this manually.
      
      * Upgrade eetq ?
      
      * Fmt.
      
      ---------
      Co-authored-by: default avatarNicolas Patry <patry.nicolas@protonmail.com>
      2003d8be
  10. 02 Dec, 2024 4 commits
  11. 28 Nov, 2024 1 commit
    • drbh's avatar
      Support continue final message (#2733) · d4718051
      drbh authored
      * feat: support continue_final_message param in chat request
      
      * feat: add test for continue final message
      
      * fix: bump openapi docs
      
      * fix: remove continue_final_message chat request param
      
      * fix: remove unneeded launcher args in continue test
      
      * fix: bump test output
      
      * fix: remove accidentally included guideline from rebase
      
      * fix: remove guideline tests
      
      * fix: adjust continuation tests expected text
      
      * fix: replace expected output for continue test
      d4718051
  12. 26 Nov, 2024 3 commits
  13. 25 Nov, 2024 2 commits
  14. 22 Nov, 2024 2 commits
  15. 21 Nov, 2024 7 commits
  16. 20 Nov, 2024 1 commit