  1. 17 Sep, 2024 1 commit
  2. 12 Sep, 2024 2 commits
    • Add nix test. (#2513) · d95c670a
      Nicolas Patry authored
      * Add nix test.
      
      * Modifying yourself means you need to rerun.
      
      * Fixing the test + adding click (needed for pre-commit hooks).
      
      * Try this.
      
      * Our runner + pure test (not written)
      
      * Remove server.
      
      * Root user.
      
      * Different user ?
      
      * Add the actual test target.
      
      * Forgot this modification.
      
      * Add a formatter.
      
      * Add the secrets.
      
      * Fixed the auth token ?
      
      * Adding the other tests.
      
      * Missing pre-commit.
      
      * Test requires cargo for cargo fmt.
      
      * Update it a bit.
      
      * Up.
      
      * Attempting to use a cache location for the models.
      
      * Ignore the cache for now.
    • nix: support Python tokenizer conversion in the router (#2515) · 94304649
      Daniël de Kok authored
      Ideally we wouldn't have the router wrapper that this change adds,
      but when I give PyO3 a Python interpreter with packages, it ends
      up linking libpython from the Python interpreter rather than the
      constructed environment and cannot pick up the Python modules as
      a result.
  3. 06 Sep, 2024 1 commit
  4. 02 Sep, 2024 1 commit
  5. 29 Aug, 2024 1 commit
    • nix: build Torch against MKL and various other improvements (#2469) · 4e821c00
      Daniël de Kok authored
      Updates tgi-nix input:
      
      - Move Torch closer to upstream by building against MKL.
      - Remove compute capability 8.7 from Torch (Jetson).
      - Sync nixpkgs compute capabilities with Torch (avoids
        compiling too many capabilities for MAGMA).
      - Use nixpkgs configuration passed through by `tgi-nix`.
  6. 23 Aug, 2024 1 commit
    • nix: add default package (#2453) · f3c5d7d9
      Daniël de Kok authored
      The default package wraps the launcher and puts the server/router in the
      path.
      
      As a result, TGI can be started using something like:
      
      ```
      nix run .# -- \
        --model-id hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 \
        --port 8080
      ```
  7. 21 Aug, 2024 1 commit
  8. 20 Aug, 2024 2 commits
    • nix: add pure server to flake, add both pure and impure devshells (#2430) · f5f11b79
      Daniël de Kok authored
      * nix: pure server and support both pure and impure devShells
      
      * nix: remove unused poetry2nix input
      
      It is not wired up and we now have a pure server.
      
      * nix: add ipdb to impure devshell
    • Prefix caching (#2402) · b70ae096
      Nicolas Patry authored
      
      
      * Prefix caching WIP
      
      * Fixing prefix attention.
      
      * Fixing flashinfer import.
      
      * Fixing black.
      
      * Fixing medusa (still wrong outputs, but functional).
      
      * Just medusa values now.
      
      * Fixing medusa without prefix caching.
      
      * Fixing prefix caching.
      
      * Medusa requires reshaping.
      
      * Removing the logs.
      
      * Remove router.nix
      
      * Fixup:
      
      - Remove logs
      - Disable VLMs (they do not work)
      - Disable prefix caching when user wants prefill logprobs.
      
      * Update flake.lock
      
      ---------
      Co-authored-by: Daniël de Kok <me@danieldk.eu>
  9. 19 Aug, 2024 1 commit
  10. 16 Aug, 2024 1 commit
  11. 15 Aug, 2024 1 commit
  12. 14 Aug, 2024 2 commits
  13. 13 Aug, 2024 2 commits
  14. 12 Aug, 2024 3 commits
  15. 09 Aug, 2024 3 commits