1. 15 May, 2025 1 commit
    • Abrar Shivani's avatar
      feat: Use existing Tokio runtime in components (#941) · 2a5eb7e7
      Abrar Shivani authored
      The runtime library already provides a from_current method that creates and returns a Runtime object initialized with the current Tokio runtime handle. Since components do not use the runtime library directly but access it through the worker, the worker needs to be updated to create itself using a Runtime instance derived from the current Tokio runtime.
      This PR updates the http component and the worker to use the existing Tokio runtime instead of creating a new one. Other components can be similarly updated to run using the existing runtime.
      2a5eb7e7
  2. 14 May, 2025 1 commit
  3. 08 May, 2025 2 commits
  4. 06 May, 2025 1 commit
  5. 01 May, 2025 1 commit
  6. 29 Apr, 2025 2 commits
    • Hongkuan Zhou's avatar
      562c7f51
    • Graham King's avatar
      chore: Split PushRouter from Client (#817) · a1a10365
      Graham King authored
      In a distributed system we don't know if the remote workers need pre-processing done ingress-side or not. Previously Client required us to decide this before discovering the remote endpoints, which was fine because pre-processing was worker-side.
      
      As part of moving pre-processing back to ingress-side we need to split this into two steps:
      - Client discovers the endpoints, and (later PR) will fetch their Model Deployment Card.
      - PushRouter will use the Model Deployment Card to decide if they need pre-processing or not, which affects the types of the generic parameters.
      
      Part of #743
      a1a10365
  7. 28 Apr, 2025 2 commits
  8. 26 Apr, 2025 1 commit
  9. 11 Apr, 2025 1 commit
  10. 04 Apr, 2025 1 commit
    • Graham King's avatar
      feat: Python decorator dynamo_worker takes optional `static` parameter without etcd (#494) · 88ad3425
      Graham King authored
      Adds `@dynamo_worker(static = True)` to create a static worker which has a predictable name and hence does not require discovery or `etcd` to be running. There can only be a single static worker per namespace / component / endpoint trio.
      
      This contrasts with the default dynamic `dynamo_worker` endpoints we have now, which get a unique random name (based on namespace/component/endpoint), and are discovered by ingress components using etcd.
      
      Also change the hello_world example to use `dynamo_worker(static = True)` so that it is exercised and demonstrated somewhere.
      
      For NIM.
      88ad3425
  11. 02 Apr, 2025 1 commit
  12. 31 Mar, 2025 1 commit
  13. 19 Mar, 2025 1 commit
    • Graham King's avatar
      chore: Don't depend on openssl (#292) · 7c3fd5c9
      Graham King authored
      This makes the Rust parts all use ring / rustls library instead of local install of openssl. It's a step on the journey to being statically linked.
      
      Pieces:
      - `tokenizers` and `mistralrs` now support rustls (mistralrs by default, tokenizers with feature flag).
      - Move shared dependencies up into workspace
      - New `rand` crate has some renames for future rust
      - Ensure the dependency doesn't creep back in by enforcing it with cargo deny.
      7c3fd5c9
  14. 18 Mar, 2025 1 commit
  15. 17 Mar, 2025 2 commits
  16. 14 Mar, 2025 1 commit
  17. 13 Mar, 2025 1 commit
  18. 12 Mar, 2025 1 commit
  19. 11 Mar, 2025 2 commits
  20. 10 Mar, 2025 1 commit
  21. 08 Mar, 2025 1 commit
  22. 07 Mar, 2025 3 commits