  1. 24 Apr, 2025 1 commit
  2. 21 Apr, 2025 4 commits
  3. 18 Apr, 2025 3 commits
  4. 17 Apr, 2025 3 commits
  5. 12 Apr, 2025 1 commit
  6. 11 Apr, 2025 1 commit
  7. 09 Apr, 2025 2 commits
  8. 07 Apr, 2025 1 commit
    • feat(dynamo-run): Basic routing choice (#524) · ec2e7307
      Graham King authored
      As a first step towards KV routing:
      - Introduce a `--router-mode` flag in dynamo-run. It only supports random and round-robin for now, so it is not that interesting yet.
      - Make the vllm engine publish the KV events received from our patched vllm.
      
      Now we "just" need to connect the two. Easy, right?
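
To illustrate the two modes mentioned above, here is a minimal sketch of random and round-robin worker selection. It is not the dynamo-run code: `RouterMode`, `Router`, `pick`, and the seeded xorshift generator are all hypothetical names and choices made for this example, and it uses only the standard library.

```rust
/// Hypothetical router modes, mirroring the two choices named in the commit message.
#[derive(Debug, Clone, Copy, PartialEq)]
enum RouterMode {
    Random,
    RoundRobin,
}

/// Hypothetical router state (an assumption for illustration only).
struct Router {
    mode: RouterMode,
    next: usize,    // round-robin cursor
    rng_state: u64, // xorshift64 state, kept non-zero
}

impl Router {
    fn new(mode: RouterMode, seed: u64) -> Self {
        Router { mode, next: 0, rng_state: seed.max(1) }
    }

    /// Pick a worker index in `0..num_workers`, or `None` if there are no workers.
    fn pick(&mut self, num_workers: usize) -> Option<usize> {
        if num_workers == 0 {
            return None;
        }
        let idx = match self.mode {
            RouterMode::RoundRobin => {
                // Walk the workers in order, wrapping around at the end.
                let i = self.next % num_workers;
                self.next = self.next.wrapping_add(1);
                i
            }
            RouterMode::Random => {
                // Small xorshift64 step so the sketch needs no external crate.
                let mut x = self.rng_state;
                x ^= x << 13;
                x ^= x >> 7;
                x ^= x << 17;
                self.rng_state = x;
                (x % num_workers as u64) as usize
            }
        };
        Some(idx)
    }
}

fn main() {
    let mut rr = Router::new(RouterMode::RoundRobin, 1);
    let picks: Vec<usize> = (0..5).filter_map(|_| rr.pick(3)).collect();
    println!("round-robin: {:?}", picks); // [0, 1, 2, 0, 1]

    let mut rnd = Router::new(RouterMode::Random, 42);
    let picks: Vec<usize> = (0..5).filter_map(|_| rnd.pick(3)).collect();
    println!("random: {:?}", picks);
}
```

Neither mode looks at the request or at worker state, which is why a KV-aware mode would need the published KV events as an extra input.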
  9. 04 Apr, 2025 3 commits
    • Yan Ru Pei's avatar
    • chore: Upgrade Rust to 1.86 (#518) · e99aa1e1
      Graham King authored
      Also upgrade the cargo resolver to v3, the default.
      
      New clippy lints:
      - `next_back()` instead of `last()` for a double-ended iterator. That avoids walking the whole list.
      - `repeat_n` instead of `repeat().take()`. That avoids an extra clone.
      - Doc indenting
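
The first two lints above can be shown with a short standard-library example. The helper names `last_item` and `repeated` are made up for this sketch, and `std::iter::repeat_n` requires Rust 1.82 or later.

```rust
use std::iter;

/// Last element via `next_back()`: a double-ended iterator can take it
/// from the back directly instead of consuming every item as `last()` may.
fn last_item(items: &[i32]) -> Option<&i32> {
    items.iter().next_back()
}

/// `repeat_n` moves the original value out on the final iteration,
/// whereas `repeat(value).take(n)` clones it `n` times and then drops it.
fn repeated(value: String, n: usize) -> Vec<String> {
    iter::repeat_n(value, n).collect()
}

fn main() {
    let v = vec![1, 2, 3, 4];
    // Both forms return the same element; only the work done differs.
    assert_eq!(v.iter().last(), last_item(&v));

    let old_style: Vec<String> = iter::repeat("ab".to_string()).take(3).collect();
    let new_style = repeated("ab".to_string(), 3);
    assert_eq!(old_style, new_style);
    println!("{:?} {:?}", last_item(&v), new_style);
}
```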
    • feat: Python decorator dynamo_worker takes optional `static` parameter without etcd (#494) · 88ad3425
      Graham King authored
      Adds `@dynamo_worker(static = True)` to create a static worker which has a predictable name and hence does not require discovery or `etcd` to be running. There can only be a single static worker per namespace / component / endpoint trio.
      
      This contrasts with the default dynamic `dynamo_worker` endpoints we have now, which get a unique random name (based on namespace/component/endpoint), and are discovered by ingress components using etcd.
      
      Also change the hello_world example to use `dynamo_worker(static = True)` so that it is exercised and demonstrated somewhere.
      
      For NIM.
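
The difference between static and dynamic naming described above can be sketched as follows. This is only an illustration of the idea, not Dynamo's actual naming scheme or API: `EndpointId`, `static_name`, `dynamic_name`, `StaticWorkers`, and the `/` and `-` separators are all assumptions made for this example.

```rust
use std::collections::HashSet;

/// Hypothetical namespace / component / endpoint trio.
#[derive(Debug, Clone, PartialEq, Eq, Hash)]
struct EndpointId {
    namespace: String,
    component: String,
    endpoint: String,
}

impl EndpointId {
    fn new(namespace: &str, component: &str, endpoint: &str) -> Self {
        EndpointId {
            namespace: namespace.to_string(),
            component: component.to_string(),
            endpoint: endpoint.to_string(),
        }
    }

    /// Static name: fully determined by the trio, so a client can compute it
    /// without asking a discovery service.
    fn static_name(&self) -> String {
        format!("{}/{}/{}", self.namespace, self.component, self.endpoint)
    }

    /// Dynamic name: the trio plus a per-instance id, so clients must
    /// discover which instances currently exist.
    fn dynamic_name(&self, instance_id: u64) -> String {
        format!("{}-{:x}", self.static_name(), instance_id)
    }
}

/// Tracks claimed static names; a second claim for the same trio is rejected
/// because both workers would end up with the same name.
#[derive(Default)]
struct StaticWorkers {
    claimed: HashSet<EndpointId>,
}

impl StaticWorkers {
    fn claim(&mut self, id: &EndpointId) -> Result<String, String> {
        if self.claimed.insert(id.clone()) {
            Ok(id.static_name())
        } else {
            Err(format!("static worker already registered for {}", id.static_name()))
        }
    }
}

fn main() {
    let id = EndpointId::new("ns", "comp", "ep");
    let mut workers = StaticWorkers::default();
    println!("{:?}", workers.claim(&id));
    println!("{:?}", workers.claim(&id));
    println!("{}", id.dynamic_name(0x2a));
}
```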
  10. 03 Apr, 2025 3 commits
  11. 02 Apr, 2025 1 commit
  12. 01 Apr, 2025 2 commits
  13. 31 Mar, 2025 3 commits
  14. 28 Mar, 2025 1 commit
  15. 26 Mar, 2025 1 commit
  16. 25 Mar, 2025 1 commit
  17. 24 Mar, 2025 1 commit
  18. 21 Mar, 2025 1 commit
  19. 20 Mar, 2025 1 commit
  20. 19 Mar, 2025 4 commits
  21. 18 Mar, 2025 2 commits