Commits · 6d2abdba35f2da56c02efd802a1743798d095be4 · OpenDAS / dynamo

14 Feb, 2025 2 commits
- feat: Add event plane · 6d2abdba
  Blazej authored Feb 14, 2025
```
Signed-off-by: Piotr Marcinkiewicz <piotrm@nvidia.com>
Co-authored-by: Piotr Marcinkiewicz <piotrm@nvidia.com>
Co-authored-by: Neelay Shah <neelays@nvidia.com>
```
  6d2abdba
- fix: Fix rust pre-merge for new tio directory (#177) · a48d932e
  Ryan McCormick authored Feb 13, 2025
  
  a48d932e
13 Feb, 2025 4 commits

chore: Add CODEOWNERS (#170) · a6caed32
Ryan McCormick authored Feb 13, 2025

a6caed32

feat: Add `tio` your friendly cmd line uncle to run triton-llm services (#174) · 418ae5e8

Graham King authored Feb 13, 2025

This provides a simple example of how to write a triton-llm engine, and how to connect it to the OpenAI HTTP server.

This is the tool previously called `nio` and `llmctl`.

- **Inputs**: Text and HTTP.
- **Engines**: Echo, which streams your prompt back with a slight delay.

Build: `cargo build`

Pre-requisites: `nats-server` and `etcd` must be running locally, even though they are not yet used by `tio`.

Run with text input:
```
./target/debug/tio in=text out=echo_full --model-name test
```

Run with the triton-llm HTTP server:
```
./target/debug/tio in=http out=echo_full --http-port 8080 --model-name Echo-0B
```

List models:
```
curl localhost:8080/v1/models | jq
```

Will output
```
{
  "object": "list",
  "data": [
    {
      "id": "Echo-0B",
      "object": "object",
      "created": 1739400430,
      "owned_by": "nvidia"
    }
  ]
}
```

#### What's next

As triton-distributed gains features `tio` will be able to grow:
- When we get the pre-processor we can have token-in token-out engines. 
- When we get a pull-router we can have `in=nats` and `out=nats`.
- When we get discovery we can have dynamic engines.

418ae5e8

fix: tcp updates + initial zmq (#176) · 2fd6592f
Ryan Olson authored Feb 13, 2025

2fd6592f
feat: vLLM v0.7.2 patch supporting XpYd and heterogeneous P/D parallel configs (#167) · f38aa469
ptarasiewiczNV authored Feb 13, 2025
```
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Co-authored-by: Neelay Shah <neelays@nvidia.com>
```
f38aa469

12 Feb, 2025 5 commits
- fix: tcp retry and error handling updates (#169) · dddebc0d
  Ryan Olson authored Feb 12, 2025
```
Signed-off-by: Ryan Olson <ryanolson@users.noreply.github.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
```
  dddebc0d
- fix: async access to gil updates; notes on perf (#159) · c0e008b4
  Ryan Olson authored Feb 12, 2025
  
  c0e008b4
- fix: Fix TRTLLM Backend rebuild · c2a9636c
  Tanmay Verma authored Feb 12, 2025
  
  c2a9636c
- test: Add unit test to protocol module (#165) · 42dfe524
  Tanmay Verma authored Feb 11, 2025
```
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
```
  42dfe524
- ci: add path checks for vllm framework testing (#153) · dee73d91
  Anant Sharma authored Feb 11, 2025
  
  dee73d91
11 Feb, 2025 6 commits
- refactor: Handle JSON serialize / de-serialize errors. (#156) · b705549c
  Graham King authored Feb 11, 2025
  
  b705549c
- chore: Add triton_distributed_rs wheel install to container build (#135) · 139a9a83
  Ryan McCormick authored Feb 11, 2025
  
  139a9a83
- refactor: Use tracing crate (#161) · a62a8627
  Graham King authored Feb 11, 2025
  
  a62a8627
- chore: Again: Add rust-toolchain so we're all on the same version (#160) · e1bd07fe
  Graham King authored Feb 11, 2025
  
  e1bd07fe
- chore: update rust versions to v0.2.0 (#155) · 2e409565
  Anant Sharma authored Feb 10, 2025
```
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
```
  2e409565
- chore: Update base from from 24.12 to 25.01 Triton version (#100) · 20b36843
  Tanmay Verma authored Feb 10, 2025
  
  20b36843
10 Feb, 2025 7 commits
- ci: Move slow pytests to nightly (#152) · 21a8a79c
  Ryan McCormick authored Feb 10, 2025
  
  21a8a79c
- fix: Improve PythonAsyncEngine error handling and Increase Tokio thread count (#129) · 6ca24080
  Ryan Olson authored Feb 10, 2025
```
Signed-off-by: Ryan Olson <ryanolson@users.noreply.github.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
```
  6ca24080
- Update trigger_ci_push.yml · 910751c6
  Meenakshi Sharma authored Feb 10, 2025
```
Signed-off-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
```
  910751c6
- Update trigger_ci_pull.yml · 532e4491
  Meenakshi Sharma authored Feb 10, 2025
```
Signed-off-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
```
  532e4491
- doc: Fix doc links (#149) · c7f8acd4
  Graham King authored Feb 10, 2025
  
  c7f8acd4
- feat: OpenAI compatible http service (#123) · ffc6dde1
  Ryan Olson authored Feb 10, 2025
```
Signed-off-by: Ryan Olson <ryanolson@users.noreply.github.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Co-authored-by: Neelay Shah <neelays@nvidia.com>
```
  ffc6dde1
- Add rust-toolchain so we're all on the same version (#145) · 9d6643b7
  Graham King authored Feb 10, 2025
  
  9d6643b7
08 Feb, 2025 3 commits
- docs: readme updates · 66f048f6
  Neelay Shah authored Feb 08, 2025
  
  66f048f6
- refactor: Modify rust vllm example to use container · d2e7ae02
  Ryan McCormick authored Feb 07, 2025
```
Co-authored-by: Piotr Tarasiewicz <ptarasiewicz@nvidia.com>
Co-authored-by: Neelay Shah <neelays@nvidia.com>
```
  d2e7ae02
- build: Support rebuilding and replacing trtllm backend (#99) · 2cfb1b6d
  Tanmay Verma authored Feb 07, 2025
  
  2cfb1b6d
07 Feb, 2025 3 commits
- feat: vLLM DistributedRuntime Monolith and Disagg Workers Example · 45bc426c
  ptarasiewiczNV authored Feb 08, 2025
```
Co-authored-by: Neelay Shah <neelays@nvidia.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
```
  45bc426c
- fix: Fix the test_mock_disaggregated_serving on multi-GPU machines (#117) · 39441642
  Tanmay Verma authored Feb 07, 2025
```
Co-authored-by: Piotr Marcinkiewicz <piotrm@nvidia.com>
Co-authored-by: Neelay Shah <neelays@nvidia.com>
```
  39441642
- ci: restrict which branches & paths trigging lab testing (#96) · e03d23c8
  J Wyman authored Feb 07, 2025
  
  e03d23c8
06 Feb, 2025 5 commits
- ci: move file to .github/workflows folder (#125) · d1d2d7e3
  J Wyman authored Feb 06, 2025
  
  d1d2d7e3
- docs: readme + examples (#116) · cdcdce96
  Ryan Olson authored Feb 06, 2025
```
Co-authored-by: aflowers <aflowers@nvidia.com>
```
  cdcdce96
- ci: Remove conflicting paths-ignore from rust action (#122) · c9efcce6
  Ryan McCormick authored Feb 05, 2025
  
  c9efcce6
- add Readme, fix formatting (#120) · b3646497
  Alec authored Feb 05, 2025
```
Co-authored-by: aflowers <aflowers@nvidia.com>
```
  b3646497
- fix: Set default 'latest' tag if git tags fail to get parsed (#118) · 7894967e
  Ryan McCormick authored Feb 05, 2025
  
  7894967e
05 Feb, 2025 5 commits
- add deny.toml, update CI to run license checks (#101) · 0cd5d783
  Alec authored Feb 05, 2025
```
Co-authored-by: aflowers <aflowers@nvidia.com>
```
  0cd5d783
- ci: Add Copyright Verification Scripts w/ Automation (#110) · c9130f8f
  J Wyman authored Feb 05, 2025
  
  c9130f8f
- Added runtime/rust/ATTRIBUTIONS-Rust.md (#115) · 9322edef
  Dmitry Tokarev authored Feb 05, 2025
  
  9322edef
- feat: add python bindings + wheel build (#94) · 03b0101e
  Ryan Olson authored Feb 05, 2025
```
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Co-authored-by: Neelay Shah <neelays@nvidia.com>
```
  03b0101e
- ci: update rust pre-merge checks (#97) · ffbc06cc
  Anant Sharma authored Feb 05, 2025
  
  ffbc06cc