Commits · 42ce6931a788c31a9f1f99b51336ba15e189f617 · OpenDAS / dynamo

09 May, 2025 1 commit
- feat: kv block manager (#965) (#1021) · 42ce6931
  Harrison Saturley-Hall authored May 09, 2025
```
Co-authored-by: Ryan Olson <ryanolson@users.noreply.github.com>
```
  42ce6931
04 Apr, 2025 1 commit

chore: Upgrade Rust to 1.86 (#518) · e99aa1e1

Graham King authored Apr 04, 2025

Also upgrade the cargo resolver to v3, the default.

New clippy lints:
- `next_back()` instead of `last()` for a double-ended iterator. That avoids walking the whole list.
- ` repeat_n` instead of `repeat.take`. That avoids cloning.
- Doc indenting

e99aa1e1

19 Mar, 2025 1 commit

chore: Don't depend on openssl (#292) · 7c3fd5c9

Graham King authored Mar 19, 2025

This makes the Rust parts all use ring / rustls library instead of local install of openssl. It's a step on the journey to being statically linked.

Pieces:
- `tokenizers` and `mistralrs` now support rustls (mistralrs by default, tokenizers with feature flag).
- Move shared dependencies up into workspace
- New `rand` crate has some renames for future rust
- Ensure the dependency doesn't creep back in by enforcing it with cargo deny.

7c3fd5c9

14 Mar, 2025 1 commit

feat(dynamo-run): Various UX improvements (#168) · 1fb31d6a

Graham King authored Mar 14, 2025

Engines mistralrs, sglang and vllm included by default. Can be disabled like this: `cargo build --no-default-features --features <add-back-what-you-want>`.

Added `--feature vulkan` option, for llamacpp.

Build time message if CUDA or Metal would help and are missing. That's the best we can do:
> warning: dynamo-run@0.1.0: CUDA not enabled, re-run with `--features cuda`

Runtime message if CUDA, Metal or Vulkan are enabled:
> 2025-03-14T21:59:26.501937Z  INFO dynamo_run: CUDA on

Runtime message if they are missing:
> 2025-03-14T22:02:37.439404Z  INFO dynamo_run: CPU mode. Rebuild with `--features cuda|metal|vulkan` for better performance

Defaut engine message includes available engines:
> 2025-03-14T21:59:26.503612Z  INFO dynamo_run: Using default engine: mistralrs. Use out=<engine> to specify one of echo_core, echo_full, mistralrs, llamacpp, sglang, vllm, pystr, pytok

The really important outcome is that this should now "just work":
```
cargo install dynamo-run
dynamo-run Qwen/Qwen2.5-3B-Instruct
```

Sadly you still need `--features cuda|metal` for performance, I couldn't automate that.

1fb31d6a

13 Mar, 2025 1 commit
- build: add top level rust workspace (#137) · 3d292851
  Anant Sharma authored Mar 13, 2025
  
  3d292851
11 Mar, 2025 1 commit
- refactor: Move rust binaries out of examples, update nixl dockerfile (#89) · e5db9e86
  Neelay Shah authored Mar 11, 2025
```
Co-authored-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
```
  e5db9e86
08 Mar, 2025 1 commit
- chore: rename dynamo (#44) · 602352ce
  Neelay Shah authored Mar 08, 2025
```
Co-authored-by: Biswa Panda <biswa.panda@gmail.com>
```
  602352ce
07 Mar, 2025 1 commit
- refactor: rename count to metrics and move location (#21) · ac13ed06
  Neelay Shah authored Mar 06, 2025
  
  ac13ed06
05 Mar, 2025 1 commit
- refactor: Rename 'tio' to 'dynemo-run' (#18) · 14ce7e03
  Graham King authored Mar 04, 2025
  
  14ce7e03
04 Mar, 2025 2 commits
- fix · 46dcc058
  Anant Sharma authored Mar 04, 2025
  
  46dcc058
- remove timeout for cargo deny check · 5db8d13a
  Anant Sharma authored Mar 04, 2025
  
  5db8d13a
03 Mar, 2025 1 commit

fix: Install specific toolchain (#329) · 2d906fb4

Graham King authored Mar 03, 2025

`cargo build --locked` won't let you use "1.85.0" if you only have "stable" installed, even if those are the same thing right now.

2d906fb4

26 Feb, 2025 1 commit
- ci: fix rust deny workflow (#275) · 76439997
  Anant Sharma authored Feb 26, 2025
  
  76439997
25 Feb, 2025 3 commits
- refactor: moving tio to launch dir · eb022ec9
  Neelay Shah authored Feb 25, 2025
  
  eb022ec9
- ci: Add rust checks to missing directories (#239) · c06b95ff
  Ryan McCormick authored Feb 25, 2025
```
Signed-off-by: Ryan McCormick <rmccormick@nvidia.com>
```
  c06b95ff
- refactor: move libs to lib dir · 08fcd7e9
  Neelay Shah authored Feb 24, 2025
```
Signed-off-by: Neelay Shah <neelays@nvidia.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
```
  08fcd7e9
21 Feb, 2025 1 commit

feat: event plane + count · 3b7a462d

Ryan Olson authored Feb 21, 2025


Signed-off-by: Ryan Olson <ryanolson@users.noreply.github.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

3b7a462d

20 Feb, 2025 1 commit
- ci: add instruction for running github actions locally (#218) · b90535aa
  Biswa Panda authored Feb 20, 2025
  
  b90535aa
14 Feb, 2025 1 commit
- fix: Fix rust pre-merge for new tio directory (#177) · a48d932e
  Ryan McCormick authored Feb 13, 2025
  
  a48d932e
13 Feb, 2025 1 commit

feat: Add `tio` your friendly cmd line uncle to run triton-llm services (#174) · 418ae5e8

Graham King authored Feb 13, 2025

This provides a simple example of how to write a triton-llm engine, and how to connect it to the OpenAI HTTP server.

This is the tool previously called `nio` and `llmctl`.

- **Inputs**: Text and HTTP.
- **Engines**: Echo, which streams your prompt back with a slight delay.

Build: `cargo build`

Pre-requisites: `nats-server` and `etcd` must be running locally, even though they are not yet used by `tio`.

Run with text input:
```
./target/debug/tio in=text out=echo_full --model-name test
```

Run with the triton-llm HTTP server:
```
./target/debug/tio in=http out=echo_full --http-port 8080 --model-name Echo-0B
```

List models:
```
curl localhost:8080/v1/models | jq
```

Will output
```
{
  "object": "list",
  "data": [
    {
      "id": "Echo-0B",
      "object": "object",
      "created": 1739400430,
      "owned_by": "nvidia"
    }
  ]
}
```

#### What's next

As triton-distributed gains features `tio` will be able to grow:
- When we get the pre-processor we can have token-in token-out engines. 
- When we get a pull-router we can have `in=nats` and `out=nats`.
- When we get discovery we can have dynamic engines.

418ae5e8

11 Feb, 2025 1 commit
- chore: update rust versions to v0.2.0 (#155) · 2e409565
  Anant Sharma authored Feb 10, 2025
```
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
```
  2e409565
06 Feb, 2025 1 commit
- ci: Remove conflicting paths-ignore from rust action (#122) · c9efcce6
  Ryan McCormick authored Feb 05, 2025
  
  c9efcce6
05 Feb, 2025 3 commits
- add deny.toml, update CI to run license checks (#101) · 0cd5d783
  Alec authored Feb 05, 2025
```
Co-authored-by: aflowers <aflowers@nvidia.com>
```
  0cd5d783
- ci: Add Copyright Verification Scripts w/ Automation (#110) · c9130f8f
  J Wyman authored Feb 05, 2025
  
  c9130f8f
- ci: update rust pre-merge checks (#97) · ffbc06cc
  Anant Sharma authored Feb 05, 2025
  
  ffbc06cc