Commits · e47457e4e7fb22435bd2ba2cc4a7ad50a30e6680 · OpenDAS / dynamo

13 Jan, 2026 1 commit
- test: unit test tcp/client.rs handle_writer [1/n] (#5055) · e47457e4
  Qi Wang authored Jan 13, 2026
  
02 Jan, 2026 1 commit
- chore: update all copyright headers in repo to 2026 (#5130) · cf433e68
  Tushar Sharma authored Jan 02, 2026
```
Signed-off-by: Tushar Sharma <tusharma@nvidia.com>
```
13 Dec, 2025 1 commit
- feat: Support for field include_stop_str_in_output (#4924) · 45e881d3
  KrishnanPrash authored Dec 12, 2025
```
Signed-off-by: Krishnan Prashanth <kprashanth@nvidia.com>
```
10 Nov, 2025 1 commit
- refactor: Make the Runtime and DistributedRuntime fields private (#4193) · cf630bf7
  Graham King authored Nov 10, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
07 Nov, 2025 1 commit
- chore: better error logging for "failed to join reader and writer tasks" #3910 (#3913) · 0c66b2d2
  Yan Ru Pei authored Nov 07, 2025
```
Signed-off-by: PeaBrane <yanrpei@gmail.com>
```
19 Sep, 2025 1 commit
- feat: Request Cancellation unary request support (#3004) · a8fd1271
  Jacky authored Sep 18, 2025
```
Signed-off-by: Jacky <18255193+kthui@users.noreply.github.com>
```
16 Sep, 2025 1 commit
- chore(runtime): Shorten the license header (#3059) · 02a22cbc
  Graham King authored Sep 16, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
22 Aug, 2025 1 commit
- chore: Rust to 1.89 and edition 2024 (#2659) · bce74588
  Graham King authored Aug 22, 2025
  
24 Jun, 2025 1 commit
- fix: rename create_response_steam to create_response_stream (#1615) · 68e4d2c1
  zxyy-bys authored Jun 24, 2025
  
22 May, 2025 1 commit
- feat(dynamo-run): Allow setting context-length (#1157) · 6d5da821
  Graham King authored May 22, 2025

Llama 4 has a very large context length (aka n_ctx, model_max_length, max_model_len), and vllm won't start unless it can allocate enough KV cache for the entire context.

Allow passing `--context-length <N>` to `dynamo-run` to limit it so long-context models will fit.

Future todo:
- Restrict every request's `max_tokens` to below the context length. Our pre-processor should do this by setting `stop_conditions.max_tokens`. The mistralrs engine wrapper must do it itself because it does not use the pre-processor.
- mistralrs and llamacpp currently have a hard-coded max context length if one is not provided on the command line. Change those to be the model's built-in max, read from the GGUF or tokenizer_config.json.
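
The first todo item above could be sketched roughly as follows. This is a minimal illustration, not code from the dynamo repository: `clamp_max_tokens` and its parameters are hypothetical names, and the choice to subtract the prompt length from the context length is an assumption about what "below the context length" would mean in practice.

```rust
/// Hypothetical helper (not part of dynamo): cap a request's `max_tokens`
/// so that prompt tokens plus generated tokens fit in the context length.
fn clamp_max_tokens(requested: Option<u32>, prompt_tokens: u32, context_length: u32) -> u32 {
    // Tokens left for generation. `saturating_sub` returns 0 instead of
    // underflowing when the prompt alone fills or exceeds the context.
    let budget = context_length.saturating_sub(prompt_tokens);
    match requested {
        // Honor the caller's limit, but never exceed the remaining budget.
        Some(n) => n.min(budget),
        // No limit requested: fall back to the remaining budget.
        None => budget,
    }
}

fn main() {
    // Assumed value, standing in for what `--context-length <N>` would supply.
    let context_length = 4096;
    println!("{}", clamp_max_tokens(Some(8192), 100, context_length));
    println!("{}", clamp_max_tokens(Some(256), 100, context_length));
    println!("{}", clamp_max_tokens(None, 100, context_length));
}
```

In a design like the one the todo describes, the clamped value would presumably be what ends up in `stop_conditions.max_tokens`; how the pre-processor actually wires that up is not shown here.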

25 Feb, 2025 1 commit
- refactor: move libs to lib dir · 08fcd7e9
  Neelay Shah authored Feb 24, 2025

  Signed-off-by: Neelay Shah <neelays@nvidia.com>
  Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
13 Feb, 2025 1 commit
- fix: tcp updates + initial zmq (#176) · 2fd6592f
  Ryan Olson authored Feb 13, 2025
  
12 Feb, 2025 1 commit
- fix: tcp retry and error handling updates (#169) · dddebc0d
  Ryan Olson authored Feb 12, 2025

  Signed-off-by: Ryan Olson <ryanolson@users.noreply.github.com>
  Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
11 Feb, 2025 2 commits
- refactor: Handle JSON serialize / de-serialize errors. (#156) · b705549c
  Graham King authored Feb 11, 2025
  
- refactor: Use tracing crate (#161) · a62a8627
  Graham King authored Feb 11, 2025
  
05 Feb, 2025 1 commit
- ci: Add Copyright Verification Scripts w/ Automation (#110) · c9130f8f
  J Wyman authored Feb 05, 2025
  
04 Feb, 2025 1 commit
- feat: rust - initial commit · 5ed8c1c0
  Ryan Olson authored Feb 03, 2025
```
the journey begins
```