Commits · c3a85c0698d1bfa0bf5467f03fbba75a61a8d62d · OpenDAS / dynamo


22 May, 2025 1 commit

feat(dynamo-run): Allow setting context-length (#1157) · 6d5da821

Graham King authored May 22, 2025

Llama 4 has a very large context length (aka n_ctx, model_max_length, max_model_len), and vLLM won't start unless it can allocate enough KV cache for the entire context.

Allow passing `--context-length <N>` to `dynamo-run` to cap the context length so that long-context models will fit.

Future todo:
- Restrict every request's `max_tokens` to below the context length. Our pre-processor should do this by setting `stop_conditions.max_tokens`. The mistralrs engine wrapper must do it itself because it does not use the pre-processor.
- mistralrs and llamacpp currently use a hard-coded max context length if one is not provided on the command line. Change those to use the model's built-in max, read from the GGUF or tokenizer_config.json.
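
The first todo item can be sketched roughly as follows. This is a minimal illustration, not code from the dynamo repository: the function name `clamp_max_tokens` and its parameters are hypothetical, and subtracting the prompt length from the context length is an assumption about how the limit would be applied.

```rust
/// Hypothetical helper (not part of dynamo): cap a request's `max_tokens`
/// so that prompt plus generated tokens stay within the context length.
///
/// - `requested`: the `max_tokens` value from the request, if any.
/// - `context_length`: the limit, e.g. the value passed via `--context-length`.
/// - `prompt_tokens`: number of tokens already consumed by the prompt.
fn clamp_max_tokens(requested: Option<u32>, context_length: u32, prompt_tokens: u32) -> u32 {
    // Tokens still available for generation; saturates at 0 if the prompt
    // alone already meets or exceeds the context length.
    let available = context_length.saturating_sub(prompt_tokens);
    match requested {
        // Honour the request when it fits, otherwise cap it.
        Some(n) => n.min(available),
        // No explicit limit: allow generation up to the remaining budget.
        None => available,
    }
}
```

Under these assumptions, a request asking for 16000 tokens with a 1000-token prompt and a context length of 8192 would be capped at 7192, while a request for 4096 tokens would pass through unchanged.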

6d5da821

25 Feb, 2025 1 commit

refactor: move libs to lib dir · 08fcd7e9

Neelay Shah authored Feb 24, 2025


Signed-off-by: Neelay Shah <neelays@nvidia.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

08fcd7e9

13 Feb, 2025 1 commit
- fix: tcp updates + initial zmq (#176) · 2fd6592f
  Ryan Olson authored Feb 13, 2025
  
  2fd6592f

12 Feb, 2025 1 commit

fix: tcp retry and error handling updates (#169) · dddebc0d

Ryan Olson authored Feb 12, 2025


Signed-off-by: Ryan Olson <ryanolson@users.noreply.github.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

dddebc0d

11 Feb, 2025 2 commits
- refactor: Handle JSON serialize / de-serialize errors. (#156) · b705549c
  Graham King authored Feb 11, 2025
  
  b705549c
- refactor: Use tracing crate (#161) · a62a8627
  Graham King authored Feb 11, 2025
  
  a62a8627

05 Feb, 2025 1 commit
- ci: Add Copyright Verification Scripts w/ Automation (#110) · c9130f8f
  J Wyman authored Feb 05, 2025
  
  c9130f8f

04 Feb, 2025 1 commit
- feat: rust - initial commit · 5ed8c1c0
  Ryan Olson authored Feb 03, 2025

  ```
  the journey begins
  ```

  5ed8c1c0