- 21 Jan, 2026 1 commit
-
-
blarson-b10 authored
Signed-off-by:Brian Larson <brian.larson@baseten.co>
-
- 18 Jan, 2026 1 commit
-
-
blarson-b10 authored
Signed-off-by:Brian Larson <brian.larson@baseten.co>
-
- 15 Jan, 2026 1 commit
-
-
Biswa Panda authored
-
- 02 Jan, 2026 1 commit
-
-
Tushar Sharma authored
Signed-off-by:Tushar Sharma <tusharma@nvidia.com>
-
- 31 Dec, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
Ishan Dhanani <ishandhanani@gmail.com> Co-authored-by:
Sean SH Choi <sechoi@nvidia.com> Co-authored-by:
ishandhanani <82981111+ishandhanani@users.noreply.github.com>
-
- 19 Dec, 2025 1 commit
-
-
zhongdaor-nv authored
Signed-off-by:
zhongdaor <zhongdaor@nvidia.com> Signed-off-by:
zhongdaor-nv <zhongdaor@nvidia.com>
-
- 17 Dec, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 12 Dec, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 11 Dec, 2025 1 commit
-
-
Karen Chung authored
Co-authored-by:Yan Ru Pei <yanrpei@gmail.com>
-
- 21 Nov, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 18 Nov, 2025 1 commit
-
-
Vladislav Nosivskoy authored
Signed-off-by:
Vladislav Nosivskoy <vladnosiv@gmail.com> Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
PeaBrane <yanrpei@gmail.com>
-
- 12 Nov, 2025 2 commits
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Waël Boukhobza authored
Signed-off-by:Wael Boukhobza <wawa_wael@live.fr>
-
- 05 Nov, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 28 Oct, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 24 Oct, 2025 1 commit
-
-
Keiven C authored
refactor: redesign the metrics API from Trait to composition to make the code cleaner and easier to understand (#3687) Signed-off-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 16 Oct, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 13 Oct, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 11 Oct, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Signed-off-by:
Yan Ru Pei <yanrpei@gmail.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
- 10 Oct, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 07 Oct, 2025 1 commit
-
-
blarson-b10 authored
Signed-off-by:
Brian Larson <brian.larson@baseten.co> Signed-off-by:
PeaBrane <yanrpei@gmail.com> Co-authored-by:
PeaBrane <yanrpei@gmail.com>
-
- 26 Sep, 2025 1 commit
-
-
Alec authored
Signed-off-by:
Alec <aflowers@nvidia.com> Signed-off-by:
Alec <35311602+alec-flowers@users.noreply.github.com> Signed-off-by:
krishung5 <krish@nvidia.com> Co-authored-by:
Kris Hung <krish@nvidia.com>
-
- 17 Sep, 2025 2 commits
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 16 Sep, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 10 Sep, 2025 1 commit
-
-
blarson-b10 authored
Signed-off-by:Brian Larson <brian.larson@baseten.co>
-
- 30 Aug, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 22 Aug, 2025 1 commit
-
-
Graham King authored
-
- 01 Aug, 2025 1 commit
-
-
Yan Ru Pei authored
-
- 24 Jul, 2025 1 commit
-
-
Yan Ru Pei authored
-
- 30 Jun, 2025 1 commit
-
-
Graham King authored
Move much of what was in the `dynamo-run` crate into `dynamo-llm` so that everyone can use it. Example usage: 1. Create a `LocalModel`: ``` let local_model = LocalModelBuilder::default() .model_path("Qwen/Qwen3-0.6B") .http_port(8080) .build().await?; ``` 2. Make an engine: ``` let engine_config = EngineConfig::StaticFull { engine: dynamo_engine_mistralrs::make_engine(&local_model).await?, model: Box::new(local_model), }; ``` 3. Connect it to an input and run it ``` dynamo_llm::entrypoint::input::run_input(Input::Http, runtime, engine_config).await?; ``` For https://github.com/ai-dynamo/dynamo/issues/1647 Code Rabbit summary, thanks: * Introduced a flexible builder pattern for local model configuration, allowing advanced customization and easier initialization. * Added new input modes and unified input handling, supporting interactive chat, HTTP server, batch file, and distributed endpoint modes. * Centralized engine configuration and routing, enabling more extensible and maintainable engine management. * Simplified and modularized the codebase by moving input and engine logic into dedicated modules. * Replaced direct model construction with an asynchronous builder for improved clarity and extensibility. * Streamlined configuration and validation for flags and router settings. * Added validation to prevent incompatible input and output combinations in endpoint and dynamic modes.
-
- 30 May, 2025 1 commit
-
-
jain-ria authored
-
- 22 May, 2025 1 commit
-
-
Graham King authored
Removed the hard coded sleeps, explained what we're testing. Closes https://github.com/ai-dynamo/dynamo/issues/1132 The race condition is that `apply_event` sends a message on a channel, it does not directly apply the event. At some later point the tokio runtime schedules the task running the channel receiver, which applies the event. If that had not happened yet the test would fail.
-
- 14 May, 2025 1 commit
-
-
Graham King authored
Router: ``` dynamo-run in=http out=dyn://dynamo.endpoint.generate --router-mode kv ``` Worker (* N): ``` dynamo-run in=dyn://dynamo.endpoint.generate out=vllm /data/llms/Qwen/Qwen3-4B ``` You need patched vllm and the C bindings `.so`. Full docs in the updated guide: `docs/guides/dynamo_run.md`. This gives us a pure-Rust ingress node: OpenAI compliant HTTP server + Pre-processor + KV-aware router.
-
- 04 Apr, 2025 1 commit
-
-
Graham King authored
Also upgrade the cargo resolver to v3, the default. New clippy lints: - `next_back()` instead of `last()` for a double-ended iterator. That avoids walking the whole list. - ` repeat_n` instead of `repeat.take`. That avoids cloning. - Doc indenting
-
- 14 Mar, 2025 1 commit
-
-
Ryan Olson authored
-
- 11 Mar, 2025 1 commit
-
-
Alec authored
-
- 09 Mar, 2025 1 commit
-
-
Alec authored
-
- 25 Feb, 2025 2 commits
-
-
Alec authored
Co-authored-by:aflowers <aflowers@nvidia.com>
-
GuanLuo authored
Signed-off-by:
Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com> Signed-off-by:
Neelay Shah <neelays@nvidia.com> Co-authored-by:
Neelay Shah <neelays@nvidia.com> Co-authored-by:
Ryan Olson <ryanolson@users.noreply.github.com> Co-authored-by:
Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com> Co-authored-by:
Biswa Panda <biswapanda@users.noreply.github.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-