Commits · fd5cc2880eaffc0900ce49f48cc17514ddf119d3 · OpenDAS / dynamo

08 Apr, 2026 1 commit
- refactor(3/3): switch dynamo-protocols to upstream async-openai types (#7625) · fd5cc288
  ishandhanani authored Apr 08, 2026
```
Co-authored-by: Dmitry Tokarev <dtokarev@nvidia.com>
```
  fd5cc288
02 Apr, 2026 1 commit
- refactor(protocols): deprecate cache control (#7790) · c09ac697
  ishandhanani authored Apr 01, 2026
  
  c09ac697
01 Apr, 2026 1 commit
- refactor(2/3): rename dynamo-async-openai to dynamo-protocols (#7565) · b6a3b0c6
  ishandhanani authored Apr 01, 2026
  
  b6a3b0c6
30 Mar, 2026 1 commit
- refactor(1/3): move `nvext` to `dynamo-llm` and move `anthropic` to `dynamo-async-openai` (#7564) · 2887cd1c
  ishandhanani authored Mar 30, 2026
  
  2887cd1c
02 Mar, 2026 1 commit

feat: Full Anthropic Messages API cache_control support (top-level, per-block,... · 4d3e1ae3

MatejKosec authored Mar 02, 2026


feat: Full Anthropic Messages API cache_control support (top-level, per-block, system block arrays) (#6629)
Signed-off-by: Matej Kosec <mkosec@nvidia.com>

4d3e1ae3

07 Jan, 2026 1 commit
- feat: Adding support for `response_format` field (#5127) · e994caeb
  KrishnanPrash authored Jan 07, 2026
```
Signed-off-by: Krishnan Prashanth <kprashanth@nvidia.com>
```
  e994caeb
02 Jan, 2026 1 commit
- chore: update all copyright headers in repo to 2026 (#5130) · cf433e68
  Tushar Sharma authored Jan 02, 2026
```
Signed-off-by: Tushar Sharma <tusharma@nvidia.com>
```
  cf433e68
31 Dec, 2025 1 commit
- fix: add chat_template_kwargs alias for compatibility (#5112) · cceeb8e3
  Neelay Shah authored Dec 30, 2025
```
Co-authored-by: Claude <noreply@anthropic.com>
```
  cceeb8e3
22 Dec, 2025 1 commit
- feat: add auto-generated frontend OpenAPI spec and helper binary (#4802) · f63e273c
  smatta-star authored Dec 22, 2025
```
Signed-off-by: Satvik Matta <smatta@nvidia.com>
```
  f63e273c
19 Dec, 2025 1 commit
- feat: Runtime media decoder config (#5011) · d2faf0e6
  milesial authored Dec 18, 2025
```
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
```
  d2faf0e6
09 Dec, 2025 1 commit
- feat: add tool_choice support (#4722) · 5585f803
  Vladislav Nosivskoy authored Dec 09, 2025
```
Signed-off-by: Vladislav Nosivskoy <vladnosiv@gmail.com>
```
  5585f803
08 Nov, 2025 1 commit
- feat: Add support for skip_special_tokens parameter in v1/completions and... · 441473c3
  Ryan McCormick authored Nov 07, 2025
```
feat: Add support for skip_special_tokens parameter in v1/completions and v1/chat/completions endpoints (#4175)
```
  441473c3
03 Nov, 2025 1 commit
- feat: Reject unsupported parameters with 400 Bad Request (#4021) · c837b5ba
  KrishnanPrash authored Nov 03, 2025
```
Signed-off-by: Krishnan Prashanth <kprashanth@nvidia.com>
```
  c837b5ba
27 Oct, 2025 1 commit
- chore: add request validation and better error message for n > 1 and temperature=0 (#3914) · 980bae03
  zhongdaor-nv authored Oct 27, 2025
```
Signed-off-by: zhongdaor <zhongdaor@nvidia.com>
```
  980bae03
10 Oct, 2025 1 commit
- chore: remove deprecated nvext parameters for 6.0 (#3551) · b954a249
  ryan-lempka authored Oct 10, 2025
  
  b954a249
29 Sep, 2025 1 commit
- chore: relaxing constraints on metadata field, adding metadata field to completions API (#3240) · 13156361
  nv-nedelman-1 authored Sep 29, 2025
```
Signed-off-by: Nicholas Edelman <nedelman@nvidia.com>
```
  13156361
26 Sep, 2025 1 commit
- chore: added more api error code validations (#3231) · d4f0d2bc
  Ayush Agarwal authored Sep 26, 2025
```
Signed-off-by: ayushag <ayushag@nvidia.com>
```
  d4f0d2bc
23 Sep, 2025 1 commit
- feat: JailedStream (#3034) · c63cceaa
  Ryan Olson authored Sep 23, 2025
```
Signed-off-by: ayushag <ayushag@nvidia.com>
```
  c63cceaa
17 Sep, 2025 2 commits
- chore: fillout sampling params (seed, n, best_of, min_p) (#3055) · 3de04dd9
  Greg Clark authored Sep 17, 2025
```
Signed-off-by: Greg Clark <grclark@nvidia.com>
```
  3de04dd9
- feat: add chat_template_kwargs param to v1/chat/completion (#3016) · eb6722e3
  Chi McIsaac authored Sep 17, 2025
```
Signed-off-by: Chi McIsaac <chixie.mcisaac@gmail.com>
```
  eb6722e3
16 Sep, 2025 2 commits
- chore(llm): Remove extra license headers (#3065) · bc0a7633
  Graham King authored Sep 16, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  bc0a7633
- chore: 400 Error Code for Bad Completion and ChatCompletion Request (#3038) · dcd331ab
  Ayush Agarwal authored Sep 15, 2025
```
Signed-off-by: ayushag <ayushag@nvidia.com>
```
  dcd331ab
29 Aug, 2025 2 commits
- chore: added include_stop_str_in_output (#2782) · 1f6b83be
  Ayush Agarwal authored Aug 29, 2025
  
  1f6b83be
- chore: deprecate nvext.top_k and nvext.repetition_penalty and make available top level (#2767) · 63f5bbc0
  ryan-lempka authored Aug 28, 2025
```
Signed-off-by: Ryan Lempka <rlempka@nvidia.com>
```
  63f5bbc0
28 Aug, 2025 1 commit
- chore: deprecate duplicate params in nvext (#2754) · e3619ce0
  ryan-lempka authored Aug 27, 2025
```
Signed-off-by: Ryan Lempka <rlempka@nvidia.com>
```
  e3619ce0
22 Aug, 2025 1 commit
- chore: Rust to 1.89 and edition 2024 (#2659) · bce74588
  Graham King authored Aug 22, 2025
  
  bce74588
20 Aug, 2025 1 commit

chore: remove flatten for chat response types, add reasoning_content (#2543) · c12fe501

nachiketb-nvidia authored Aug 19, 2025

Changing the chat completions response objects from structs to types of dynamo_async_openai

Implement aggregator traits for them chat completion structs

add reasoning_content under message and delta message in lib/async-openai

c12fe501

19 Aug, 2025 1 commit
- chore: Bring async-openai into repo as request starter (#2520) · 199b9a30
  nachiketb-nvidia authored Aug 19, 2025
```
Co-authored-by: Graham King <grahamk@nvidia.com>
```
  199b9a30
14 Aug, 2025 1 commit
- feat: logprob handling (#2426) · f476fd74
  Greg Clark authored Aug 14, 2025
```
Signed-off-by: Greg Clark <grclark@nvidia.com>
```
  f476fd74
12 Aug, 2025 1 commit

feat: Add frontend support for `min_tokens` and `ignore_eos` (outside of... · 18bb779e

KrishnanPrash authored Aug 12, 2025

feat: Add frontend support for `min_tokens` and `ignore_eos` (outside of `nvext`) and Structured Output / Guided Decoding (#2380)
Signed-off-by: KrishnanPrash <140860868+KrishnanPrash@users.noreply.github.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Co-authored-by: Ayush Agarwal <ayushag@nvidia.com>

18bb779e

01 Jul, 2025 1 commit
- feat: Validation engine for validating OpenAI api request data (#1674) · ee86bad3
  Nathan Barry authored Jul 01, 2025
  
  ee86bad3
26 Jun, 2025 1 commit
- refactor: remove dead protocols code and organize imports idiomatically (#1669) · 9d7c5df5
  Paul Hendricks authored Jun 26, 2025
  
  9d7c5df5
17 Mar, 2025 1 commit

fix(vllm,sglang): Let the engine enforce max tokens (#216) · 05765cd4

Graham King authored Mar 17, 2025

Previously several parts of the stack ensured max tokens (for this single request) was set.

Now only text input sets it (to 8k). Everything else leaves as is, potentially blank. The engines themselves have very small defaults, 16 for vllm and 128 for sglang.

Also fix dynamo-run CUDA startup message to only print if we're using an engine that would benefit from it (mistralrs, llamacpp).

05765cd4

08 Mar, 2025 1 commit
- chore: rename dynamo (#44) · 602352ce
  Neelay Shah authored Mar 08, 2025
```
Co-authored-by: Biswa Panda <biswa.panda@gmail.com>
```
  602352ce
05 Mar, 2025 1 commit
- refactor: rename triton_distributed to dynemo (#22) · 1af7433b
  Neelay Shah authored Mar 05, 2025
```
Co-authored-by: Graham King <grahamk@nvidia.com>
```
  1af7433b
27 Feb, 2025 5 commits
- fix: add skip_serializing if none (#297) · b20ef999
  Paul Hendricks authored Feb 27, 2025
  
  b20ef999
- refactor: removes wrapper for ChatCompletionContent and adds documentation (#296) · 151a2a1d
  Paul Hendricks authored Feb 27, 2025
  
  151a2a1d
- refactor: rename ChatCompletionResponseDelta to NvCreateChatCompletionStreamResponse (#292) · 110f3f8c
  Paul Hendricks authored Feb 27, 2025
  
  110f3f8c
- refactor: rename ChatCompletionResponse to NvCreateChatCompletionResponse (#291) · c13ea718
  Paul Hendricks authored Feb 27, 2025
  
  c13ea718
- refactor: rename ChatCompletionRequest to NvCreateChatCompletionRequest (#284) · 96866f43
  Paul Hendricks authored Feb 27, 2025
  
  96866f43