Commits · 36172e6efd8929b696068f90df0ebba71d3ca912 · OpenDAS / dynamo

"vscode:/vscode.git/clone" did not exist on "ae2c03a94db9a4948a17a4defaf679c519e13d4e"

22 Apr, 2025 1 commit
- feat: add option to configure separate docker registry for pipelines docker images (#744) · 36172e6e
  julienmancuso authored Apr 22, 2025
  
  36172e6e
21 Apr, 2025 1 commit

chore(dynamo-run): Fix echo_core for EOS tokens (#759) · 4e75b04b

Graham King authored Apr 21, 2025

"echo_core" is an engine that echoes the post-processed request back to you so you can see the template. Good for testing. It needed an extra flag set to work correctly.

4e75b04b

18 Apr, 2025 4 commits
- chore: Remove TRT-LLM C++ engine in favor of Python one (#747) · 675a9bf5
  Graham King authored Apr 18, 2025
  
  675a9bf5
- feat(dynamo-engine-vllm): vllm 0.8.X support (#728) · a745a980
  Graham King authored Apr 18, 2025
```
It's different enough that I made a new engine vllm0_8 and renamed the previous engine to vllm0_7.

`dynamo-run out=vllm` now expects 0.8. This matches the container change in #690.

For older use `dynamo-run out=vllm0_7`.
```
  a745a980
- docs: add dedicated minikube guide (#735) · 9b05a5b7
  mohammedabdulwahhab authored Apr 17, 2025
  
  9b05a5b7
- fix: dynamo deploy helm chart cleanup (#727) · 831bc725
  mohammedabdulwahhab authored Apr 17, 2025
  
  831bc725
15 Apr, 2025 3 commits
- feat: replace dynamo server with dynamo cloud (#696) · da482c2f
  hhzhang16 authored Apr 15, 2025
  
  da482c2f
- docs: move deploy docs to docs/guides (#674) · 1c77531a
  hhzhang16 authored Apr 14, 2025
```
Signed-off-by: hhzhang16 <54051230+hhzhang16@users.noreply.github.com>
Co-authored-by: mohammedabdulwahhab <furkhan324@berkeley.edu>
```
  1c77531a
- docs: Use the same term for dynamo base image across code snippets and text (#670) · efb3e7d4
  Maksim Khadkevich authored Apr 14, 2025
```
Signed-off-by: Maksim Khadkevich <mkhadkevich@nvidia.com>
```
  efb3e7d4
11 Apr, 2025 3 commits

fix: Edit typos in dynamo deploy diagram (#615) · e7a49b03
mohammedabdulwahhab authored Apr 10, 2025

e7a49b03

feat: TRT-LLM disaggregated serving using UCX (#562) · da38e96a

Tanmay Verma authored Apr 10, 2025


Signed-off-by: Tanmay Verma <tanmay2592@gmail.com>
Signed-off-by: Tanmay Verma <tanmayv@nvidia.com>
Co-authored-by: Neelay Shah <neelays@nvidia.com>

da38e96a

docs: add to documentation for Kubernetes deployments, devcontainer improvements (#498) · 441846de

hhzhang16 authored Apr 10, 2025

441846de

09 Apr, 2025 3 commits

feat: Extract Common Configs + Log Configs on Init + Add `test_` to... · 0292feb5

jon-chuang authored Apr 09, 2025


feat: Extract Common Configs + Log Configs on Init + Add `test_` to `sdk/tests` filenames required for pytest (#434)
Co-authored-by: ishandhanani <82981111+ishandhanani@users.noreply.github.com>

0292feb5

docs: Move trtllm dynamo run doc from example to dynamo run guide (#578) · 0186aa7b
Tanmay Verma authored Apr 09, 2025

0186aa7b

docs: Updated dynamo run instructions (#555) · 16124e74

cdgamarose-nv authored Apr 09, 2025



#### Overview:

Updated the dynamo run doc `docs/guides/dynamo_run.md`

#### Details:

- Updated the instructions to make it clear which binary to use for built backends
- Reformatted the doc to make it more readable
- Added missing cmake library for ubuntu
Signed-off-by: Chantal D Gama Rose <cdgamarose@nvidia.com>

16124e74

08 Apr, 2025 1 commit
- docs: add disagg tuning guide (#413) · 0eacef76
  Hongkuan Zhou authored Apr 08, 2025
  
  0eacef76
07 Apr, 2025 1 commit
- docs: update close-deployment in dynamo_serve.md (#535) · df54b9cb
  tlipoca9 authored Apr 08, 2025
```
Co-authored-by: ishandhanani <82981111+ishandhanani@users.noreply.github.com>
```
  df54b9cb
03 Apr, 2025 1 commit
- docs: Remove invalid link (#506) · b865bd4f
  Graham King authored Apr 03, 2025
  
  b865bd4f
25 Mar, 2025 1 commit

feat: Allow passing any arguments to vllm and sglang engines (#368) · 670661f6

Graham King authored Mar 25, 2025

Put the arguments in a JSON file:
```
{
    "dtype": "half",
    "trust_remote_code": true
}
```

Pass it like this:
```
dynamo-run out=sglang ~/llm_models/Llama-3.2-3B-Instruct --extra-engine-args sglang_extra.json
```

Requested here https://github.com/ai-dynamo/dynamo/issues/290 (`dtype`) and here https://github.com/ai-dynamo/dynamo/issues/360 (`trust_remote_code`).

670661f6

24 Mar, 2025 1 commit

feat: Build pre-processor from GGUF (#344) · c7067fc2

Graham King authored Mar 24, 2025

This lets us do:
```
dynamo-run out=llamacpp <gguf_file>
```

Previously a `--model-config <hf-repo>` was also required, to configure our tokenizer.

c7067fc2

21 Mar, 2025 3 commits
- chore: Clarified docs, added more informative error prints (#342) · 1831c9cc
  Olga Andreeva authored Mar 21, 2025
```
Co-authored-by: Olga Andreeva <oandreeva@oandreeva-mlt.client.nvidia.com>
```
  1831c9cc
- docs: fix typo in dynamo_serve.md (#314) · 9242cfa0
  Ikko Eltociear Ashimine authored Mar 22, 2025
  
  9242cfa0
- docs: Update main and guide readmes (#332) · 66c6330a
  Harry Kim authored Mar 21, 2025
```
Co-authored-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
```
  66c6330a
20 Mar, 2025 1 commit

chore: Make debug profile use all optimizations (#317) · 00e54337

Graham King authored Mar 20, 2025

It hardly slows the build down, and it makes things run much faster. That allows us to switch to the debug (default) profile for development, and keep the release profile for, well, releasing.

Motivated by changes in https://github.com/ai-dynamo/dynamo/pull/279

00e54337

19 Mar, 2025 2 commits
- feat: `Frontend` component uses served_model_name instead of model (#302) · 1f6ccc7f
  ishandhanani authored Mar 19, 2025
  
  1f6ccc7f
- docs: Move back dynamo deploy file to the guides subfolder in docs (#295) · 48a59890
  mohammedabdulwahhab authored Mar 19, 2025
```
Co-authored-by: mabdulwahhab <mabdulwahhab@nvidia.com>
```
  48a59890
18 Mar, 2025 6 commits
- docs: dynamo serve guide (#270) · a5113e46
  ishandhanani authored Mar 18, 2025
```
Co-authored-by: Dmitry Tokarev <dtokarev@nvidia.com>
```
  a5113e46
- docs: Clean up of readme for deploying to K8s using helm (#266) · 610ef375
  mohammedabdulwahhab authored Mar 18, 2025
```
Co-authored-by: mabdulwahhab <mabdulwahhab@nvidia.com>
Co-authored-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
```
  610ef375
- docs(dynamo-run): Move README into docs/guides/ , add Quickstart (#265) · 40c55a24
  Graham King authored Mar 18, 2025
  
  40c55a24
- docs: fix links in docs (#256) · 548578f4
  Dmitry Tokarev authored Mar 18, 2025
```
Co-authored-by: Anant Sharma <anants@nvidia.com>
```
  548578f4
- fix: created documentation to deploy_to_k8s_using_helm (#245) · 3983830e
  Maksim Khadkevich authored Mar 17, 2025
```
Co-authored-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
```
  3983830e
- docs: update guides and filenames (#252) · c2a6b368
  Suman Tatiraju authored Mar 17, 2025
  
  c2a6b368
17 Mar, 2025 8 commits
- docs: add guides to docs (#243) · 9be75482
  Suman Tatiraju authored Mar 17, 2025
```
Co-authored-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
```
  9be75482
- docs: add disclaimer about examples (#236) · e1553c39
  Alec authored Mar 17, 2025
```
Co-authored-by: Harrison Saturley-Hall <454891+saturley-hall@users.noreply.github.com>
```
  e1553c39
- docs: Add kv cache manager documentation (#225) · 793d024e
  Suman Tatiraju authored Mar 17, 2025
```
Co-authored-by: Vikram Sharma <vsm2@illinois.edu>
Co-authored-by: Ziqi Fan <ziqif@nvidia.com>
Co-authored-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
```
  793d024e
- docs: first draft kv-router doc (#228) · d788b63e
  Alec authored Mar 17, 2025
```
Co-authored-by: GuanLuo <41310872+GuanLuo@users.noreply.github.com>
Co-authored-by: Sean <choishsean@gmail.com>
Co-authored-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
```
  d788b63e
- docs: remove future plans (#235) · fa373c19
  Suman Tatiraju authored Mar 17, 2025
  
  fa373c19
- docs: Fix links (#233) · a14bafa2
  Suman Tatiraju authored Mar 17, 2025
  
  a14bafa2
- docs: update images to high res (#230) · 8eedb807
  Suman Tatiraju authored Mar 17, 2025
  
  8eedb807
- docs: replace with mermaid (#217) · cea1902d
  Hongkuan Zhou authored Mar 17, 2025
  
  cea1902d