- 01 Apr, 2025 4 commits
-
-
hhzhang16 authored
-
ishandhanani authored
-
Kiv Chen authored
-
Pavithra Vijayakrishnan authored
-
- 31 Mar, 2025 6 commits
-
-
Pavithra Vijayakrishnan authored
-
Pavithra Vijayakrishnan authored
-
Ryan Olson authored
-
Graham King authored
-
Tianer Zhou authored
Signed-off-by:Tianer Zhou <ezhoureal@gmail.com>
-
jon-chuang authored
-
- 28 Mar, 2025 1 commit
-
-
Biswa Panda authored
-
- 27 Mar, 2025 1 commit
-
-
mohammedabdulwahhab authored
-
- 26 Mar, 2025 6 commits
-
-
Ryan Olson authored
-
mohammedabdulwahhab authored
-
ptarasiewiczNV authored
-
Hongkuan Zhou authored
-
Dmitry Tokarev authored
-
Yan Ru Pei authored
-
- 25 Mar, 2025 2 commits
-
-
Sean SH Choi authored
-
Graham King authored
Put the arguments in a JSON file: ``` { "dtype": "half", "trust_remote_code": true } ``` Pass it like this: ``` dynamo-run out=sglang ~/llm_models/Llama-3.2-3B-Instruct --extra-engine-args sglang_extra.json ``` Requested here https://github.com/ai-dynamo/dynamo/issues/290 (`dtype`) and here https://github.com/ai-dynamo/dynamo/issues/360 (`trust_remote_code`).
-
- 24 Mar, 2025 4 commits
-
-
Yiming Cheng authored
-
Graham King authored
This lets us do: ``` dynamo-run out=llamacpp <gguf_file> ``` Previously a `--model-config <hf-repo>` was also required, to configure our tokenizer.
-
Hongkuan Zhou authored
-
Graham King authored
That ensures it gets removed when the process stops.
-
- 22 Mar, 2025 1 commit
-
-
Yiming Cheng authored
-
- 21 Mar, 2025 6 commits
-
-
Olga Andreeva authored
Co-authored-by:Olga Andreeva <oandreeva@oandreeva-mlt.client.nvidia.com>
-
Dmitry Tokarev authored
-
zhaohaidao authored
-
Ikko Eltociear Ashimine authored
-
Harry Kim authored
Co-authored-by:Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
-
Anant Sharma authored
-
- 20 Mar, 2025 5 commits
-
-
Yan Ru Pei authored
-
Meenakshi Sharma authored
-
Graham King authored
It hardly slows the build down, and it makes things run much faster. That allows us to switch to the debug (default) profile for development, and keep the release profile for, well, releasing. Motivated by changes in https://github.com/ai-dynamo/dynamo/pull/279
-
Nora authored
Add `AsMut`, `DerefMut` and `IntoIterator` trait impl for the `Tokens` structure. Signed-off-by:
nora-coder-dot <nora6677@gmail.com> Co-authored-by:
nora-coder-dot <nora6677@gmail.com>
-
gujing authored
Signed-off-by:zibai <zibai.gj@alibaba-inc.com>
-
- 19 Mar, 2025 4 commits
-
-
ishandhanani authored
-
ishandhanani authored
-
Elton Leander Pinto authored
Co-authored-by:Ryan Olson <ryanolson@users.noreply.github.com>
-
Anant Sharma authored
Co-authored-by:Dmitry Tokarev <dtokarev@nvidia.com>
-