- 03 Apr, 2025 6 commits
-
-
Tanmay Verma authored
Co-authored-by:Neelay Shah <neelays@nvidia.com>
-
Graham King authored
-
hanweisen authored
Signed-off-by:hanweisen <hanweisen_yewu@cmss.chinamobile.com>
-
Graham King authored
-
tlipoca9 authored
-
Ryan Olson authored
-
- 02 Apr, 2025 8 commits
-
-
julienmancuso authored
-
Tushar Sharma authored
-
Tushar Sharma authored
-
Harrison Saturley-Hall authored
-
mohammedabdulwahhab authored
Co-authored-by:mabdulwahhab <mabdulwahhab@nvidia.com>
-
jon-chuang authored
Co-authored-by:Ryan Olson <ryanolson@users.noreply.github.com>
-
Ryan Olson authored
-
julienmancuso authored
-
- 01 Apr, 2025 7 commits
-
-
ishandhanani authored
Co-authored-by:hongkuanz <hongkuanz@nvidia.com>
-
Tushar Sharma authored
-
Ryan Olson authored
-
hhzhang16 authored
-
ishandhanani authored
-
Kiv Chen authored
-
Pavithra Vijayakrishnan authored
-
- 31 Mar, 2025 6 commits
-
-
Pavithra Vijayakrishnan authored
-
Pavithra Vijayakrishnan authored
-
Ryan Olson authored
-
Graham King authored
-
Tianer Zhou authored
Signed-off-by:Tianer Zhou <ezhoureal@gmail.com>
-
jon-chuang authored
-
- 28 Mar, 2025 1 commit
-
-
Biswa Panda authored
-
- 27 Mar, 2025 1 commit
-
-
mohammedabdulwahhab authored
-
- 26 Mar, 2025 6 commits
-
-
Ryan Olson authored
-
mohammedabdulwahhab authored
-
ptarasiewiczNV authored
-
Hongkuan Zhou authored
-
Dmitry Tokarev authored
-
Yan Ru Pei authored
-
- 25 Mar, 2025 2 commits
-
-
Sean SH Choi authored
-
Graham King authored
Put the arguments in a JSON file: ``` { "dtype": "half", "trust_remote_code": true } ``` Pass it like this: ``` dynamo-run out=sglang ~/llm_models/Llama-3.2-3B-Instruct --extra-engine-args sglang_extra.json ``` Requested here https://github.com/ai-dynamo/dynamo/issues/290 (`dtype`) and here https://github.com/ai-dynamo/dynamo/issues/360 (`trust_remote_code`).
-
- 24 Mar, 2025 3 commits
-
-
Yiming Cheng authored
-
Graham King authored
This lets us do: ``` dynamo-run out=llamacpp <gguf_file> ``` Previously a `--model-config <hf-repo>` was also required, to configure our tokenizer.
-
Hongkuan Zhou authored
-