- 13 Oct, 2025 1 commit
-
-
Harrison Saturley-Hall authored
Signed-off-by:Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
-
- 30 Sep, 2025 1 commit
-
-
Michael Feil authored
feat: python add abi compatability for cross-platform builds + add a unit test to HttpServer (#3044) Signed-off-by:
michaelfeil <me@michaelfeil.eu> Signed-off-by:
Michael Feil <63565275+michaelfeil@users.noreply.github.com> Signed-off-by:
root <root@michaelfeil2-dev-pod-b200-0.michaelfeil2-dev-pod-b200.baseten.svc.cluster.local> Signed-off-by:
root <root@michaelfeildns-dev-pod-h100-0.michaelfeildns-dev-pod-h100.baseten.svc.cluster.local> Co-authored-by:
root <root@michaelfeil2-dev-pod-b200-0.michaelfeil2-dev-pod-b200.baseten.svc.cluster.local> Co-authored-by:
root <root@michaelfeildns-dev-pod-h100-0.michaelfeildns-dev-pod-h100.baseten.svc.cluster.local>
-
- 25 Sep, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 24 Sep, 2025 1 commit
-
-
Harrison Saturley-Hall authored
Signed-off-by:Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
-
- 19 Sep, 2025 2 commits
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 02 Sep, 2025 2 commits
-
-
Ayush Agarwal authored
Signed-off-by:Ayush Agarwal <ayushag@nvidia.com>
-
Harrison Saturley-Hall authored
Signed-off-by:Harrison King Saturley-Hall <hsaturleyhal@nvidia.com>
-
- 22 Aug, 2025 1 commit
-
-
Ziqi Fan authored
-
- 20 Aug, 2025 1 commit
-
-
Dmitry Tokarev authored
-
- 19 Aug, 2025 2 commits
-
-
nachiketb-nvidia authored
Co-authored-by:Graham King <grahamk@nvidia.com>
-
Ryan Olson authored
Signed-off-by:
Ryan Olson <rolson@nvidia.com> Co-authored-by:
Olga Andreeva <oandreeva@nvidia.com> Co-authored-by:
Ziqi Fan <ziqif@nvidia.com> Co-authored-by:
John Thompson <jothomson@nvidia.com> Co-authored-by:
Richard Huo <rihuo@nvidia.com> Co-authored-by:
Zicheng Ma <zichengm@nvidia.com>
-
- 15 Aug, 2025 1 commit
-
-
Harrison Saturley-Hall authored
-
- 30 Jul, 2025 1 commit
-
-
Dmitry Tokarev authored
-
- 16 Jul, 2025 1 commit
-
-
Graham King authored
-
- 08 Jul, 2025 1 commit
-
-
Graham King authored
-
- 07 Jul, 2025 1 commit
-
-
Anant Sharma authored
-
- 01 Jul, 2025 1 commit
-
-
Paul Hendricks authored
-
- 30 Jun, 2025 1 commit
-
-
Paul Hendricks authored
-
- 14 Jun, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:
PeaBrane <yanrpei@gmail.com> Signed-off-by:
Yan Ru Pei <yanrpei@gmail.com> Signed-off-by:
jain-ria <riajain@NVIDIA.com> Co-authored-by:
coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com> Co-authored-by:
jain-ria <riajain@NVIDIA.com>
-
- 13 Jun, 2025 1 commit
-
-
Anant Sharma authored
-
- 29 May, 2025 1 commit
-
-
Anant Sharma authored
-
- 28 May, 2025 1 commit
-
-
Graham King authored
It was removed from the docs in 0.2.1 and replaced with writing a [standalone Python engine](https://github.com/ai-dynamo/dynamo/blob/main/docs/guides/dynamo_run.md#writing-your-own-engine-in-python). Also remove the associated `dynamo-run` feature `python`. Releasing this in 0.3.0 will resolve #784 and #1109.
-
- 19 May, 2025 1 commit
-
-
Jacky authored
-
- 16 May, 2025 1 commit
-
-
Ryan McCormick authored
-
- 09 May, 2025 1 commit
-
-
Harrison Saturley-Hall authored
-
- 25 Apr, 2025 1 commit
-
-
Harrison Saturley-Hall authored
Signed-off-by:Harrison Saturley-Hall <454891+saturley-hall@users.noreply.github.com>
-
- 09 Apr, 2025 1 commit
-
-
Anant Sharma authored
-
- 03 Apr, 2025 1 commit
-
-
Ryan Olson authored
Moved all of `lib/llm/src/engines` to their own crates as e.g. `lib/engines/mistralrs`. This will allow publishing of the `dynamo-llm` crate as it won't have any github dependencies. The only engines in dynamo-llm will be the demo `echo` ones. Co-authored-by:Graham King <grahamk@nvidia.com>
-
- 13 Mar, 2025 1 commit
-
-
Anant Sharma authored
-
- 10 Mar, 2025 1 commit
-
-
Anant Sharma authored
-
- 09 Mar, 2025 1 commit
-
-
Neelay Shah authored
Co-authored-by:
Harrison Saturley-Hall <454891+saturley-hall@users.noreply.github.com> Co-authored-by:
Harrison King Saturley-Hall <hsaturleyhal@nvidia.com>
-
- 08 Mar, 2025 1 commit
-
-
Neelay Shah authored
Co-authored-by:Biswa Panda <biswa.panda@gmail.com>
-
- 07 Mar, 2025 1 commit
-
-
Graham King authored
Instead of using `out=pystr:<my.py>` we can now do this: ``` dynemo-run out=pytok:/home/graham/my_python_engine.py --model-path <hf-repo-checkout> ``` That engine will receive and respond with tokens. Here's an example engine file: ``` import asyncio async def generate(request): yield {"token_ids":[791]} await asyncio.sleep(0.1) yield {"token_ids":[6864]} await asyncio.sleep(0.1) yield {"token_ids":[315]} await asyncio.sleep(0.1) yield {"token_ids":[9822]} await asyncio.sleep(0.1) yield {"token_ids":[374]} await asyncio.sleep(0.1) yield {"token_ids":[12366]} await asyncio.sleep(0.1) yield {"token_ids":[13]} ``` Also reduce duplication by making the bindings engine use the llm lib engine.
-
- 05 Mar, 2025 1 commit
-
-
Neelay Shah authored
Co-authored-by:Graham King <grahamk@nvidia.com>
-
- 27 Feb, 2025 1 commit
-
-
Anant Sharma authored
-
- 25 Feb, 2025 1 commit
-
-
Neelay Shah authored
Signed-off-by:
Neelay Shah <neelays@nvidia.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
- 18 Feb, 2025 1 commit
-
-
GuanLuo authored
Signed-off-by:
Neelay Shah <neelays@nvidia.com> Co-authored-by:
aflowers <aflowers@nvidia.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com> Co-authored-by:
hongkuanz <hongkuanz@nvidia.com> Co-authored-by:
Neelay Shah <neelays@nvidia.com>
-
- 12 Feb, 2025 1 commit
-
-
Ryan Olson authored
Signed-off-by:
Ryan Olson <ryanolson@users.noreply.github.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
- 11 Feb, 2025 1 commit
-
-
Anant Sharma authored
Co-authored-by:Ryan McCormick <rmccormick@nvidia.com>
-