- 02 Apr, 2025 2 commits
-
-
Tushar Sharma authored
-
Tushar Sharma authored
-
- 01 Apr, 2025 2 commits
-
-
Tushar Sharma authored
-
Pavithra Vijayakrishnan authored
-
- 31 Mar, 2025 2 commits
-
-
Pavithra Vijayakrishnan authored
-
Pavithra Vijayakrishnan authored
-
- 20 Mar, 2025 1 commit
-
-
Meenakshi Sharma authored
-
- 19 Mar, 2025 1 commit
-
-
Graham King authored
This makes the Rust parts all use ring / rustls library instead of local install of openssl. It's a step on the journey to being statically linked. Pieces: - `tokenizers` and `mistralrs` now support rustls (mistralrs by default, tokenizers with feature flag). - Move shared dependencies up into workspace - New `rand` crate has some renames for future rust - Ensure the dependency doesn't creep back in by enforcing it with cargo deny.
-
- 17 Mar, 2025 3 commits
-
-
Harrison Saturley-Hall authored
Co-authored-by:Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
-
Anant Sharma authored
-
Harrison Saturley-Hall authored
-
- 15 Mar, 2025 1 commit
-
-
Harrison Saturley-Hall authored
-
- 14 Mar, 2025 2 commits
-
-
Graham King authored
Engines mistralrs, sglang and vllm included by default. Can be disabled like this: `cargo build --no-default-features --features <add-back-what-you-want>`. Added `--feature vulkan` option, for llamacpp. Build time message if CUDA or Metal would help and are missing. That's the best we can do: > warning: dynamo-run@0.1.0: CUDA not enabled, re-run with `--features cuda` Runtime message if CUDA, Metal or Vulkan are enabled: > 2025-03-14T21:59:26.501937Z INFO dynamo_run: CUDA on Runtime message if they are missing: > 2025-03-14T22:02:37.439404Z INFO dynamo_run: CPU mode. Rebuild with `--features cuda|metal|vulkan` for better performance Defaut engine message includes available engines: > 2025-03-14T21:59:26.503612Z INFO dynamo_run: Using default engine: mistralrs. Use out=<engine> to specify one of echo_core, echo_full, mistralrs, llamacpp, sglang, vllm, pystr, pytok The really important outcome is that this should now "just work": ``` cargo install dynamo-run dynamo-run Qwen/Qwen2.5-3B-Instruct ``` Sadly you still need `--features cuda|metal` for performance, I couldn't automate that.
-
Pavithra Vijayakrishnan authored
-
- 13 Mar, 2025 1 commit
-
-
Anant Sharma authored
-
- 12 Mar, 2025 1 commit
-
-
Maksim Khadkevich authored
-
- 11 Mar, 2025 2 commits
-
-
Neelay Shah authored
-
Neelay Shah authored
Co-authored-by:Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
-
- 10 Mar, 2025 1 commit
-
-
Harrison Saturley-Hall authored
-
- 09 Mar, 2025 1 commit
-
-
Harrison Saturley-Hall authored
-
- 08 Mar, 2025 3 commits
-
-
Harrison Saturley-Hall authored
-
Neelay Shah authored
Co-authored-by:Biswa Panda <biswa.panda@gmail.com>
-
Pavithra Vijayakrishnan authored
-
- 07 Mar, 2025 1 commit
-
-
Neelay Shah authored
-
- 06 Mar, 2025 2 commits
-
-
Anant Sharma authored
-
Ryan McCormick authored
-
- 05 Mar, 2025 3 commits
-
-
Neelay Shah authored
Co-authored-by:Graham King <grahamk@nvidia.com>
-
Harrison Saturley-Hall authored
-
Graham King authored
-
- 04 Mar, 2025 11 commits
-
-
ishandhanani authored
-
Harrison Saturley-Hall authored
-
Harrison King Saturley-Hall authored
-
Harrison King Saturley-Hall authored
-
Harrison King Saturley-Hall authored
-
Harrison King Saturley-Hall authored
-
Anant Sharma authored
-
Anant Sharma authored
-
Harrison King Saturley-Hall authored
-
Harrison King Saturley-Hall authored
-
Harrison King Saturley-Hall authored
-