- 06 May, 2025 1 commit
-
-
Graham King authored
Approved by OSRB in Slack. Note we don't check for the closing delimiter to allow the longer copyright format. Motivation is that it reduces the context usage by 12 lines for every file in the project. That helps things like Cursor and Claude Code fit more, go faster, and cost less.
-
- 09 Apr, 2025 1 commit
-
-
Suman Tatiraju authored
-
- 06 Mar, 2025 1 commit
-
-
Ryan McCormick authored
-
- 25 Feb, 2025 1 commit
-
-
Neelay Shah authored
Signed-off-by:
Neelay Shah <neelays@nvidia.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
- 20 Feb, 2025 1 commit
-
-
Biswa Panda authored
-
- 14 Feb, 2025 1 commit
-
-
Graham King authored
This allows us to run a real model. Build: ``` cargo build --release --features mistralrs,cuda ``` Run: ``` ./target/release/tio in=text out=mistralrs --model-path Llama-3.2-1B-Instruct-Q4_K_M.gguf ``` Why [mistral.rs](https://github.com/EricLBuehler/mistral.rs)? - It has no dependencies. You don't need a container or a virtual env to get started. - It supports CUDA, Metal (MacOS) and CPU-only. Everyone can join the AI revolution. - It starts fast and serves fast (with CUDA). That makes it fun to experiment with. - It runs many models, not just Mistral, that's just it's name.
-
- 10 Feb, 2025 1 commit
-
-
Graham King authored
-
- 06 Feb, 2025 1 commit
-
-
J Wyman authored
-
- 05 Feb, 2025 1 commit
-
-
J Wyman authored
-