Commits · 670661f6fa85f6ffc77433d21b363285d6cec32f · OpenDAS / dynamo

"lib/bindings/cpp/nvllm-trt/src/engine_trt/config.hpp" did not exist on "e584e96f9b584c750e2f4e9b1073e3cade8c1c9a"

25 Mar, 2025 1 commit

feat: Allow passing any arguments to vllm and sglang engines (#368) · 670661f6

Graham King authored Mar 25, 2025

Put the arguments in a JSON file:
```
{
    "dtype": "half",
    "trust_remote_code": true
}
```

Pass it like this:
```
dynamo-run out=sglang ~/llm_models/Llama-3.2-3B-Instruct --extra-engine-args sglang_extra.json
```

Requested here https://github.com/ai-dynamo/dynamo/issues/290 (`dtype`) and here https://github.com/ai-dynamo/dynamo/issues/360 (`trust_remote_code`).

670661f6

24 Mar, 2025 1 commit

feat: Build pre-processor from GGUF (#344) · c7067fc2

Graham King authored Mar 24, 2025

This lets us do:
```
dynamo-run out=llamacpp <gguf_file>
```

Previously a `--model-config <hf-repo>` was also required, to configure our tokenizer.

c7067fc2

19 Mar, 2025 1 commit
- fix(dynamo-run): Fix build if llamacpp and mistralrs are disabled (#262) · 3ac95a90
  Graham King authored Mar 19, 2025
  
  3ac95a90
14 Mar, 2025 1 commit

fix(mac): Fix for virtual env (#164) · 4f7f4b40

Graham King authored Mar 14, 2025

On Mac embedded python interpreters don't pick up the virtual env. This seems to be a known problem. Fix the sys.path.

4f7f4b40

04 Mar, 2025 1 commit
- feat: vllm engine tensor parallel and pipeline parallel (#16) · a657ec61
  Graham King authored Mar 04, 2025
```
Needs more testing but good enough for now. I get the same results with this as with `vllm serve`.
```
  a657ec61
28 Feb, 2025 1 commit
- feat: vllm engine (#308) · 6e0cfbd9
  Graham King authored Feb 28, 2025
```
triton-distributed-llm component and support in tio
```
  6e0cfbd9