• Graham King's avatar
    feat(dynamo-run): vllm and sglang subprocess engines (#954) · 28fd481c
    Graham King authored
    New vllm and sglang engines that run in a sub-process. Will hopefully replace the existing embedded python engines.
        
    Why?
        
      - Pure Python, does not require knowing Rust to work on it. Much simpler to maintain.
      - No embedded Python interpreter which avoids linking libpython and avoids the MacOS virtualenv issues.
      - Should have better performance as it's "native" vllm / sglang.
      - Works with any version of vllm (including v1!) and sglang. Less upgrade struggle.
    28fd481c
batch.rs 10.9 KB