[Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942)

Co-authored-by: Andrew Feldman <afeld2012@gmail.com>
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
New file: vllm/worker/utils.py (mode 0 → 100644)