"vllm/vscode:/vscode.git/clone" did not exist on "a01df4a65f2518e28b98b66eea9820567b9c1c29"
[Kernel] Correctly invoke prefill & decode kernels for cross-attention...
[Kernel] Correctly invoke prefill & decode kernels for cross-attention (towards eventual encoder/decoder model support) (#4888)
Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
Showing
Please register or sign in to comment