"vllm/entrypoints/openai/responses/context.py" did not exist on "955c624915d66e42525f9b6e8e26a51d3892be6f"
-
afeldman-nm authored
[Kernel] Correctly invoke prefill & decode kernels for cross-attention (towards eventual encoder/decoder model support) (#4888) Co-authored-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
543aa485