"vllm/entrypoints/pooling/embed/serving.py" did not exist on "78c13e30e1641869672b4c5fb7685d04e58ca1df"
Use paged_attention_v1 for sliding window decode in rocm_aiter_fa (#34378)
Signed-off-by: Martin Yuan <myuan@meta.com>
Co-authored-by: Martin Yuan <myuan@meta.com>