Use paged_attention_v1 for sliding window decode in rocm_aiter_fa (#34378)
Signed-off-by:Martin Yuan <myuan@meta.com> Co-authored-by:
Martin Yuan <myuan@meta.com>
Showing
Please register or sign in to comment
Signed-off-by:Martin Yuan <myuan@meta.com> Co-authored-by:
Martin Yuan <myuan@meta.com>