[feat] Support different attention backends for prefill and decode (#6338)
Co-authored-by: tianqilin.99 <tianqilin.99@bytedance.com>
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>