Fp8 paged attention update (#22222)
Signed-off-by:Xiao Yu <xiao.yu@amd.com> Signed-off-by:
xiao-llm <xiao.yu.dc@outlook.com> Co-authored-by:
Xiao Yu <xiao.yu@metamaterial.com> Co-authored-by:
Xiao Yu <xiao.yu@amd.com> Co-authored-by:
Bowen Bao <bowenbao@amd.com>
Showing
This diff is collapsed.
Please register or sign in to comment