[CI/Build] tests(v1): feed Triton attention the (num_blocks, 2, …) KV cache...
[CI/Build] tests(v1): feed Triton attention the (num_blocks, 2, …) KV cache layout in backend-correctness tests (#26663) Signed-off-by:Huamin Li <3ericli@gmail.com> Co-authored-by:
Ye (Charlotte) Qi <yeq@meta.com>
Showing
Please register or sign in to comment