[TPU] kv cache update kernel doesn't need to be padded slices to multiple of...
[TPU] kv cache update kernel doesn't need to be padded slices to multiple of num_slices_per_block (#22394) Signed-off-by:Chengji Yao <chengjiyao@gmail.com> Co-authored-by:
Chengji Yao <chengjiyao@gmail.com>
Showing
Please register or sign in to comment