Support non-contiguous KV cache in TRTLLM fp8 dequant kernel (#36867)
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
Signed-off-by: Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com>
Co-authored-by: Pavani Majety <pavanimajety@gmail.com>