[Bugfix][Spec Decode] Fix wrong valid_mask for padded speculation when chunked prefill occurs (#26231)

Signed-off-by: seven-mile <i@7li.moe>
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Co-authored-by: Benjamin Chislett <bchislett@nvidia.com>