[Perf] Optimize `moe_align_block_size` CUDA kernel (#19572)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: mgoin <mgoin64@gmail.com>