Add a CUDA kernel for fusing mapping and weighted sum for MoE. (#6916)
Co-authored-by:
Elfie Guo <elfiegxf@gmail.com>
Showing
sgl-kernel/csrc/common_extension.cc
100644 → 100755
sgl-kernel/csrc/moe/fp8_blockwise_moe_kernel.cu
100644 → 100755
sgl-kernel/csrc/moe/prepare_moe_input.cu
100644 → 100755
sgl-kernel/include/sgl_kernel_ops.h
100644 → 100755
Please register or sign in to comment