[Kernel] Expand MoE weight loading + Add Fused Marlin MoE Kernel (#7527)
Co-authored-by: ElizaWszola <eliza@neuralmagic.com>
csrc/moe/marlin_moe_ops.cu
0 → 100644
csrc/moe/marlin_moe_ops.h
0 → 100644