Add unpermute-aware fused MoE path and small-batch fallback (#29354)
Signed-off-by: Runkai Tao <rt572@physics.rutgers.edu>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>