[Perf] Eliminate padding and slicing op for GPT-OSS with Flashinfer MXFP4 MXFP8 MoE (#30647)
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>