[Kernel] Support W8A8 channel-wise weights and per-token activations in triton...
[Kernel] Support W8A8 channel-wise weights and per-token activations in triton fused_moe_kernel (#16366)
Signed-off-by:
mgoin <mgoin64@gmail.com>
Showing
tests/kernels/utils_block.py
0 → 100644
This diff is collapsed.
Please register or sign in to comment