[Core] FlashInfer CUTLASS fused MoE backend (NVFP4) (#20037)
Signed-off-by:shuw <shuw@nvidia.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
Showing
vllm/utils/flashinfer.py
0 → 100644
Please register or sign in to comment