[NVIDIA] Update to leverage flashinfer trtllm FP4 MOE throughput kernel (#11563)
Signed-off-by:
jiahanc <173873397+jiahanc@users.noreply.github.com>
Showing
Please register or sign in to comment
Signed-off-by:
jiahanc <173873397+jiahanc@users.noreply.github.com>