"examples/offline_inference/cli.py" did not exist on "aba8d6ee006b78149ac4514f460e4038b2d4f607"
Add TRTLLM MoE NVFP4 kernel to CompressedTensorsW4A4MoeMethod (#28892)
Signed-off-by: mingyuanm <mingyuanm@nvidia.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: mgoin <mgoin64@gmail.com>