[Quant] Support MXFP4 W4A16 for compressed-tensors MoE models (#32285)
Signed-off-by:Dipika Sikka <dipikasikka1@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
Showing
Please register or sign in to comment
Signed-off-by:Dipika Sikka <dipikasikka1@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>