[Feature][Quantization] MXFP4 support for MOE models (#17888)
Signed-off-by:Felix Marty <felmarty@amd.com> Signed-off-by:
Bowen Bao <bowenbao@amd.com> Signed-off-by:
Felix Marty <Felix.Marty@amd.com> Co-authored-by:
Bowen Bao <bowenbao@amd.com>
Showing
Please register or sign in to comment