MXFP4 support
This implements the Open Compute Project (OCP) Microscaling (MX) FP4 format as a tensor type, with backend implementations focused on mulmat and mulmatid for CPU, CUDA, and Metal.
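For reviewers unfamiliar with the format, here is a minimal reference sketch of how one MXFP4 block decodes. Per the OCP MX specification, a block holds 32 FP4 (E2M1) elements that share one 8-bit power-of-two scale (E8M0, bias 127). The function names and the nibble packing order (low nibble first within each byte) are assumptions for illustration only and may not match the in-memory layout this PR uses.

```python
import math

# Magnitudes representable by E2M1 (2 exponent bits, 1 mantissa bit),
# indexed by the low 3 bits of the 4-bit code. Bit 3 is the sign.
E2M1_MAGNITUDES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]

BLOCK_SIZE = 32  # elements per block in the OCP MX spec


def decode_e2m1(code: int) -> float:
    """Decode a single 4-bit E2M1 code to a float."""
    sign = -1.0 if code & 0x8 else 1.0
    return sign * E2M1_MAGNITUDES[code & 0x7]


def decode_e8m0(scale_byte: int) -> float:
    """Decode the shared E8M0 scale: 2^(byte - 127), with 0xFF reserved for NaN."""
    if scale_byte == 0xFF:
        return math.nan
    return 2.0 ** (scale_byte - 127)


def dequantize_block(scale_byte: int, packed: bytes) -> list[float]:
    """Dequantize one block of 32 elements stored as 16 packed bytes.

    Assumed (hypothetical) packing: element 2*i is the low nibble of
    byte i, element 2*i + 1 is the high nibble.
    """
    if len(packed) != BLOCK_SIZE // 2:
        raise ValueError("expected 16 packed bytes per block")
    scale = decode_e8m0(scale_byte)
    out = []
    for byte in packed:
        out.append(scale * decode_e2m1(byte & 0x0F))
        out.append(scale * decode_e2m1(byte >> 4))
    return out
```

Because the scale is a pure power of two, a mulmat kernel can accumulate products of the raw FP4 values within a block and apply the scale once per block rather than once per element.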