"docs/source/git@developer.sourcefind.cn:OpenDAS/nni.git" did not exist on "c6c361d80ada8117e926bd24f71f50bb5da9f0b3"
Low-bit kernels fix and implementation (#704)
* [MXFP4] Dequantize FP4 kernel example, MX scale todo
* [BugFix] Fix the bug of fp4&fp16 exponential bias
* [MXFP4] Add group scale factor for BF16xMXFP4 gemm
* [Lint]
* [Test] Add test script for BF16xMXFP4 gemm
* [Lint]
* [BugFix] Fix the shape of scale tensor
* Update example_dequant_gemm_fp4_hopper.py

---------

Co-authored-by: LeiWang1999 <leiwang1999@outlook.com>
Co-authored-by: Lei Wang <34334180+LeiWang1999@users.noreply.github.com>