[MXFP4] Fix bugs and optimize exponential operation (#750)
* [MXFP4] Fix bugs
- Optimize exp2 with a shift operation to improve performance
- Fix a bug in the simple dequantization function call
- Fix a bug in the scaling factor with bias
* [Lint]
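
For illustration only, the shift-based exp2 idea mentioned above can be sketched in Python. This is not the code changed in this commit: the function name `exp2_shift` is hypothetical, and the sketch assumes a non-negative integer exponent, which is the case where 2**e equals a left shift of 1.

```python
def exp2_shift(e: int) -> int:
    """Return 2**e for a non-negative integer exponent using a left shift.

    Hypothetical helper for illustration; not taken from the commit.
    """
    if e < 0:
        # A left shift cannot represent fractional powers of two.
        raise ValueError("shift form only covers non-negative exponents")
    return 1 << e


# The shift result matches ordinary exponentiation for integer exponents.
for e in range(0, 16):
    assert exp2_shift(e) == 2 ** e
```

The shift avoids a general-purpose exponential call when the exponent is already an integer, which is typically why this substitution is faster.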
---------
Co-authored-by: LeiWang1999 <leiwang1999@outlook.com>