Feature fast cast-only mxfp8 (#2062)
* refactor mxfp8_cast_only kernel Signed-off-by:Jianbing Dong <jianbingd@nvidia.com> * fix ptx.cuh after format Signed-off-by:
Jianbing Dong <jianbingd@nvidia.com> --------- Signed-off-by:
Jianbing Dong <jianbingd@nvidia.com> Co-authored-by:
Oleg Goncharov <64355998+Oleg-Goncharov@users.noreply.github.com>
Showing
This diff is collapsed.
This diff is collapsed.
Please register or sign in to comment