[Perf] Use NVIDIA hardware-accelerated instruction for float to fp8_e4m3 quantization (#24757)
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>