Cast BF16 input/output types for FP8 Q/DQ ONNX ops (#165)
add cast for BF16 input/output types for Q/DQ ONNX ops Signed-off-by:Asfiya Baig <asfiyab@nvidia.com> Co-authored-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Showing
Please register or sign in to comment