ONNX: Fix FP8 quantization for the second MLP in LayerNormMLP (#2577)
ONNX: Fix FP8 quantization for the second MLP in LayernormMLP
Signed-off-by:
Victor Oliveira <victor.oliveira@getcruise.com>
Showing
Please register or sign in to comment