Use `optimize_module` pass for the quantization to fp16 (#1974)
Fixes #1746 BatchNorm only has x as the runtime input parameter for the following equation. All the other parameters are compile-time constants and related operations can be const-folded before quantizing to fp16 to preserve precision.
Showing
Please register or sign in to comment