qdq for quantization and include subgraph (#891)
Add operators, refactor parsers, add rewrite passes, add tests
Add ref implementations
Move broadcasting of scales and zero points to onnx parser
Allow for x and zero_point to have different types in quantizelinear; fix zero_point default type
fp16 and fp8 quantization to include subgraph and parameters
fix unit test to use qdq operators for int8 quantization
Co-authored-by:
turneram <alturner@amd.com>
Showing
This diff is collapsed.
src/quantize_fp16.cpp
0 → 100644
src/quantize_int8.cpp
0 → 100644
This diff is collapsed.
Please register or sign in to comment