"src/git@developer.sourcefind.cn:lacacy/qwen_lmdeploy.git" did not exist on "71ade772a4a54d9828fe70d398e900fa5d48f6cc"
  • Shucai Xiao's avatar
    qdq for quantization and include subgraph (#891) · b45f7239
    Shucai Xiao authored
    
    
    Add operators, refactor parsers, add rewrite passes, add tests
    Add ref implementations
    Move broadcasting of scales and zero points to onnx parser
    Allow for x and zero_point to have different types in quantizelinear; fix zero_point default type
    fp16 and fp8 quantization to include subgraph and parameters
    fix unit test to use qdq operators for int8 quantization
    Co-authored-by: default avatarturneram <alturner@amd.com>
    b45f7239
program.cpp 27.2 KB