"deploy/cpp_infer/src_det/ocr_det.cpp" did not exist on "ed52619f4b4791fda71ff4b22f517e04c78bd3b8"
Optimization for gridwise group norm (#453)
* use another instance to check the efficiency
* optimize group layer norm
* 1. coalesce load/store data for gridwise layer norm welford. 2. move a sqrt and divison into a outer static loop
* add more instances to layernorm
* add 2 more test cases
* remove ignore in generating tuple of vector
Co-authored-by:
Chao Liu <chao.liu2@amd.com>
Showing
Please register or sign in to comment