"...quantization/git@developer.sourcefind.cn:SIYIXNI/vllm.git" did not exist on "c0c2335ce027486d254c31f665ce00d7db427d22"
Bert fuse slice reshape trans contiguous (#542)
* fix pad calc * Add decompose pass * Add decompose test * Formatting * bert tf passes correctness * formatting * Add remap * Formatting * add test * formatting * remove comment * Add compute method for dot * Formatting * add inline * Add finder for horizontal fusion * Formatting * Formatting * Reuse predicate * formatting * fix order for literal * formatting * add test for gelu * formatting * added add_gelu fusion * Add gemm fusions * Formatting * add files * formatting * test no mul_add * formatting * progress on div * formatting * continue work on pass * remove layernorm opt * revert reduce file * Add some fixes for convolution * Formatting * Fix shape tests * Formatting * Reuse axis equal * Add initial split fusion * Formatting * Update offset * Workaround outputs that cant accept nonstandard shapes * Formatting * Add check for split concat * Formatting * Add missing headers * Formatting * Add tests * Formatting * add optimization for bert * code backup for bert optimization * continue testing * formatting * fix matcher * formatting * add gelu_fn and tests * formatting * fix matcher, remove extra tests * formatting * fix matcher * add missing files * add find_layernorm * add add_transpose to cmake file * code backup for the contigous fusion * refine ops fusion * clang format * fixed bug in previous optimization * clang format * add more optimization * remove unnecessary code * refinement of the fustion code * clang format * fixed a bug * add used_once * formatting * start on new gelu * formatting * add matchers in fuse_ops * formatting * add dce to fix add_gelu * add simplify_rsqrt and test * formatting * debugging value for matcher * formatting * add more to matchers * formatting * fix errors * remove onnx gen * add any_arg, change matchers to use either_arg * formatting * clang format * formatting * add used_once * formatting * code cleanup * clang format * fixed a bug * remove unnecessary code * refine comments * optimize bert to remove more contiguous * clang format * remove unnecessary code * add unit tests for bert optimization * clang format * fix review comments * clang format * refine a fusion of reshape and slice * clang format * fix cppcheck error * fix review comments * add the fusion of slice and transpose * clang format * add another optimization to fuse slice and transpose * clang format * fix review comments * clang format * fix review comments * clang format * fix review comments Co-authored-by:Khalique <15948690+kahmed10@users.noreply.github.com> Co-authored-by:
Paul <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com> Co-authored-by:
Shucai Xiao <scxiao@prj47-rack-99.local.lan>
Showing
Please register or sign in to comment