Fuse skip layernorm (#683)
* Unify the vectorized and non-vectorized path
* Formatting
* Make fusion easily extendable
* Add skip layernorm fusion
* Formatting
* Call correct layernorm function
* Fix compile errors
* Add DCE
* Add test for skip layernorm
* Formatting
* Remove unused typedef
* Formatting
* Fix tidy issues
* Formatting
Co-authored-by:
Shucai Xiao <shucai.xiao@amd.com>
Showing
src/include/migraphx/onnx.hpp
100644 → 100755
File mode changed from 100644 to 100755
src/targets/gpu/device/layernorm.cpp
100644 → 100755
src/targets/gpu/fuse_ops.cpp
100644 → 100755
Please register or sign in to comment