- 22 Jun, 2022 1 commit
-
-
Ted Themistokleous authored
Updated each source file in the repo with the existing license.
-
- 11 May, 2022 1 commit
-
-
Paul Fultz II authored
Fuse layernorm and added triadd_layernorm fusion. This is a prep performance booster
-
- 25 Feb, 2021 1 commit
-
-
Paul Fultz II authored
* Add code object op * Formattting * Add more value tests * Formatting * Fix from_value conversion from binary * Formatting * Dont use offload copy * Remove iostream header Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 18 Nov, 2020 1 commit
-
-
Paul Fultz II authored
* Use generic op for eliminate_pad * Formatting * Improve error when loading a missing operator * Add more enum tests * Add more tests for constructing an op * Formatting * Fix failed tests * Avoid duplicate branches * Format file * Default initialize variable Co-authored-by:
Shucai Xiao <shucai.xiao@amd.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 26 Aug, 2020 1 commit
-
-
Paul Fultz II authored
* Add make_op function * Formatting * Add more values * Formatting * Remove templates parse_conv functions * Formatting * Remove mat_mul template * Formatting * Reduce header includes * Fix compiling for gpu * Formatting * Use make_op in lowering * Formatting * Sort lines * Formatting * Add more tests * Formatting * Fix tidy error * Formatting * Add const refs * Add explicit this * Add more const refs * Sort the program * Remove commented out code * Formatting * Infer gpu prefix * Formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-