- 04 Nov, 2020 1 commit
-
-
Paul Fultz II authored
* Add all_targets cmake target * Rename target * Add ref target * Rename tests * Refactor compiler target * Formatting * Verify for every target * Formatting * Add verify test suite * Formatting * Add initial test programs * Formatting * Add rnn tests * Formatting * Validate gpu * Formatting * Remove old gpu tests * Fix gpu tests * Fix ref error * Fix tidy issues * Formatting * Tidy fixes * Fix header in python api * Rename to ref * Use ref in verify_onnx * Fix tidy issue * Build with verbose on * Fix typo * Remove verbose * rename some cpu prefix to ref Co-authored-by:Shucai Xiao <Shucai.Xiao@amd.com>
-
- 25 Aug, 2020 1 commit
-
-
Paul Fultz II authored
* Use increment instead of division to compute register offset * Formatting * Limit layernorm to 1024 elements * Formatting * Add verification to driver * Formatting * Remove early return * Use block_size 256 * Vectorize the kernel * Formatting * Convert to vector type * Add layernorm tests * Formatting * Formatting * Refactor layernorm to run both algos * Formatting * Fix compile error * Fix tidy warnings * Formatting * Add layernorm function * Formatting
-
- 21 Jul, 2020 1 commit
-
-
Paul Fultz II authored
* Fix bug in eliminate_concat with negative axis * Formatting * Fix unused parameter * Formatting Co-authored-by:Shucai Xiao <shucai.xiao@amd.com>
-
- 14 Apr, 2020 1 commit
-
-
Shucai Xiao authored
* code changes for small bugs * clang format * remove standard shape requirement for transpose. * add a unit test * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 15 Feb, 2020 1 commit
-
-
Shucai Xiao authored
* Add initial api * Formatting * Add more api * Formatting * Add auto api generation * Formatting * Fix some compilation errors * Change handle struct * Formatting * Fix reamining compilation errors * Formatting * Simplify using ctype * Formatting * Initial c++ generation * Formatting * Add C++header * Formatting * Add test * Formatting * Add initial tests * Formatting * Try to fix formatting * Cleanup formatting * Formatting * Fix constructors on the same line * Fix tests * Formatting * Fix tidy issues * Fix tidy issues * Fix naming issue * Add onnx API to parse buffer * Formatting * Add arguments api * Formatting * Fix verify parameters * Fix cppcheck issues * Formatting * Add method to get output shapes and bytes * Formatting * Try formatting * Formatting * Improve the test coverage * Formatting * Add print method * Formatting * Fix cppcheck issue * Fix package dependency * change migraphx api to support multiple program outputs * clang format * change api implementation * clang format * fix a build error * change api for correct automatic generation * clang format * Add nolint * Try fix formatting * Formatting * formatting * formatting * Fix formatting * code cleanup * clang format * fix cppcheck error * fix review comments * clang format Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com> Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 05 Jun, 2019 3 commits