- 25 Aug, 2020 1 commit
-
-
Paul Fultz II authored
* Use increment instead of division to compute register offset * Formatting * Limit layernorm to 1024 elements * Formatting * Add verification to driver * Formatting * Remove early return * Use block_size 256 * Vectorize the kernel * Formatting * Convert to vector type * Add layernorm tests * Formatting * Formatting * Refactor layernorm to run both algos * Formatting * Fix compile error * Fix tidy warnings * Formatting * Add layernorm function * Formatting
-
- 18 Aug, 2020 1 commit
-
-
Paul Fultz II authored
* Register ops for main migraphx * Formatting * Register cpu ops * Formatting * Show list of operators in the driver * Formatting * Simplify regiter * Try to register gpu ops * Fix compiler errors * Register rest of the gpu operators * Add some tests * Formatting * Fix gcc compiler warnings * Formatting * Fix tidy warnings * Fix compile error * Use correct op name * Register layer norm * Use const ref * Make run const
-
- 06 Aug, 2020 1 commit
-
-
kahmed10 authored
Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com> Co-authored-by:
Shucai Xiao <shucai.xiao@amd.com>
-
- 21 Jul, 2020 1 commit
-
-
Paul Fultz II authored
* Fix bug in eliminate_concat with negative axis * Formatting * Fix unused parameter * Formatting Co-authored-by:Shucai Xiao <shucai.xiao@amd.com>
-
- 07 May, 2020 1 commit
-
-
Paul Fultz II authored
* Add skip unknown operators flag * Formatting * Add flag to print program on error * Formatting * Fix compile error in py * Formatting * Workaround cppcheck error * Initialize with struct * Formatting * Disable warning * Formatting * Add test for print errors * Formatting * Formatting * Fix compiler error * Formatting * Formatting * Formatting * Use correct map * Formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 19 Dec, 2019 1 commit
-
-
kahmed10 authored
* initial testing * add shape op * formatting * add env variable for batch sizes * formatting * progress on driver * progress on driver * cleanup * cleanup * add and modified prev tests * formatting * remove comment * add shape op test * formatting * manually insert shape op in test * formatting * create options struct for parsers * formatting * Add documentation for python * Fix c++ documentaion * add documentation to parser * formatting * add argmin and tests * fix doc and definitions * formatting * revert test functions * formatting Co-authored-by:Paul Fultz II <pfultz2@yahoo.com>
-
- 27 Nov, 2019 1 commit
-
-
Paul Fultz II authored
* Add experimental support for c++ output * Format * Fix syntax errors * Add resnet50 model * Formatting * Add inceptionv3 model * Formatting * Add alexnet * Formatting * Fix name of pooling mode * Formatting * Fix tidy issues * Ignore driver directory * Show accetable values
-
- 15 Nov, 2019 1 commit
-
-
Paul Fultz II authored
* Add compiler options * Add copy operators * Formatting * Use run_passes in tests * Formatting * Use run_pass in schedule test * Formatting * Add compile_options to get_passes in target * Formatting * Offload copy option * Formatting * Copy using pinned memory * Formatting * Improve performance of gpu copying * Formatting * Dont copy * Formatting * Always make an extra copy * Formatting * Remove unused write op * Add missing include * Remove copy_to_gpu function in python api * Make offload copy disabled by default on C++ * Formatting * Fix tidy issues * Formatting * Fix namespace * Fix python tests * Turn clang format off since its broken * Fix compile error on gcc 5 * Remove commented code
-
- 21 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Print graph from the driver * Formatting
-
- 16 Sep, 2019 1 commit
-
-
Paul Fultz II authored
* Add flags to quantize in driver * Formatting * Fix compile error
-
- 20 Aug, 2019 2 commits
- 19 Aug, 2019 2 commits
- 02 Jul, 2019 2 commits
- 06 Jun, 2019 3 commits
- 05 Jun, 2019 16 commits