- 08 Jan, 2021 1 commit
-
-
Paul Fultz II authored
* Add build and test github workflow * Fix cget command * Remove def-requirements.txt * Add tmate session to debug workflow * Run tmate session after installing dependencies * Print date periodically * Add clang tidy action * Seperate build and run container in two different jobs * Run bash script * Remove interactive flag * Try to mount the files * Try to use the github workspace * WIthout double braces * Use env variable * Pipe bash script in * Run using hip-clang * Use correct path * Add verbose * Remove j flag * Only run for onnx file to debug * Manually run clang-tidy * Remove quiet flag * Print header file * Printout environment * Remove extra defines * Remove fixits and config flag * Show ldd * Add tmate session * Run onnx protobuf first * Generate proto for tensorflow * Update cppcheck version * Fix some cppcheck issues * Add const * Cppcheck fixes * Formatting * Fix more cppcheck issues * Run two jobs * Cache analysis and run format checking * Fix yaml issues * Fix yaml issues * Fix indentation * Switch to hip-clang for main docker file * Use hip-clang in the readme * Fixes for jenkins * Use ccache to build * Combine file * Set restore keys * Change stage name * Build with ccache * Add missing dependency for ccache * Build debug with codecov * Fix workflow syntax * Fix list * Use quotes * Got to correct build path * Install lcov * Use sudo * Echo all commands * Setup tmate * Add verbose output * Build with cmake directly * Add pthread flag * Remove python config * Continue on error * Use on or off for cmake flag * Use always upload cache * Verbose output * Verbose output from build * Build one target * Reduce debug symbols * Increase garbage collection * Remove dmesg * Increase it to 20 * Update rocm cmake version * Remove jobs from jenkins * Run on all 3 ubuntus * Remove gcc 5 jobs * Dont add flag on 16.04 * Only upload coverage on 18.04 * Dont build for ubuntu 20.04 * Use matrix.os * Use O2 for hip-clang since lower optimizations are broken * Use rocm 3.0 * Pass ccache as cmake variable instead of env variable * Build miopen from source * Show ccache statistics * Print log information * Set compression level * Use hash dir * Set hashdir * Install clang ocl from system * Up compression level * Add locale * Increase cache size to 1G * Lower compression level to 9 * Remove split dwarf * Remove Og * Add back Og * Seperate debug and codecov * Add missing backlash * Garbage collect more often * Add missing locales package * Use Os * Install onednn in docker and run tests * Include target headers in tests * Increase timeout * Remove if condtion * Make flag public * Suppress memory leaks in onednn * Use equal * Add gh annotations * Update rocm-cmake version * Add ldconfig Co-authored-by:Shucai Xiao <shucai@gmail.com>
-
- 04 Nov, 2020 1 commit
-
-
Paul Fultz II authored
* Add all_targets cmake target * Rename target * Add ref target * Rename tests * Refactor compiler target * Formatting * Verify for every target * Formatting * Add verify test suite * Formatting * Add initial test programs * Formatting * Add rnn tests * Formatting * Validate gpu * Formatting * Remove old gpu tests * Fix gpu tests * Fix ref error * Fix tidy issues * Formatting * Tidy fixes * Fix header in python api * Rename to ref * Use ref in verify_onnx * Fix tidy issue * Build with verbose on * Fix typo * Remove verbose * rename some cpu prefix to ref Co-authored-by:Shucai Xiao <Shucai.Xiao@amd.com>
-
- 10 Sep, 2020 1 commit
-
-
Paul Fultz II authored
* Add save/load functions * Formatting * Add loading and saving to the driver * Formatting * Add return * Serialize the context with the program * Formatting * Add python API * Formatting * Add c/c++ apis * Formatting * Add tests * Formatting * Fix tidy error * Fix python doc * Restore python code * Add function name to errors * Formatting * Use lvalue for writing * Serialize context * Fix convolution and pooling operator for miopen * Formatting * Add const ref * Set target name to gpu * Add target tests * Formatting * Move register target to cpp file * Fix target test * Use make_target in driver * Formatting * Use make_target for the API * Formatting * Add cpu include * Increase timeout * Add more tests * Formatting Co-authored-by:
Shucai Xiao <shucai.xiao@amd.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 26 Aug, 2020 1 commit
-
-
Paul Fultz II authored
* Add make_op function * Formatting * Add more values * Formatting * Remove templates parse_conv functions * Formatting * Remove mat_mul template * Formatting * Reduce header includes * Fix compiling for gpu * Formatting * Use make_op in lowering * Formatting * Sort lines * Formatting * Add more tests * Formatting * Fix tidy error * Formatting * Add const refs * Add explicit this * Add more const refs * Sort the program * Remove commented out code * Formatting * Infer gpu prefix * Formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 07 Feb, 2020 1 commit
-
-
Paul Fultz II authored
* Add initial api * Formatting * Add more api * Formatting * Add auto api generation * Formatting * Fix some compilation errors * Change handle struct * Formatting * Fix reamining compilation errors * Formatting * Simplify using ctype * Formatting * Initial c++ generation * Formatting * Add C++header * Formatting * Add test * Formatting * Add initial tests * Formatting * Try to fix formatting * Cleanup formatting * Formatting * Fix constructors on the same line * Fix tests * Formatting * Fix tidy issues * Fix tidy issues * Fix naming issue * Add onnx API to parse buffer * Formatting * Add arguments api * Formatting * Fix verify parameters * Fix cppcheck issues * Formatting * Add method to get output shapes and bytes * Formatting * Try formatting * Formatting * Improve the test coverage * Formatting * Add print method * Formatting * Fix cppcheck issue * Fix package dependency * Add nolint * Try fix formatting * Formatting * formatting * formatting * Fix formatting Co-authored-by:
Shucai Xiao <shucai.xiao@amd.com> Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 20 Jun, 2019 1 commit
-
-
Khalique authored
-
- 17 Feb, 2019 2 commits
-
-
Shucai Xiao authored
-
Shucai Xiao authored
-
- 08 Feb, 2019 1 commit
-
-
Paul authored
-
- 04 Feb, 2019 2 commits
- 03 Feb, 2019 1 commit
-
-
Khalique authored
-
- 27 Nov, 2018 1 commit
-
-
Paul authored
-
- 14 Nov, 2018 1 commit
-
-
Paul authored
-
- 29 Oct, 2018 1 commit
-
-
Paul authored
-
- 26 Oct, 2018 2 commits
- 24 Aug, 2018 1 commit
-
-
Paul authored
-
- 23 Aug, 2018 1 commit
-
-
Paul authored
-
- 21 Aug, 2018 1 commit
-
-
Paul authored
-
- 31 Jul, 2018 1 commit
-
-
Paul authored
-
- 27 Jul, 2018 1 commit
-
-
Scott Thornton authored
-
- 25 Jul, 2018 2 commits
- 10 Jul, 2018 1 commit
-
-
Paul authored
-
- 02 Jul, 2018 1 commit
-
-
Paul authored
-
- 21 Jun, 2018 2 commits
- 11 Jun, 2018 1 commit
-
-
Paul authored
-
- 09 Jun, 2018 1 commit
-
-
Paul authored
-
- 08 Jun, 2018 9 commits
- 06 Jun, 2018 1 commit
-
-
Scott Thornton authored
-