- 29 Mar, 2022 1 commit
-
-
Chris Austen authored
modify CI temporarily to stop using Navi hardware
-
- 05 Nov, 2021 1 commit
-
-
kahmed10 authored
Move our Dockerfile from ROCm 4.3 to 4.5. Add Navi-based GPUs to the CI infrastructure
-
- 28 Sep, 2021 1 commit
-
-
Paul Fultz II authored
No longer avoid dependency problems and install the half package
-
- 26 Jul, 2021 1 commit
-
-
Paul authored
-
- 25 Jul, 2021 1 commit
-
-
Paul authored
-
- 29 Apr, 2021 1 commit
-
-
SJW authored
* MLIR MIOpen Dialect integration (phase 1) (#768) * Added Findmlir.cmake (using environment variables to import) * Added mlir_conv pass to GPU target * Apply to any gpu::convolution if supported by MLIR * Call MLIR C-API to generate iGEMM kernel with configuration from gpu::convolution * Capture binary in dictionary for matching convolutions * Build a code_object_op with the binary and execution dimensions * Substitute for the gpu::convolution * Changed the parameters for the code_object to reflect the generated MLIR kernel * Expanded out MemRefDescriptor fields in param list * Also updated for MLIR C-API changes * fixed global_size calculation * Added command line option: --enable_mlir * fixed command line switch * updated for new MLIR API changes * Added cget llvm-project-mlir to import MIIR API libraries into Dockerfile * removed cmake Findmlir * updated for changes in MIIR C-API * updated CMakeLists.txt to allow disable of MLIR import * fixed memory leaks and removed copies * updated for 5D memrefs * formatting * fixed review comments * fixed merge issues * hip gcnDeviceName now includes specifiers at the end * use major/minor values instead * disable MLIR by default * removed command-line switch --enable-mlir * fix unused when MLIR disabled * enable jenkins enable/test MLIR * format * fixed clang-tidy * added new type 
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 09 Apr, 2021 1 commit
-
-
Paul Fultz II authored
* Fix tidy warnings for 4.1 * Formatting * Upgrade to 4.1 in docker * Remove hcc build and enable ubsan on clang debug * Add missing openmp package * Construct directly * Construct directly * Upgrade rocm-cmake version
-
- 08 Jan, 2021 1 commit
-
-
Paul Fultz II authored
* Add build and test github workflow * Fix cget command * Remove def-requirements.txt * Add tmate session to debug workflow * Run tmate session after installing dependencies * Print date periodically * Add clang tidy action * Separate build and run container in two different jobs * Run bash script * Remove interactive flag * Try to mount the files * Try to use the github workspace * Without double braces * Use env variable * Pipe bash script in * Run using hip-clang * Use correct path * Add verbose * Remove j flag * Only run for onnx file to debug * Manually run clang-tidy * Remove quiet flag * Print header file * Printout environment * Remove extra defines * Remove fixits and config flag * Show ldd * Add tmate session * Run onnx protobuf first * Generate proto for tensorflow * Update cppcheck version * Fix some cppcheck issues * Add const * Cppcheck fixes * Formatting * Fix more cppcheck issues * Run two jobs * Cache analysis and run format checking * Fix yaml issues * Fix yaml issues * Fix indentation * Switch to hip-clang for main docker file * Use hip-clang in the readme * Fixes for jenkins * Use ccache to build * Combine file * Set restore keys * Change stage name * Build with ccache * Add missing dependency for ccache * Build debug with codecov * Fix workflow syntax * Fix list * Use quotes * Go to correct build path * Install lcov * Use sudo * Echo all commands * Setup tmate * Add verbose output * Build with cmake directly * Add pthread flag * Remove python config * Continue on error * Use on or off for cmake flag * Use always upload cache * Verbose output * Verbose output from build * Build one target * Reduce debug symbols * Increase garbage collection * Remove dmesg * Increase it to 20 * Update rocm cmake version * Remove jobs from jenkins * Run on all 3 ubuntus * Remove gcc 5 jobs * Dont add flag on 16.04 * Only upload coverage on 18.04 * Dont build for ubuntu 20.04 * Use matrix.os * Use O2 for hip-clang since lower optimizations are broken * Use rocm 3.0 * Pass ccache as cmake variable instead of env variable * Build miopen from source * Show ccache statistics * Print log information * Set compression level * Use hash dir * Set hashdir * Install clang ocl from system * Up compression level * Add locale * Increase cache size to 1G * Lower compression level to 9 * Remove split dwarf * Remove Og * Add back Og * Separate debug and codecov * Add missing backslash * Garbage collect more often * Add missing locales package * Use Os * Install onednn in docker and run tests * Include target headers in tests * Increase timeout * Remove if condition * Make flag public * Suppress memory leaks in onednn * Use equal * Add gh annotations * Update rocm-cmake version * Add ldconfig
Co-authored-by: Shucai Xiao <shucai@gmail.com>
-
- 14 Dec, 2020 1 commit
-
-
Paul Fultz II authored
* Add flag to enable cpu backend * Make buffers shared * Enable optimizations * Add onednn * Formatting * Formatting * Add dnnl header * Formatting * Rewrite rnn first * Formatting * Call reference implementation * Formatting * Make literal data shared * Formatting * Add convolution * Formatting * Compensate for dilation * Formatting * Use name/make_op instead * Formatting * Rename gemm header * Formatting * Add dnnl convolution/gemm operators * Formatting * Add eliminate_contiguous * Add faster pointwise operators * Formatting * Formatting * Formatting * Add dnnl op class * Formatting * Add add op * Formatting * Add concat operator * Formatting * Add more ops * Create descriptor during finalization * Formatting * Dont rewrite pooling * Enable memory coloring * Formatting * Add output aliases * Formatting * Fix errors * Formatting * Convert literals * Add missing file * Remove batch_norm * Formatting * Use strides * Formatting * Add some debug checks * Formatting * Fix bug in adjusting shape for gemm * Formatting * Fix fallback dot operator * Zero initialize buffers * Add support for group convolutions * Formatting * Make adjust allocation target independent * Formatting * Enable adjust_allocation for gpu/cpu * Formatting * Add copy to allocation model * Formatting * Add copy operator * Formatting * Better handling of output parameters in adjust_allocation * Formatting * Build with dnnl * Make dnnl required * Fix compile error * Tidy fixes * Formatting * Tidy fixes * Formatting * Fix more tidy issues * Formatting * Add mul op * Add mul op * Set c compiler to clang as well * Compensate for normalized compute shape * Formatting * Fix cppcheck errors * Formatting * Add onednn library to hcc * Guard clang pragmas * Disable cpu mode for gcc for now * Leave it enabled for gcc 7 * Fix cppcheck suppression * Fix compile error on gcc 5 * Remove unused code
Co-authored-by: Shucai Xiao <shucai.xiao@amd.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 25 Nov, 2020 1 commit
-
-
Paul Fultz II authored
* Fix issue with module wrapper * Formatting * Remove custom build of python 3.6 * Fix print function in tests * Formatting * Install onnx * Download the models * Add gcc 5 * Boost time limit for now * Try to use oauth instead of app * Disable all warnings
-
- 18 Nov, 2020 2 commits
- 10 Nov, 2020 1 commit
-
-
Paul Fultz II authored
* Add flag to enable cpu backend * Make buffers shared * Enable optimizations * Formatting * Enable cpu backend for gcc builds
-
- 22 Oct, 2020 1 commit
-
-
Paul Fultz II authored
* Target multiple python versions * Fix overriding c++ standard * Add a cmake module for python * Remove python 3 stage * Suppress warnings * Formatting
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 07 Oct, 2020 1 commit
-
-
Paul authored
-
- 03 Oct, 2020 2 commits
- 02 Oct, 2020 1 commit
-
-
Paul authored
-
- 01 Oct, 2020 1 commit
-
-
Paul authored
-
- 30 Sep, 2020 1 commit
-
-
Paul Fultz II authored
* Make global variables const * Tidy fixes * Disable some lints * Formatting * Fix tidy const * Formatting * Add missing const keywords * Formatting * More fixes * Fix remaining tidy issues * Formatting * Fix rocblas function call * Formatting * Fix nodiscard warnings * Formatting * Use named parameters * Remove overload * Add overload * Remove noncps * Use named param for node * Add auto register header * Use named parameters * Refactor jenkinsfile * Fix shadow * Add missing body variable * Add more const methods * Add hip-clang docker builds * Remove comments * Add clang-format * Add more const * Formatting * Rename stage * Disable check * Add another const * Add python 2 dev packages * Add sphinx to dockerfile
-
- 09 Jul, 2020 1 commit
-
-
Paul Fultz II authored
* Initial cpu conv-nd * Formatting * Make index signed * Formatting * Assert the indices are greater than 0 * Use equal instead of lexicographical_compare * Formatting * Fix tidy errors * Formatting * Handle different types * Formatting * Fix nested visits * Formatting * Add 3d conv test * Formatting * revert unnecessary changes * remove a print line * Fix ICE * Formatting * Use absolute path
Co-authored-by: Shucai Xiao <shucai.xiao@amd.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 18 Jun, 2020 1 commit
-
-
Paul Fultz II authored
* Build and test onnxrt * Add sudo command * Add sudo * Add pkgconfig * Make root user * Move unstash out * Remove noncps * Add NonCPS back * Remove all noncps * Use each method * Move unstash command * Unstash before * Move stash command up * Move unstash to noncps function * Remove noncps * Use a function to unstash * Remove call to unused function * Change order of args * Add another rocmtestnode overload * List files * Use capital R * Search in build directory * Use force * Use newer cmake with onnx * Install requirements * Print out pip list * Install pip3 * Add cxxflags for hip * Generate locale * Install wheel with pip3 * Disable pip installation * Disable build wheel
-
- 16 Apr, 2020 1 commit
-
-
Paul Fultz II authored
* Make contiguous preserve scalar shape * Formatting * Improve standard shape calculation * Formatting * Enable some optimizations on debug build * Up optimization level * Remove debug symbols
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 26 Feb, 2020 1 commit
-
-
Paul authored
-
- 24 Feb, 2020 1 commit
-
-
Paul authored
-
- 02 Oct, 2019 1 commit
-
-
kahmed10 authored
* test hook * fix format * fix format * ignore doc dir * fix regex * fix jenkins error * exclude another dir * formatting test_array * fix version of yapf * test hook * formatting * reinclude dirs
-
- 30 Sep, 2019 1 commit
-
-
Paul authored
-
- 16 Aug, 2019 2 commits
- 15 Mar, 2019 1 commit
-
-
Paul authored
-
- 14 Mar, 2019 2 commits
- 18 Feb, 2019 1 commit
-
-
Paul authored
-
- 08 Feb, 2019 1 commit
-
-
Paul authored
-
- 06 Feb, 2019 1 commit
-
-
Paul authored
-
- 05 Feb, 2019 1 commit
-
-
Paul authored
-
- 18 Jan, 2019 3 commits
- 11 Dec, 2018 1 commit
-
-
Paul authored
-