- 01 Nov, 2023 1 commit
-
-
Chris Austen authored
-
- 30 Oct, 2023 1 commit
-
-
Ahsan Saghir authored
-
- 14 Oct, 2023 1 commit
-
-
Chris Austen authored
-
- 13 Oct, 2023 1 commit
-
-
turneram authored
-
- 06 Oct, 2023 1 commit
-
-
Chris Austen authored
-
- 29 Sep, 2023 2 commits
-
-
Umang Yadav authored
add flags for ck, Enable CK with hipRTC. CK can be used with the MIGRAPHX_ENABLE_CK=1 and MIGRAPHX_TUNE_CK=1
-
Chris Austen authored
Enable MLIR performance enhancements with MIGRAPHX_ENABLE_MLIR=1
-
- 12 Aug, 2023 1 commit
-
-
Paul Fultz II authored
-
- 11 Aug, 2023 1 commit
-
-
Paul Fultz II authored
-
- 24 Jul, 2023 1 commit
-
-
Lakhinder Walia authored
Update ROCmSoftwarePlatform/half from 5.4.2 to 5.6.0
-
- 02 Jul, 2023 1 commit
-
-
Paul Fultz II authored
Add a CI job to test CK Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK Continue tuning even when there is invalid configs Fix a bug with parallel compilation not using all available threads Add additional test for gemms using half types Removed int32 as supported type since it doesnt pass our test suite
-
- 17 Jun, 2023 1 commit
-
-
turneram authored
* Add initial ck_gemm code * Format * Add additional src files * Format * Add include * Simplify fuse_ck * Format * Rename var * Enable pass * Update ck version * Fix include * Add group stride * Disable warnings for ck headers * Format * Add unpack array * Add interface to enable tuning * Format * Update compile_ops to handle tuning config * Format * Add some comments * Move time_op to migraphx_gpu * Add banchmarking * Refactor * Format * Add lift class macro * Use device name * Format * Generate configs * Format * Pass tuning parameter * Move data type to is_ck_gemm matcher * Format * Add problem_cache to avoid retuning same configs * Format * Format * Mark the problems * Format * Use is_null * Format * Resize vector * Only tune with exaustive tuning * Format * Use assert * FOrmat * Tidy fixes * More tidy fixes * Format * Add license to missing files * Format * Use transform * Format * Fix tidy * Format * Fix cppcheck issues * Format * Add static_assert * Add ops header * Add assertion in batcher * Format * Improve the batch fold check * Format * Add where op workaround for CK * Skip if any input is not a supported ck type * Format * Check batch is standard * Format * Remove redundant static keyword * Update commit hash * Fix error when running without --exhaustive-tune * Formatting * Formatting * Remove fuse_ck_gemm_softmax_gemm * Update ck hash * Correct spelling mistake * Remove commented out logic from fuse_ck * Remove unused include and add comment * Formatting * Remove redundant get_shape and remove ck_gemm from names * Formatting * Allow for mixed types with int8 gemms * Formatting * Add back find_package from merge * Update CK commit hash and add gfx940 to fuse_ops supported archs * Formatting * Update CK hash
-
- 08 Jun, 2023 1 commit
-
-
Paul Fultz II authored
Enable with MIGRAPHX_ENABLE_CK=1 and --exhaustive-tune tune flag
-
- 04 Mar, 2023 1 commit
-
-
Umang Yadav authored
resolve half info messages while building
-
- 21 Oct, 2022 1 commit
-
-
Umang Yadav authored
-
- 02 Aug, 2022 1 commit
-
-
jungpark-mlir authored
-
- 12 Jul, 2022 1 commit
-
-
Chris Austen authored
bitbucket needs a port that some servers do not make available. Move the Blaze dependency from a bitbucket to a github source repo.
-
- 22 Jun, 2022 1 commit
-
-
Ted Themistokleous authored
Updated each source file in the repo with the existing license.
-
- 26 May, 2021 1 commit
-
-
Paul Fultz II authored
* Add rbuild.ini file * Skip rbuild step * Use rbuild for gh runners * Update ini file * Invalidate cache * Use full path for ccache * Use master Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 11 May, 2021 1 commit
-
-
Shucai Xiao authored
* upgrade protobuf to 3.16.0 to be compatible with onnxruntime * change the link flag to fix a protobuf version conflict with ort
-
- 22 Apr, 2021 1 commit
-
-
Paul authored
-
- 09 Apr, 2021 1 commit
-
-
Paul Fultz II authored
* Fix tidy warnings for 4.1 * Formatting * Upgrade to 4.1 in docker * Remove hcc build and enable ubsan on clang debug * Add missing openmp package * Construct directly * Construct directly * Upgrade rocm-cmake version
-
- 17 Mar, 2021 1 commit
-
-
Paul authored
-
- 29 Jan, 2021 1 commit
-
-
Paul authored
-
- 08 Jan, 2021 1 commit
-
-
Paul Fultz II authored
* Add build and test github workflow * Fix cget command * Remove def-requirements.txt * Add tmate session to debug workflow * Run tmate session after installing dependencies * Print date periodically * Add clang tidy action * Seperate build and run container in two different jobs * Run bash script * Remove interactive flag * Try to mount the files * Try to use the github workspace * WIthout double braces * Use env variable * Pipe bash script in * Run using hip-clang * Use correct path * Add verbose * Remove j flag * Only run for onnx file to debug * Manually run clang-tidy * Remove quiet flag * Print header file * Printout environment * Remove extra defines * Remove fixits and config flag * Show ldd * Add tmate session * Run onnx protobuf first * Generate proto for tensorflow * Update cppcheck version * Fix some cppcheck issues * Add const * Cppcheck fixes * Formatting * Fix more cppcheck issues * Run two jobs * Cache analysis and run format checking * Fix yaml issues * Fix yaml issues * Fix indentation * Switch to hip-clang for main docker file * Use hip-clang in the readme * Fixes for jenkins * Use ccache to build * Combine file * Set restore keys * Change stage name * Build with ccache * Add missing dependency for ccache * Build debug with codecov * Fix workflow syntax * Fix list * Use quotes * Got to correct build path * Install lcov * Use sudo * Echo all commands * Setup tmate * Add verbose output * Build with cmake directly * Add pthread flag * Remove python config * Continue on error * Use on or off for cmake flag * Use always upload cache * Verbose output * Verbose output from build * Build one target * Reduce debug symbols * Increase garbage collection * Remove dmesg * Increase it to 20 * Update rocm cmake version * Remove jobs from jenkins * Run on all 3 ubuntus * Remove gcc 5 jobs * Dont add flag on 16.04 * Only upload coverage on 18.04 * Dont build for ubuntu 20.04 * Use matrix.os * Use O2 for hip-clang since lower optimizations are broken * Use rocm 3.0 * Pass ccache as cmake variable instead of env variable * Build miopen from source * Show ccache statistics * Print log information * Set compression level * Use hash dir * Set hashdir * Install clang ocl from system * Up compression level * Add locale * Increase cache size to 1G * Lower compression level to 9 * Remove split dwarf * Remove Og * Add back Og * Seperate debug and codecov * Add missing backlash * Garbage collect more often * Add missing locales package * Use Os * Install onednn in docker and run tests * Include target headers in tests * Increase timeout * Remove if condtion * Make flag public * Suppress memory leaks in onednn * Use equal * Add gh annotations * Update rocm-cmake version * Add ldconfig Co-authored-by:Shucai Xiao <shucai@gmail.com>
-
- 22 Oct, 2020 1 commit
-
-
Paul Fultz II authored
* Target multiple python versions * Fix overriding c++ standard * Add a cmake module for python * Remove python 3 stage * Suppress warnings * Formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 04 Oct, 2020 1 commit
-
-
Paul authored
-
- 13 Aug, 2020 1 commit
-
-
Paul Fultz II authored
* Add msgpack serialization * Formatting * Fix compiler errors on gcc * Remove redundant initialization * Use type to fix compile error with gcc 5 * Formatting * Add more tests * Change to migraphx throw * Fix message Co-authored-by:
Shucai Xiao <shucai.xiao@amd.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 10 Aug, 2020 1 commit
-
-
Shucai Xiao authored
* Add initial serialization * Formatting * Add unit tests * Formatting * Add tests for serialization * Formatting * Use or not and * Add value test * Formatting * Add more tests * Add shape serialization * Formatting * Add serializtion for literal and argument * Formatting * Serialize empty types * Formatting * Tidy fixes * Formatting * Fix tidy issues * Formatting * Reformat value type macro * Formatting * Handle enum types * Formatting * Add float_equal * Fix tidy issue * Use declval for better sfinae * Formatting * backup code change * Fix maro name * Add more test cases to improve coverage * Formatting * Add more tests * Formatting * Fix assertion * code backup * clang format * Fix bug with keyless assignment * Formatting * add implementation of converting json to migraphx::value * clang format * code backup * clang format * code backup * clang format * code backup * code refactor * add unit tests * separate json conversion from serialization * clang format * add the missing files * refine code * clang format * fix cppcheck error * clang format * fix review comments * clang format * code backup * fix cppcheck error * refine an implementation * clang format * fix review comments * clang format * fix review comments and add more unit tests * clang format * code backup to try * fix review comments and add more unit tests * clang format * some refinement * clang format * fix review comments * remove unnecessary code * add assignment of null Co-authored-by:
Paul <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 08 Jul, 2020 1 commit
-
-
kahmed10 authored
* initial progress * formatting * add pooling changes * formatting * change eliminate_pad * formatting * rename var * fomratting * update op shape test and compute * formatting * revert conv constructor * formatting * change initializer * formatting * fix tidy * change quant conv and shape check * add tests and fixes * formatting * fix type * fix conv test * formatting * add pooling and bn tests * formatting * add inconsistent attr tests * fix padding issue * formatting * progress on 1d to 2d * formatting * change compute and compile functions * formatting * fix duplicate * fix conflict * fix issue with 1d conv * formatting * add check for 3d limit * rename function * formatting * update to MIOPen 2.3 * add support for nd pooling * formatting * test miopen 2.4 * change function name * rename functions * formatting * add op_shape test * add gpu ops tests * formatting * add pkg-config * change functions * formatting * change to copy_backward * formatting * test diff miopen version * add pooling shape tests * temp disable test * revert to miopen 2.4 Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 06 Apr, 2020 1 commit
-
-
Paul Fultz II authored
* Upgrade compiler to hcc * Add missing perl package for hcc * Update hip and rocblas version * Add flag to speedup build * Remove symlink * Add no-gpu-rdc to linking * Add missing define * Disable no unused argument * Suppress warning * Add definition to all targets * Fix compiler flags * Refer to rocblas
-
- 24 Feb, 2020 1 commit
-
-
Shucai Xiao authored
-
- 05 Nov, 2019 1 commit
-
-
Paul authored
-
- 04 Sep, 2019 1 commit
-
-
Paul authored
-
- 12 Jul, 2019 1 commit
-
-
Shucai Xiao authored
-
- 11 Jul, 2019 2 commits
- 07 Jun, 2019 1 commit
-
-
Paul authored
-
- 05 Jun, 2019 2 commits