- 09 Nov, 2023 1 commit
-
-
Krzysztof Drewniak authored
This is a companion PR to https://github.com/ROCmSoftwarePlatform/rocMLIR/pull/1312 . The updated commit hash points onto that PR branch, so coordinated merges are advised. With the above rocMLIR changes, the MLIR MIGraphX dialect now represents both the dimensions and strides of tensors inside MLIR, thus allowing NHWC convolutions to be correctly offloaded. In this PR, we: - Remove special handling for the case where non-standard shapes become input to MLIR modules - Fold broadcast and multibroadcast operations into the input size od MLIR modules - Update tests - Add an extra TRACE_MLIR print to help debug crashes in the high-level pipeline
-
- 07 Nov, 2023 1 commit
-
-
Chris Austen authored
-
- 01 Nov, 2023 1 commit
-
-
Chris Austen authored
-
- 30 Oct, 2023 1 commit
-
-
Ahsan Saghir authored
-
- 14 Oct, 2023 1 commit
-
-
Chris Austen authored
-
- 13 Oct, 2023 1 commit
-
-
turneram authored
-
- 06 Oct, 2023 1 commit
-
-
Chris Austen authored
-
- 29 Sep, 2023 2 commits
-
-
Umang Yadav authored
add flags for ck, Enable CK with hipRTC. CK can be used with the MIGRAPHX_ENABLE_CK=1 and MIGRAPHX_TUNE_CK=1
-
Chris Austen authored
Enable MLIR performance enhancements with MIGRAPHX_ENABLE_MLIR=1
-
- 12 Aug, 2023 1 commit
-
-
Paul Fultz II authored
-
- 11 Aug, 2023 1 commit
-
-
Paul Fultz II authored
-
- 24 Jul, 2023 1 commit
-
-
Lakhinder Walia authored
Update ROCmSoftwarePlatform/half from 5.4.2 to 5.6.0
-
- 02 Jul, 2023 1 commit
-
-
Paul Fultz II authored
Add a CI job to test CK Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK Continue tuning even when there is invalid configs Fix a bug with parallel compilation not using all available threads Add additional test for gemms using half types Removed int32 as supported type since it doesnt pass our test suite
-
- 17 Jun, 2023 1 commit
-
-
turneram authored
* Add initial ck_gemm code * Format * Add additional src files * Format * Add include * Simplify fuse_ck * Format * Rename var * Enable pass * Update ck version * Fix include * Add group stride * Disable warnings for ck headers * Format * Add unpack array * Add interface to enable tuning * Format * Update compile_ops to handle tuning config * Format * Add some comments * Move time_op to migraphx_gpu * Add banchmarking * Refactor * Format * Add lift class macro * Use device name * Format * Generate configs * Format * Pass tuning parameter * Move data type to is_ck_gemm matcher * Format * Add problem_cache to avoid retuning same configs * Format * Format * Mark the problems * Format * Use is_null * Format * Resize vector * Only tune with exaustive tuning * Format * Use assert * FOrmat * Tidy fixes * More tidy fixes * Format * Add license to missing files * Format * Use transform * Format * Fix tidy * Format * Fix cppcheck issues * Format * Add static_assert * Add ops header * Add assertion in batcher * Format * Improve the batch fold check * Format * Add where op workaround for CK * Skip if any input is not a supported ck type * Format * Check batch is standard * Format * Remove redundant static keyword * Update commit hash * Fix error when running without --exhaustive-tune * Formatting * Formatting * Remove fuse_ck_gemm_softmax_gemm * Update ck hash * Correct spelling mistake * Remove commented out logic from fuse_ck * Remove unused include and add comment * Formatting * Remove redundant get_shape and remove ck_gemm from names * Formatting * Allow for mixed types with int8 gemms * Formatting * Add back find_package from merge * Update CK commit hash and add gfx940 to fuse_ops supported archs * Formatting * Update CK hash
-
- 08 Jun, 2023 1 commit
-
-
Paul Fultz II authored
Enable with MIGRAPHX_ENABLE_CK=1 and --exhaustive-tune tune flag
-
- 04 Mar, 2023 1 commit
-
-
Umang Yadav authored
resolve half info messages while building
-
- 21 Oct, 2022 1 commit
-
-
Umang Yadav authored
-
- 02 Aug, 2022 1 commit
-
-
jungpark-mlir authored
-
- 12 Jul, 2022 1 commit
-
-
Chris Austen authored
bitbucket needs a port that some servers do not make available. Move the Blaze dependency from a bitbucket to a github source repo.
-
- 22 Jun, 2022 1 commit
-
-
Ted Themistokleous authored
Updated each source file in the repo with the existing license.
-
- 26 May, 2021 1 commit
-
-
Paul Fultz II authored
* Add rbuild.ini file * Skip rbuild step * Use rbuild for gh runners * Update ini file * Invalidate cache * Use full path for ccache * Use master Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 11 May, 2021 1 commit
-
-
Shucai Xiao authored
* upgrade protobuf to 3.16.0 to be compatible with onnxruntime * change the link flag to fix a protobuf version conflict with ort
-
- 22 Apr, 2021 1 commit
-
-
Paul authored
-
- 09 Apr, 2021 1 commit
-
-
Paul Fultz II authored
* Fix tidy warnings for 4.1 * Formatting * Upgrade to 4.1 in docker * Remove hcc build and enable ubsan on clang debug * Add missing openmp package * Construct directly * Construct directly * Upgrade rocm-cmake version
-
- 17 Mar, 2021 1 commit
-
-
Paul authored
-
- 29 Jan, 2021 1 commit
-
-
Paul authored
-
- 08 Jan, 2021 1 commit
-
-
Paul Fultz II authored
* Add build and test github workflow * Fix cget command * Remove def-requirements.txt * Add tmate session to debug workflow * Run tmate session after installing dependencies * Print date periodically * Add clang tidy action * Seperate build and run container in two different jobs * Run bash script * Remove interactive flag * Try to mount the files * Try to use the github workspace * WIthout double braces * Use env variable * Pipe bash script in * Run using hip-clang * Use correct path * Add verbose * Remove j flag * Only run for onnx file to debug * Manually run clang-tidy * Remove quiet flag * Print header file * Printout environment * Remove extra defines * Remove fixits and config flag * Show ldd * Add tmate session * Run onnx protobuf first * Generate proto for tensorflow * Update cppcheck version * Fix some cppcheck issues * Add const * Cppcheck fixes * Formatting * Fix more cppcheck issues * Run two jobs * Cache analysis and run format checking * Fix yaml issues * Fix yaml issues * Fix indentation * Switch to hip-clang for main docker file * Use hip-clang in the readme * Fixes for jenkins * Use ccache to build * Combine file * Set restore keys * Change stage name * Build with ccache * Add missing dependency for ccache * Build debug with codecov * Fix workflow syntax * Fix list * Use quotes * Got to correct build path * Install lcov * Use sudo * Echo all commands * Setup tmate * Add verbose output * Build with cmake directly * Add pthread flag * Remove python config * Continue on error * Use on or off for cmake flag * Use always upload cache * Verbose output * Verbose output from build * Build one target * Reduce debug symbols * Increase garbage collection * Remove dmesg * Increase it to 20 * Update rocm cmake version * Remove jobs from jenkins * Run on all 3 ubuntus * Remove gcc 5 jobs * Dont add flag on 16.04 * Only upload coverage on 18.04 * Dont build for ubuntu 20.04 * Use matrix.os * Use O2 for hip-clang since lower optimizations are broken * Use rocm 3.0 * Pass ccache as cmake variable instead of env variable * Build miopen from source * Show ccache statistics * Print log information * Set compression level * Use hash dir * Set hashdir * Install clang ocl from system * Up compression level * Add locale * Increase cache size to 1G * Lower compression level to 9 * Remove split dwarf * Remove Og * Add back Og * Seperate debug and codecov * Add missing backlash * Garbage collect more often * Add missing locales package * Use Os * Install onednn in docker and run tests * Include target headers in tests * Increase timeout * Remove if condtion * Make flag public * Suppress memory leaks in onednn * Use equal * Add gh annotations * Update rocm-cmake version * Add ldconfig Co-authored-by:Shucai Xiao <shucai@gmail.com>
-
- 22 Oct, 2020 1 commit
-
-
Paul Fultz II authored
* Target multiple python versions * Fix overriding c++ standard * Add a cmake module for python * Remove python 3 stage * Suppress warnings * Formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 04 Oct, 2020 1 commit
-
-
Paul authored
-
- 13 Aug, 2020 1 commit
-
-
Paul Fultz II authored
* Add msgpack serialization * Formatting * Fix compiler errors on gcc * Remove redundant initialization * Use type to fix compile error with gcc 5 * Formatting * Add more tests * Change to migraphx throw * Fix message Co-authored-by:
Shucai Xiao <shucai.xiao@amd.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 10 Aug, 2020 1 commit
-
-
Shucai Xiao authored
* Add initial serialization * Formatting * Add unit tests * Formatting * Add tests for serialization * Formatting * Use or not and * Add value test * Formatting * Add more tests * Add shape serialization * Formatting * Add serializtion for literal and argument * Formatting * Serialize empty types * Formatting * Tidy fixes * Formatting * Fix tidy issues * Formatting * Reformat value type macro * Formatting * Handle enum types * Formatting * Add float_equal * Fix tidy issue * Use declval for better sfinae * Formatting * backup code change * Fix maro name * Add more test cases to improve coverage * Formatting * Add more tests * Formatting * Fix assertion * code backup * clang format * Fix bug with keyless assignment * Formatting * add implementation of converting json to migraphx::value * clang format * code backup * clang format * code backup * clang format * code backup * code refactor * add unit tests * separate json conversion from serialization * clang format * add the missing files * refine code * clang format * fix cppcheck error * clang format * fix review comments * clang format * code backup * fix cppcheck error * refine an implementation * clang format * fix review comments * clang format * fix review comments and add more unit tests * clang format * code backup to try * fix review comments and add more unit tests * clang format * some refinement * clang format * fix review comments * remove unnecessary code * add assignment of null Co-authored-by:
Paul <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 08 Jul, 2020 1 commit
-
-
kahmed10 authored
* initial progress * formatting * add pooling changes * formatting * change eliminate_pad * formatting * rename var * fomratting * update op shape test and compute * formatting * revert conv constructor * formatting * change initializer * formatting * fix tidy * change quant conv and shape check * add tests and fixes * formatting * fix type * fix conv test * formatting * add pooling and bn tests * formatting * add inconsistent attr tests * fix padding issue * formatting * progress on 1d to 2d * formatting * change compute and compile functions * formatting * fix duplicate * fix conflict * fix issue with 1d conv * formatting * add check for 3d limit * rename function * formatting * update to MIOPen 2.3 * add support for nd pooling * formatting * test miopen 2.4 * change function name * rename functions * formatting * add op_shape test * add gpu ops tests * formatting * add pkg-config * change functions * formatting * change to copy_backward * formatting * test diff miopen version * add pooling shape tests * temp disable test * revert to miopen 2.4 Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 06 Apr, 2020 1 commit
-
-
Paul Fultz II authored
* Upgrade compiler to hcc * Add missing perl package for hcc * Update hip and rocblas version * Add flag to speedup build * Remove symlink * Add no-gpu-rdc to linking * Add missing define * Disable no unused argument * Suppress warning * Add definition to all targets * Fix compiler flags * Refer to rocblas
-
- 24 Feb, 2020 1 commit
-
-
Shucai Xiao authored
-
- 05 Nov, 2019 1 commit
-
-
Paul authored
-
- 04 Sep, 2019 1 commit
-
-
Paul authored
-
- 12 Jul, 2019 1 commit
-
-
Shucai Xiao authored
-
- 11 Jul, 2019 2 commits
- 07 Jun, 2019 1 commit
-
-
Paul authored
-