- 28 Mar, 2023 2 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
- 21 Mar, 2023 1 commit
-
-
Charlie Lin authored
Refactor select_module to use output parameters. Disable select_module verify tests on CPU.
-
- 18 Mar, 2023 1 commit
-
-
Umang Yadav authored
Fixes #1595
-
- 13 Mar, 2023 1 commit
-
-
Manupa Karunaratne authored
* [MLIR] Adds a runtime switch to trigger MLIR
-
- 10 Mar, 2023 2 commits
-
-
Paul Fultz II authored
-
Paul Fultz II authored
-
- 01 Mar, 2023 1 commit
-
-
Charlie Lin authored
Add additional documentation to explain the passes.
-
- 28 Feb, 2023 1 commit
-
-
Charlie Lin authored
Creates the select_module operator, which selects one of the submodules passed to it to run based on the submodule parameters. A submodule is selected when its parameters have exactly the same static shapes as the arguments to select_module.
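The shape-matching selection described in this commit message can be sketched roughly as follows. This is a minimal illustration in plain Python, not the MIGraphX implementation: the function `select_module`, the dictionary layout, and the example shapes are all hypothetical stand-ins chosen for the sketch.

```python
from typing import Callable, Dict, Sequence, Tuple

Shape = Tuple[int, ...]
Signature = Tuple[Shape, ...]


def select_module(
    submodules: Dict[Signature, Callable[[], str]],
    arg_shapes: Sequence[Sequence[int]],
) -> Callable[[], str]:
    """Return the submodule whose parameter shapes exactly match arg_shapes."""
    # Normalize the argument shapes into a hashable key.
    key = tuple(tuple(shape) for shape in arg_shapes)
    if key not in submodules:
        raise LookupError(f"no submodule with parameter shapes {key}")
    return submodules[key]


# Hypothetical submodules, keyed by the static shapes of their parameters.
submodules = {
    ((1, 3, 224, 224),): lambda: "batch_1",
    ((4, 3, 224, 224),): lambda: "batch_4",
}

chosen = select_module(submodules, [[4, 3, 224, 224]])
print(chosen())  # prints "batch_4"
```

Keying on the full tuple of shapes means a partial match never selects a submodule, which mirrors the "exact same static shapes" requirement in the message.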
-
- 23 Feb, 2023 1 commit
-
-
shivadbhavsar authored
-
- 22 Feb, 2023 1 commit
-
-
Alan Turner authored
-
- 17 Feb, 2023 2 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
- 16 Feb, 2023 3 commits
-
-
Paul Fultz II authored
Avoids double global loads. Strided loops are unrolled, which lets results be stored in an array that the compiler keeps in registers because the index access is constant. Updated to handle large reductions, which gives a better Stable Diffusion result.
-
Umang Yadav authored
* deprecate HCC
-
Umang Yadav authored
* Add driver flag "--exhaustive-tune" to enable tuning, and add support for it in the C/C++ and Python APIs
-
- 14 Feb, 2023 1 commit
-
-
shivadbhavsar authored
Currently, we default to device 0 when loading programs. This updates loading to use hipGetDevice to set the device for the loaded program.
-
- 10 Feb, 2023 1 commit
-
-
Umang Yadav authored
-
- 07 Feb, 2023 2 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
- 06 Feb, 2023 1 commit
-
-
Paul Fultz II authored
* Fuse layernorm with different patterns
* Only match when using the last axis
Co-authored-by: kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 03 Feb, 2023 5 commits
- 01 Feb, 2023 1 commit
-
-
Alan Turner authored
-
- 31 Jan, 2023 3 commits
-
-
Paul authored
-
Umang Yadav authored
Added CMake flag MIGRAPHX_USE_HIPRTC for hipRTC. Added stages in Jenkins for hipRTC. Fixes for some of the pending issues from hipRTC.
-
Paul Fultz II authored
* Add general optimize pass
* Fuse gemm multiplies by scalar
* Handle zero epsilon
-
- 27 Jan, 2023 2 commits
- 26 Jan, 2023 8 commits
-
-
Paul authored
-
Paul authored
-
Paul authored
-
Paul authored
-
Paul authored
-
Alan Turner authored
-
Alan Turner authored
-
Paul authored
-