- 14 Mar, 2023 3 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
- 17 Feb, 2023 2 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
- 07 Feb, 2023 2 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
- 06 Feb, 2023 1 commit
-
-
Paul Fultz II authored
* Fuse layernorm with different patterns * Only match when using the last axis Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 03 Feb, 2023 5 commits
- 01 Feb, 2023 1 commit
-
-
Alan Turner authored
-
- 31 Jan, 2023 3 commits
-
-
Paul authored
-
Umang Yadav authored
Added CMakeFlag for hipRTC. MIGRAPHX_USE_HIPRTC. Added stages in Jenkins for hipRTC. Fixes for some of the pending issues from hipRTC.
-
Paul Fultz II authored
* Add general optimize pass * Fuse gemm multiplies by scalar * Handle zero epsilon
-
- 27 Jan, 2023 2 commits
- 26 Jan, 2023 9 commits
-
-
Paul authored
-
Paul authored
-
Paul authored
-
Paul authored
-
Paul authored
-
Alan Turner authored
-
Alan Turner authored
-
Paul authored
-
Paul authored
-
- 19 Jan, 2023 1 commit
-
-
Paul Fultz II authored
This prevents multiple adds.
-
- 17 Jan, 2023 1 commit
-
-
Paul Fultz II authored
-
- 15 Jan, 2023 1 commit
-
-
Paul authored
-
- 11 Jan, 2023 1 commit
-
-
Paul Fultz II authored
* Use cosine to compute half sin
-
- 09 Jan, 2023 1 commit
-
-
Ted Themistokleous authored
JIT implementation of the gather operator Added a few more unit tests to this one as well since I saw some odd behavior during bring up.
-
- 14 Dec, 2022 2 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
- 11 Dec, 2022 1 commit
-
-
Umang Yadav authored
HIP had change in previous rocm releases to use --offload-arch instead of --cuda-gpu-arch. This should be backwards compatbile. hipRTC also supports --offload-arch.
-
- 07 Dec, 2022 2 commits
-
-
Paul Fultz II authored
* Add implicit_conversion
-
Paul authored
-
- 06 Dec, 2022 2 commits