- 31 Mar, 2023 1 commit
-
-
Charlie Lin authored
Adds a new GPU compiler pass split_single_dyn_dim that handles when one input parameter has a single non-fixed dynamic_dimension. commonly occurs for dynamic batch or BERT sequence length Splits the dynamic shape into several submodules will static input parameters to handle all of the cases in the dynamic_dimension range. Essentially does what I manually did for the select_module verify tests Adds a compile option split_single_dyn_dim that toggles the pass on/off. Defaults to false. Updates verify_program.hpp and run_verify.cpp to allow for the tests to change the compile_options
-
- 30 Mar, 2023 1 commit
-
-
Paul Fultz II authored
* Add hiprtc driver
-
- 29 Mar, 2023 1 commit
-
-
Paul Fultz II authored
-
- 28 Mar, 2023 1 commit
-
-
Umang Yadav authored
* Remove version from check_context and bump program version
-
- 27 Mar, 2023 1 commit
-
-
Manupa Karunaratne authored
* [MLIR] add dot offloads with manual tuning support * This commit adds dot + pointwise fusion support along with manual tuning using rocMLIR.
-
- 25 Mar, 2023 1 commit
-
-
Umang Yadav authored
Co-authored-by:Chris Austen <causten@users.noreply.github.com>
-
- 22 Mar, 2023 1 commit
-
-
Umang Yadav authored
prevent dynamically loading the target library that is not compiled with the same version of MIGraphX core lib.
-
- 21 Mar, 2023 2 commits
-
-
Charlie Lin authored
Refactor to have select_module use output parameters Disable select_module verify tests on cpu
-
Umang Yadav authored
Recent changes #1608 removed migraphx_all_target lib from driver and that led to missing compile time definitions. Missing compile definitions led to change of default target in driver.
-
- 18 Mar, 2023 1 commit
-
-
Umang Yadav authored
Fixes #1595
-
- 17 Mar, 2023 1 commit
-
-
Paul Fultz II authored
This is the original testcase that sparked the error with missing proper const folding. Pushing changes up to this branch and closing out the PR #1622
-
- 13 Mar, 2023 2 commits
-
-
Umang Yadav authored
-
Manupa Karunaratne authored
* [MLIR] Adds a runtime switch to trigger MLIR
-
- 10 Mar, 2023 2 commits
-
-
Paul Fultz II authored
-
Paul Fultz II authored
-
- 09 Mar, 2023 1 commit
-
-
Akash Patel authored
fallback to msgpack for older msgpack versions
-
- 07 Mar, 2023 1 commit
-
-
Umang Yadav authored
* NMS improvements
-
- 04 Mar, 2023 1 commit
-
-
Umang Yadav authored
resolve half info messages while building
-
- 01 Mar, 2023 1 commit
-
-
Charlie Lin authored
Add additional documentation to explain the passes.
-
- 28 Feb, 2023 1 commit
-
-
Charlie Lin authored
Creates the select_module operator that selects one of the submodules passed to it to run based on the submodule parameters. The submodule is selected by having the exact same static shapes for the arguments to select_module as the parameters in the submodule
-
- 23 Feb, 2023 1 commit
-
-
shivadbhavsar authored
-
- 16 Feb, 2023 3 commits
-
-
Paul Fultz II authored
Avoids double global loads. Strided loops are unrolled which lets store results in array which compiler will use registers for since the index access is constant. Updated to handle large reductions so which results with a better stable diffusion result
-
Umang Yadav authored
* deprecate HCC
-
Umang Yadav authored
* Add driver flag "--exhaustive-tune" to enable tuning, add support for the same in C/C++ and python API
-
- 15 Feb, 2023 1 commit
-
-
Brian Pickrell authored
Add dynamic shape support to slice operator. First draft of this feature doesn't support ops slicing non-fixed, dynamic axes. Resulting shape in such cases is not guaranteed.* Also, onnx parsing doesn't support any arguments other than "axes".
-
- 14 Feb, 2023 3 commits
-
-
shivadbhavsar authored
Currently, we default to device 0 when loading programs. Updating this to use hipGetDevice to set the device for the loaded program.
-
Charlie Lin authored
Expands on the documentation and corrects default option documentation error.
-
Paul Fultz II authored
* Add serialization of tuples and optional types
-
- 13 Feb, 2023 1 commit
-
-
kahmed10 authored
Using add_instruction for the neg op was causing issues on replace_instruction. Changed to use insert_instruction. Tests and added a new one that is failing without the change.
-
- 11 Feb, 2023 1 commit
-
-
Brian Pickrell authored
* add dynamic shape support to concat operator. Includes new op_shape_test and ref_ops_test cases
-
- 10 Feb, 2023 2 commits
-
-
Brian Pickrell authored
dyn shape support for Where operator. Includes shape test, ref_ops test, onx_test.
-
Umang Yadav authored
-
- 06 Feb, 2023 1 commit
-
-
Paul Fultz II authored
* Fuse layernorm with different patterns * Only match when using the last axis Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 03 Feb, 2023 2 commits
-
-
Paul Fultz II authored
Refactors memory coloring to only handle allocation instructions. It also handles allocations for tuple shapes.
-
Brian Pickrell authored
* Implement dynamic shapes for scatterND operators.
-
- 02 Feb, 2023 1 commit
-
-
Brian Pickrell authored
Dynamic shape support for gathernd op.
-
- 01 Feb, 2023 1 commit
-
-
Ted Themistokleous authored
Allows migraphx to inline the IF operator when we run into an IF that can be evaluated at compile time, thus avoiding us injecting IF and just inserting the instructions directly.
-
- 31 Jan, 2023 3 commits
-
-
Chris Austen authored
upgrade to ROCm 5.4.2 in CI
-
Umang Yadav authored
Added CMakeFlag for hipRTC. MIGRAPHX_USE_HIPRTC. Added stages in Jenkins for hipRTC. Fixes for some of the pending issues from hipRTC.
-
Paul Fultz II authored
* Add general optimize pass * Fuse gemm multiplies by scalar * Handle zero epsilon
-