- 18 Jul, 2023 2 commits
-
-
Khalique Ahmed authored
-
Khalique Ahmed authored
Merge branch 'pointwise-nhwc' of https://github.com/ROCmSoftwarePlatform/AMDMIGraphX into nhwc_workaround
-
- 13 Jul, 2023 6 commits
-
-
Paul authored
-
Paul authored
-
Paul authored
-
Paul authored
-
Paul authored
-
Charlie Lin authored
Renames deconvolution -> convolution_backwards to be more consistent with the literature Note: this is not the cross-correlation operator (which is the adjoint of convolution). This is technically a standard convolution operator combined with an upsampling operator rather than a downsampling operator. Adds unit tests for the padding, strides, dilations, and other op attributes. Throws on auto_pad attribute since it has not been implemented Previously it read the attribute and set it but then did nothing with it Extended for dynamic shapes Does not support using asymmetric padding (padding_L != padding_R) and output_shape with dynamic shapes.
-
- 11 Jul, 2023 4 commits
-
-
Khalique Ahmed authored
-
Umang Yadav authored
* do not use int8x4 format for the rocblas
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
- 10 Jul, 2023 3 commits
-
-
Paul Fultz II authored
-
github-actions[bot] authored
-
Brian Pickrell authored
Changes to the way Pooling operation calculates pooling when there's padding. Old code would clip off any padding values before computing; for instance if an Average pooling window contained 0 1 2 where the 0 is padding, the result was 1.5 instead of 1.0. See Issue 1766
-
- 09 Jul, 2023 1 commit
-
-
Umang Yadav authored
-
- 08 Jul, 2023 2 commits
-
-
Artur Wojcik authored
-
Artur Wojcik authored
Export API symbols for migraphx, migraphx_ref, migraphx_cpu, migrphx_gpu, migraphx_device, migraphx_tf, and migraphx_onnx. There is a separate PR for migrahx_c. API symbol exporting affects only Windows. It is transparent on Linux.
-
- 07 Jul, 2023 3 commits
-
-
Umang Yadav authored
-
Umang Yadav authored
* move tidy,cppcheck to rocm-ubuntu runners
-
-
- 06 Jul, 2023 3 commits
-
-
Artur Wojcik authored
-
Paul Fultz II authored
This will also annotate the function with the block size so the compiler can do a better job of optimizing.
-
Paul Fultz II authored
This is to help enable multi-target execution. We store a vector of targets and contexts. Currently this will only compile a single target, the PR #1672 is needed to enable multiple targets. This will also serialize the targets and contexts. When using the execution_environment or prog.get_context() it will always use the context from the first target assuming this is the "primary" target. Although, its unlikely a user would use execution_environment with a multi-target environment.
-
- 05 Jul, 2023 3 commits
-
-
kahmed10 authored
Fixes the failing test case in #1815. Added a test that would otherwise fail without the change.
-
Umang Yadav authored
Needed to run multi-targeted program where "main" isn't the only root module. There could be many root modules other than main.
-
Artur Wojcik authored
-
- 02 Jul, 2023 3 commits
-
-
Charlie Lin authored
Updates ref version of clip to work with dynamic shapes Encountered in agentmodel
-
Paul Fultz II authored
Add a CI job to test CK Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK Continue tuning even when there is invalid configs Fix a bug with parallel compilation not using all available threads Add additional test for gemms using half types Removed int32 as supported type since it doesnt pass our test suite
-
Umang Yadav authored
-
- 01 Jul, 2023 1 commit
-
-
Paul Fultz II authored
This will show the instructions that are inserted for each onnx operator. This should help debugging where certain instructions come from.
-
- 30 Jun, 2023 4 commits
-
-
Khalique Ahmed authored
-
Umang Yadav authored
-
-
Paul Fultz II authored
-
- 29 Jun, 2023 2 commits
-
-
Krzysztof Drewniak authored
Bump MLIR commit to include latest supported pointwise ops. Expand the MLIR approve list Ensure that operations such as tanh() that don't have integer implementations (at least in MLIR) aren't used within MLIR modules. Add additional tests.
-
Artur Wojcik authored
Co-authored-by:Artur Wojcik <artur.wojcik@amd.com>
-
- 28 Jun, 2023 2 commits
-
-
Umang Yadav authored
-
Krzysztof Drewniak authored
Update `mlir_program` to only create one dialect registry, and to call registerRocMLIRPasses() (which is needed and may not be thread-safe) exactly once. In addition, use a single thread pool across all contexts. This is recommended practice upstream for libraries that perform a lot of compile jobs, and saves on the overhead of creating and destroying a lot of threads
-
- 27 Jun, 2023 1 commit
-
-
Artur Wojcik authored
-