- 26 Jul, 2023 2 commits
-
-
Brian Pickrell authored
-
Lakhinder Walia authored
* Remove rocblas_gemm_flags_pack_int8x4 in rocblas v3.0 or higher (#2002)
* Update src/targets/gpu/gemm_impl.cpp
-
- 25 Jul, 2023 4 commits
-
-
Artur Wojcik authored
-
Brian Pickrell authored
-
Artur Wojcik authored
-
Brian Pickrell authored
* Add dynamic input to prefix_scan_op
* Added a shape test. The op should return the same dynamic shape as the input.
* Add 2D shape test for prefix_scan
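To illustrate the shape property the test checks, here is a minimal sketch of an inclusive prefix sum in plain Python. The function names are hypothetical and are not MIGraphX's API; the point is only that the output has the same shape as the input, whatever that shape is at run time.

```python
from itertools import accumulate


def prefix_scan_sum(values):
    """Inclusive prefix sum: element i is the sum of values[0..i]."""
    return list(accumulate(values))


def prefix_scan_sum_2d(rows):
    """Scan each row independently, so the 2D shape is preserved."""
    return [prefix_scan_sum(row) for row in rows]


data = [[1, 2, 3], [4, 5, 6]]
scanned = prefix_scan_sum_2d(data)
print(scanned)  # [[1, 3, 6], [4, 9, 15]]
```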
-
- 24 Jul, 2023 1 commit
-
-
Brian Pickrell authored
-
- 23 Jul, 2023 1 commit
-
-
Charlie Lin authored
-
- 22 Jul, 2023 3 commits
-
-
Charlie Lin authored
Throwing on these calls catches dynamic shape errors earlier rather than having to backpedal from a bad call
-
Paul Fultz II authored
This adds a format.py file which will check or apply formatting using git clang-format or yapf.
-
kahmed10 authored
-
- 21 Jul, 2023 4 commits
-
-
Brian Pickrell authored
-
Umang Yadav authored
Fixes #1957. Clamping was removed in #1853, but it turns out clamping is necessary to handle overflow/underflow cases: during downcasting, a value that overflowed was returned as infinity when it was not clamped.
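The effect can be sketched with Python's standard library, assuming the downcast target is IEEE 754 half precision (fp16). The helper names are hypothetical, and `struct` raises on out-of-range values, so the unclamped path emulates the IEEE overflow-to-infinity result by hand.

```python
import math
import struct

HALF_MAX = 65504.0  # largest finite IEEE 754 binary16 value


def to_half(x):
    """Round x to the nearest fp16 value; overflow becomes +/-inf."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # struct refuses to pack out-of-range values; emulate the IEEE result.
        return math.copysign(math.inf, x)


def to_half_clamped(x):
    """Clamp to the finite fp16 range before downcasting."""
    return to_half(max(-HALF_MAX, min(HALF_MAX, x)))


print(to_half(1.0e5))           # inf
print(to_half_clamped(1.0e5))   # 65504.0
print(to_half_clamped(-1.0e5))  # -65504.0
```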
-
Umang Yadav authored
Fixes #1746 BatchNorm only has x as the runtime input parameter for the following equation. All the other parameters are compile-time constants and related operations can be const-folded before quantizing to fp16 to preserve precision.
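A minimal sketch of the folding idea, assuming the equation referred to is the standard inference-time form y = (x - mean) / sqrt(var + eps) * gamma + beta. The function names and sample values are hypothetical. Because everything except x is constant, the expression collapses to one multiply and one add, and both constants can be computed at full precision before any conversion to fp16.

```python
import math


def batchnorm_reference(x, mean, var, gamma, beta, eps=1e-5):
    """Unfolded inference-time batch normalization."""
    return (x - mean) / math.sqrt(var + eps) * gamma + beta


def fold_batchnorm(mean, var, gamma, beta, eps=1e-5):
    """Fold the compile-time constants into a single scale and shift."""
    scale = gamma / math.sqrt(var + eps)
    shift = beta - mean * scale
    return scale, shift


scale, shift = fold_batchnorm(mean=0.5, var=4.0, gamma=2.0, beta=1.0)
x = 3.0
folded = x * scale + shift  # the only work left at run time
assert math.isclose(folded, batchnorm_reference(x, 0.5, 4.0, 2.0, 1.0))
```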
-
Umang Yadav authored
HIP requires the number of global work items to be a multiple of the number of local work items. If it is not, correct results are not guaranteed. Fixes #1977. Fixes #1644. MIGraphX CI has moved to rocm-5.6, which doesn't require hipRTC workarounds.
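One common way to satisfy this constraint is to round the global size up to the next multiple of the local size. The sketch below shows only that arithmetic; the function name is hypothetical and this is not necessarily how the commit implements it.

```python
def round_up_global(n, local):
    """Smallest multiple of `local` that is >= n."""
    if local <= 0:
        raise ValueError("local work size must be positive")
    return ((n + local - 1) // local) * local


print(round_up_global(1000, 256))  # 1024
print(round_up_global(1024, 256))  # 1024
```

With a rounded-up launch, the kernel body typically needs a bounds check so the extra work items beyond `n` do nothing.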
-
- 20 Jul, 2023 1 commit
-
-
Brian Pickrell authored
-
- 19 Jul, 2023 3 commits
-
-
Umang Yadav authored
* Add multitarget evaluation
* Add evaluation for the multi-target unit-tests
-
Umang Yadav authored
-
Paul Fultz II authored
The find_reshaper is supposed to do this, but it doesn't work and there were no tests. So I updated it to work and added unit tests for it.
-
- 18 Jul, 2023 1 commit
-
-
Umang Yadav authored
Fixes #1946
-
- 17 Jul, 2023 4 commits
-
-
Chris Austen authored
* Add support for rocm 5.6 in CI
* Disable anonymous namespace check
* Add default c'tors to avoid warnings
-
Brian Pickrell authored
Added dynamic input handling to multinomial.hpp. Tests don't pass yet because it still requires both a random uniform operation and a dynamic-friendly prefix_scan operation that accept dynamic inputs.
-
Krzysztof Drewniak authored
This commit removes the build options to disable threading and removes the mutex in compile_mlir. The commit being tested is a draft PR on rocMLIR that'll get merged if this passes
-
Charlie Lin authored
Throw on use_len with non-fixed dynamic dimensions. Change normalize_attributes to use the input shape rather than the input dimensions.
-
- 16 Jul, 2023 1 commit
-
-
Umang Yadav authored
-
- 13 Jul, 2023 3 commits
-
-
Krzysztof Drewniak authored
Allows the rocMLIR CI (which builds rocMLIR tip against MIGraphX tip) to pass.
-
Brian Pickrell authored
-
Charlie Lin authored
* Renames deconvolution -> convolution_backwards to be more consistent with the literature.
* Note: this is not the cross-correlation operator (which is the adjoint of convolution). This is technically a standard convolution operator combined with an upsampling operator rather than a downsampling operator.
* Adds unit tests for the padding, strides, dilations, and other op attributes.
* Throws on the auto_pad attribute since it has not been implemented. Previously it read the attribute and set it but then did nothing with it.
* Extended for dynamic shapes.
* Does not support using asymmetric padding (padding_L != padding_R) and output_shape with dynamic shapes.
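The "standard convolution combined with an upsampling operator" description can be checked on a small 1D case. The sketch below is illustrative only: the function names are hypothetical, padding and dilation are left out, and the input is assumed non-empty. It computes the result once by scattering each input element into the output, and once by inserting zeros between input elements and then applying a full convolution; the two agree.

```python
def convolution_backwards_1d(x, w, stride=1):
    """Scatter form: each input element adds a scaled copy of w to the output."""
    out = [0.0] * ((len(x) - 1) * stride + len(w))
    for i, xi in enumerate(x):
        for j, wj in enumerate(w):
            out[i * stride + j] += xi * wj
    return out


def upsample_then_convolve_1d(x, w, stride=1):
    """Insert stride-1 zeros between elements, then apply a full convolution."""
    up = [0.0] * ((len(x) - 1) * stride + 1)
    for i, xi in enumerate(x):
        up[i * stride] = xi
    out = [0.0] * (len(up) + len(w) - 1)
    for n in range(len(out)):
        for k, uk in enumerate(up):
            if 0 <= n - k < len(w):
                out[n] += uk * w[n - k]
    return out


x, w = [1.0, 2.0], [1.0, 2.0, 3.0]
print(convolution_backwards_1d(x, w, stride=2))   # [1.0, 2.0, 5.0, 4.0, 6.0]
print(upsample_then_convolve_1d(x, w, stride=2))  # [1.0, 2.0, 5.0, 4.0, 6.0]
```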
-
- 11 Jul, 2023 1 commit
-
-
Umang Yadav authored
* do not use int8x4 format for the rocblas
-
- 10 Jul, 2023 2 commits
-
-
Paul Fultz II authored
-
Brian Pickrell authored
Changes the way the Pooling operation calculates results when there's padding. The old code clipped off any padding values before computing; for instance, if an Average pooling window contained 0 1 2, where the 0 is padding, the result was 1.5 instead of 1.0. See Issue 1766.
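The arithmetic in that example can be reproduced with a small sketch. The helper and its `include_padding` flag are hypothetical names used only for illustration; padding positions are marked with `None`.

```python
PAD = None  # marker for a padded position in the window


def average_window(window, include_padding):
    """Average one pooling window, optionally counting padded positions."""
    real = [v for v in window if v is not PAD]
    if include_padding:
        # Padding contributes 0 to the sum but still counts in the divisor.
        return sum(real) / len(window)
    # Old behaviour: padding is clipped off before averaging.
    return sum(real) / len(real)


window = [PAD, 1.0, 2.0]
print(average_window(window, include_padding=False))  # 1.5
print(average_window(window, include_padding=True))   # 1.0
```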
-
- 09 Jul, 2023 1 commit
-
-
Umang Yadav authored
-
- 08 Jul, 2023 2 commits
-
-
Artur Wojcik authored
-
Artur Wojcik authored
Export API symbols for migraphx, migraphx_ref, migraphx_cpu, migraphx_gpu, migraphx_device, migraphx_tf, and migraphx_onnx. There is a separate PR for migraphx_c. API symbol exporting affects only Windows. It is transparent on Linux.
-
- 06 Jul, 2023 6 commits
-
-
Artur Wojcik authored
-
Brian Pickrell authored
-
Paul Fultz II authored
This will also annotate the function with the block size so the compiler can do a better job of optimizing.
-
Brian Pickrell authored
-
Brian Pickrell authored
-
Brian Pickrell authored
-