Commits · 64217aafd04c98bfa925f8f44a65c00fc3323bf6 · gaoqiong / MIGraphX

28 Oct, 2022 1 commit
- Add tests · 64217aaf
  charlie authored Oct 28, 2022
  
  64217aaf
27 Oct, 2022 5 commits
- Fix stuff and add tests · 17abf67e
  charlie authored Oct 27, 2022
  
  17abf67e
- add multibroadcast ref_ops tests · de7e6675
  charlie authored Oct 27, 2022
  
  de7e6675
- Add broadcast_2in ref_ops tests · b0cbf351
  charlie authored Oct 27, 2022
  
  b0cbf351
- Add JIT pad (#1411) · 0d841ded
  kahmed10 authored Oct 27, 2022
```
updated GPU pad to now use JIT version.
added range functions for JIT kernels.
```
  0d841ded
- Shape tests and update implement · 295c2abb
  charlie authored Oct 26, 2022
  
  295c2abb
26 Oct, 2022 3 commits
- rearrange default pass list; adjust_allocation must be run after rep… (#1418) · 7b9ce460
  Brian Pickrell authored Oct 26, 2022
```
Fixes an observed regression error on certain Frozen Protobuf models due to PR 1280
```
  7b9ce460
- codecov update · 654fff15
  charlie authored Oct 26, 2022
  
  654fff15
- Add dyn binary ref tests · 58a1da5e
  charlie authored Oct 25, 2022
  
  58a1da5e
24 Oct, 2022 2 commits
- Add ref ops test · d2fa123c
  charlie authored Oct 24, 2022
  
  d2fa123c
- Add ONNX tests · 5120f911
  charlie authored Oct 24, 2022
  
  5120f911
20 Oct, 2022 2 commits
- start binary_ops tests · 2ce6ca15
  charlie authored Oct 20, 2022
  
  2ce6ca15
- revert broadcast and multibroadcast update · e92fef6e
  charlie authored Oct 20, 2022
```
update multibroadcast shape tests
```
  e92fef6e
19 Oct, 2022 2 commits

Refactor dynamic compute; Dynamic ref unary functions (#1407) · 693cb5d8

Charlie Lin authored Oct 19, 2022

Refactor dynamic compute
- add a compute_output_shape object that implicitly converts to a new dyn_output or shape object
- dyn_output object can handle computing the static output shape of an operator given the input arguments shapes
  change an operator's compute function to argument compute(const dyn_output& dyn_out, std::vector<argument> args) to 
  use dyn_output object

Dynamic ref unary functions
-  Included these changes to have an example of the refactored dynamic compute being used
-  Changes to unary base class to handle dynamic shapes
-  Changed elu and leaky_relu to use unary base class and pointwise JIT

693cb5d8

Find2.0 changes for the Quant and De-Convolution (#1408) · 5fa42993

Umang Yadav authored Oct 19, 2022



* use find2.0 for the convolution
Co-authored-by: Vasilii Filippov <DrizztDoUrden@users.noreply.github.com>
Co-authored-by: Chris Austen <causten@users.noreply.github.com>

5fa42993

18 Oct, 2022 1 commit

Add support in mlir for transposed and broadcasted shaped (#1378) · c3e02b18

Paul Fultz II authored Oct 18, 2022



* Enable non-standard shape
* Use perfdb for non xdlops
* Fix transpose+broadcast strides
Co-authored-by: jungpark-mlir <jungwook.park@amd.com>

c3e02b18

17 Oct, 2022 3 commits
- fixed-fixed test and fix strides bug · a67e6327
  charlie authored Oct 17, 2022
  
  a67e6327
- Add op_shape tests · bb7c3a25
  charlie authored Oct 17, 2022
  
  bb7c3a25
- memset fix (#1414) · 83784c52
  Umang Yadav authored Oct 17, 2022
```
hipMemset is causing random failure.
hipMemsetAsync is doing the correct synchronization.
```
  83784c52
14 Oct, 2022 1 commit

Fix rank 2 batch norm (#1412) · 01d0ecfc

Charlie Lin authored Oct 14, 2022

Allows for rank 2 tensors into batchnorm.  Specifically when spatial dimensions are all 1 and removed

01d0ecfc

13 Oct, 2022 2 commits

Refactor dynamic padding mode (#1387) · 32f6388c

Charlie Lin authored Oct 13, 2022

Removes use_dynamic_same_auto_pad
Change padding_mode to be used for dynamic padding
Move compute_padded_shape to pad_calc.cpp as it will be used in other dynamic padding cases
Fix same_lower compute_padded_shape bug and add a test.

32f6388c

Rewrite TF batch norm; remove batch_norm_inference (#1371) · be309bfb

Charlie Lin authored Oct 13, 2022

Rewrites the TF batch norm like operators to other MIGX operators
Removes the code related to batch_norm_inference

be309bfb

12 Oct, 2022 1 commit
- Progress? · 940220c8
  charlie authored Oct 12, 2022
  
  940220c8
11 Oct, 2022 1 commit
- Fix sinh_dynamic onnx test · fbe13c96
  charlie authored Oct 11, 2022
  
  fbe13c96
10 Oct, 2022 1 commit
- Remove leaku_relu and elu verify tests · 8c4ae897
  charlie authored Oct 10, 2022
  
  8c4ae897
07 Oct, 2022 1 commit

Simplify unit algebraic ops (#1281) · 4f3cc417

Ted Themistokleous authored Oct 07, 2022

Simplified algebraic operations (x*1), x*(-1), x/1, 0+x & x+0,  x-0, 0-x, 0*x, x*0, and 0/x operations

4f3cc417

04 Oct, 2022 2 commits
- Stream sync Changset (#1358) · f7d987ba
  Ted Themistokleous authored Oct 04, 2022
```
Stream sync changes and associated API level changes
```
  f7d987ba
- Fast softmax (#1290) · a9a47402
  Paul Fultz II authored Oct 04, 2022
```
optimize the softmax operator
```
  a9a47402
03 Oct, 2022 2 commits

Add output_alias and runs_on_offload_target flags for the custom ops (#1309) · c9ffb38d

Umang Yadav authored Oct 03, 2022

Adds two methods for the custom_ops virtual class.

bool runs_on_offload_target(), if the custom op runs directly on the gpu then it should be set to true. in this case, custom op expects its parameters to reside in GPU memory and writes output to the GPU memory. If it is set to false then, custom op expects it's parameter to reside on the host and puts back the result into the host memory.

output_alias, if output of the custom op is aliasing the input buffer. i.e. interpreting the same input buffer with differnet shape and strides.

Update as_vector() in C++ API to handle non-standard shapes. It required exposing element_index to space_index conversion method for the shape class.

c9ffb38d

Revert "Fix onnx_test" · 24626f8f
charlie authored Oct 03, 2022
```
This reverts commit 0b6ce109.
```
24626f8f

30 Sep, 2022 1 commit
- Fix onnx_test · 0b6ce109
  charlie authored Sep 30, 2022
  
  0b6ce109
29 Sep, 2022 3 commits
- Fix elu and leaky_relu pointwise JIT · 48c7c810
  charlie authored Sep 29, 2022
  
  48c7c810
- Use find_2.0 API for the convolution (#1346) · e19f78ae
  Umang Yadav authored Sep 29, 2022
```
Improvements/Additions to be made:

changes for the quant_convolution,
changes for the deconvolution,
Macros for MIOpen status checks
```
  e19f78ae
- Fix test typo? · 5793740d
  charlie authored Sep 29, 2022
  
  5793740d
28 Sep, 2022 3 commits
- Add onnx files · 69900d77
  charlie authored Sep 28, 2022
  
  69900d77
- Unary ops changes and tests · 65e14286
  charlie authored Sep 28, 2022
  
  65e14286
- Add compute_fp32 flag for quant_gemm tests (#1360) · 70e63960
  Umang Yadav authored Sep 28, 2022
```
test_gpu_pack_int8_args fails on gfx908 machine, because it doesn't set compute_fp32 flag correctly. This PR fixes the test such that it checks for the device-name, and rocblas-versions and sets this flag accordingly.
```
  70e63960
27 Sep, 2022 2 commits
- Adding tests · 30243d2c
  charlie authored Sep 27, 2022
  
  30243d2c
- Add onnx mod operator gpu cpu (#1306) · 40118191
  Ted Themistokleous authored Sep 26, 2022
```
Implement operator for CPU and GPU implementations
```
  40118191
26 Sep, 2022 1 commit

Rewrite ONNX parse batch norm (#1362) · c00f8202

Charlie Lin authored Sep 26, 2022

Rewrites the BatchNormalization ONNX operator into other MIGX operators
- Added handling of 1D input tensor case (edge case in ONNX spec)
Removes the spatial and per_activation functionality (not in the ONNX spec)
- Did not remove the batch_norm_inference related code as the TensorFlow parser still uses it
- Can remove that code when the TF version is updated

c00f8202