Commits · 6be7f1fb96764feae6f3c157a4b3428e1af2f7b5 · gaoqiong / MIGraphX

28 Oct, 2022 6 commits
- cosmetic · 48cd6fa1
  Brian Pickrell authored Oct 28, 2022
  
  48cd6fa1
- Refactored reduce_op::normalize_compute_shape() handling of dynamic shapes to... · b0b02e63
  Brian Pickrell authored Oct 28, 2022
```
Refactored reduce_op::normalize_compute_shape() handling of dynamic shapes to set reduced dimensions to {1,1} instead of removing them.
Updated shape tests for reduce ops.
Spurious change to reduce_mean.cpp reverted.
```
  b0b02e63
- comment fix · 33da54a7
  Brian Pickrell authored Oct 28, 2022
  
  33da54a7
- dynamic shape support for reduce_XXX operations. One test in ref_ops_test;... · b3f0f482
  Brian Pickrell authored Oct 28, 2022
```
dynamic shape support for reduce_XXX operations.  One test in ref_ops_test; one test in op_shape_test
```
  b3f0f482
- dynamic shape support for reduce_XXX operations. One test in ref_ops_test;... · 8d23ef5d
  Brian Pickrell authored Oct 28, 2022
```
dynamic shape support for reduce_XXX operations.  One test in ref_ops_test; one test in op_shape_test
```
  8d23ef5d
- Use minimum block size of 64 threads (#1427) · 25a0e433
  Umang Yadav authored Oct 28, 2022
```
Local thread counts that are multiples of 32 were introduced in #1348,
but local thread counts that are not multiples of 64 cause correctness issues.
```
  25a0e433
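
The reduce_op change in b0b02e63 keeps the rank of a dynamic shape and sets each reduced dimension to the fixed range {1,1} rather than dropping it. A minimal sketch of that idea follows. The `dyn_dim` struct and `reduce_dyn_dims` function are hypothetical names used for illustration only, not MIGraphX's actual types, and the handling of negative axes is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a dynamic dimension: an inclusive [min, max] range.
struct dyn_dim
{
    std::size_t min;
    std::size_t max;
};

// Set every reduced axis to the fixed range {1, 1} while keeping the rank.
// Assumption: negative axes count from the last dimension.
std::vector<dyn_dim> reduce_dyn_dims(std::vector<dyn_dim> dims,
                                     const std::vector<std::int64_t>& axes)
{
    const auto rank = static_cast<std::int64_t>(dims.size());
    for(auto axis : axes)
    {
        if(axis < 0)
            axis += rank;
        assert(axis >= 0 && axis < rank);
        dims[static_cast<std::size_t>(axis)] = dyn_dim{1, 1};
    }
    return dims;
}
```

Under these assumptions, reducing the last axis of dimensions {2,4}, {3,3}, {5,8} would give {2,4}, {3,3}, {1,1}, so downstream operators still see a rank 3 shape.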
27 Oct, 2022 2 commits

Upgrade CI environment to 5.3.0 (#1198) · 4b1c1c41

Chris Austen authored Oct 27, 2022

Upgraded Dockerfiles and fixed tidy issues to make Ubuntu 20.04 and ROCm 5.3.0 the default

4b1c1c41

Add JIT pad (#1411) · 0d841ded

kahmed10 authored Oct 27, 2022

Updated GPU pad to use the JIT version.
Added range functions for JIT kernels.

0d841ded

26 Oct, 2022 2 commits

rearrange default pass list; adjust_allocation must be run after rep… (#1418) · 7b9ce460
Brian Pickrell authored Oct 26, 2022
```
Fixes a regression observed on certain Frozen Protobuf models, introduced by PR 1280
```
7b9ce460

Regenerate driver models (#1422) · d8756a4e

kahmed10 authored Oct 26, 2022

use_dynamic_same_auto_pad was removed from convolution, but the driver models still retain the fields. This PR regenerates the files so that they are compatible again.

d8756a4e

24 Oct, 2022 1 commit

Add relaxed standard shape assertion (#1416) · f1ecad75

jungpark-mlir authored Oct 24, 2022

Reiterate the assertion on the standard shape, but relax it for the multibroadcast ops that are deliberately inserted to make the broadcast explicit.

f1ecad75

21 Oct, 2022 1 commit
- work in progress; code builds but incomplete · 5696ac5f
  Brian Pickrell authored Oct 21, 2022
  
  5696ac5f
19 Oct, 2022 2 commits

Refactor dynamic compute; Dynamic ref unary functions (#1407) · 693cb5d8

Charlie Lin authored Oct 19, 2022

Refactor dynamic compute
- add a compute_output_shape object that implicitly converts to a new dyn_output or shape object
- dyn_output object can handle computing the static output shape of an operator given the input argument shapes
- change an operator's compute function to argument compute(const dyn_output& dyn_out, std::vector<argument> args) to use the dyn_output object

Dynamic ref unary functions
-  Included these changes to have an example of the refactored dynamic compute being used
-  Changes to unary base class to handle dynamic shapes
-  Changed elu and leaky_relu to use unary base class and pointwise JIT

693cb5d8

Find2.0 changes for the Quant and De-Convolution (#1408) · 5fa42993

Umang Yadav authored Oct 19, 2022



* use find2.0 for the convolution
Co-authored-by: Vasilii Filippov <DrizztDoUrden@users.noreply.github.com>
Co-authored-by: Chris Austen <causten@users.noreply.github.com>

5fa42993

18 Oct, 2022 1 commit

Add support in mlir for transposed and broadcasted shapes (#1378) · c3e02b18

Paul Fultz II authored Oct 18, 2022



* Enable non-standard shape
* Use perfdb for non xdlops
* Fix transpose+broadcast strides
Co-authored-by: jungpark-mlir <jungwook.park@amd.com>

c3e02b18

14 Oct, 2022 1 commit

Fix rank 2 batch norm (#1412) · 01d0ecfc

Charlie Lin authored Oct 14, 2022

Allows rank 2 tensors as input to batchnorm, specifically when the spatial dimensions are all 1 and have been removed.

01d0ecfc

13 Oct, 2022 2 commits

Refactor dynamic padding mode (#1387) · 32f6388c

Charlie Lin authored Oct 13, 2022

Removes use_dynamic_same_auto_pad
Change padding_mode to be used for dynamic padding
Move compute_padded_shape to pad_calc.cpp as it will be used in other dynamic padding cases
Fix same_lower compute_padded_shape bug and add a test.

32f6388c
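
For context, "same" padding along one spatial axis is typically computed as sketched below. This follows the ONNX SAME_UPPER/SAME_LOWER convention, where the odd extra element goes to the end for same_upper and to the beginning for same_lower. The names `pad_pair` and `same_pad` are hypothetical, and this is not the code of compute_padded_shape in pad_calc.cpp.

```cpp
#include <cassert>
#include <cstddef>

// Padding added before and after one spatial axis.
struct pad_pair
{
    std::size_t begin;
    std::size_t end;
};

// Compute "same" padding for one axis so that output == ceil(input / stride).
// lower == true puts the odd extra element at the beginning (same_lower),
// otherwise at the end (same_upper).
pad_pair same_pad(std::size_t input,
                  std::size_t kernel,
                  std::size_t stride,
                  std::size_t dilation,
                  bool lower)
{
    assert(input > 0 && kernel > 0 && stride > 0 && dilation > 0);
    const std::size_t output           = (input + stride - 1) / stride;
    const std::size_t effective_kernel = (kernel - 1) * dilation + 1;
    const std::size_t needed           = (output - 1) * stride + effective_kernel;
    const std::size_t total            = needed > input ? needed - input : 0;
    const std::size_t small            = total / 2;
    const std::size_t large            = total - small;
    return lower ? pad_pair{large, small} : pad_pair{small, large};
}
```

With an input of 5, a kernel of 2, and stride and dilation of 1, the total padding is 1, so the two modes differ only in which side receives it.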

Rewrite TF batch norm; remove batch_norm_inference (#1371) · be309bfb

Charlie Lin authored Oct 13, 2022

Rewrites the TF batch-norm-like operators in terms of other MIGX operators
Removes the code related to batch_norm_inference

be309bfb

07 Oct, 2022 1 commit

Simplify unit algebraic ops (#1281) · 4f3cc417

Ted Themistokleous authored Oct 07, 2022

Simplifies the unit algebraic operations x*1, x*(-1), x/1, 0+x, x+0, x-0, 0-x, 0*x, x*0, and 0/x.

4f3cc417
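
The rewrites listed in this commit can be illustrated on a toy expression tree. This is a minimal sketch, not the MIGraphX matcher-based pass: `expr`, `kind`, and the helper functions are hypothetical, and the sketch ignores NaN, infinity, and division-by-zero corner cases.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

enum class kind { var, constant, add, sub, mul, div, neg };

struct expr;
using expr_ptr = std::shared_ptr<expr>;

// Toy expression node; value is used by constants, name by variables.
struct expr
{
    kind k;
    double value;
    std::string name;
    expr_ptr lhs;
    expr_ptr rhs;
};

expr_ptr make_var(std::string name)
{
    return std::make_shared<expr>(expr{kind::var, 0.0, std::move(name), nullptr, nullptr});
}
expr_ptr make_const(double v)
{
    return std::make_shared<expr>(expr{kind::constant, v, "", nullptr, nullptr});
}
expr_ptr make_binary(kind k, expr_ptr l, expr_ptr r)
{
    return std::make_shared<expr>(expr{k, 0.0, "", std::move(l), std::move(r)});
}
expr_ptr make_neg(expr_ptr x)
{
    return std::make_shared<expr>(expr{kind::neg, 0.0, "", std::move(x), nullptr});
}

bool is_const(const expr_ptr& e, double v) { return e->k == kind::constant && e->value == v; }

// Simplify bottom-up, applying the unit rewrites named in the commit message.
expr_ptr simplify(const expr_ptr& e)
{
    if(e->k == kind::var || e->k == kind::constant)
        return e;
    if(e->k == kind::neg)
        return make_neg(simplify(e->lhs));
    auto l = simplify(e->lhs);
    auto r = simplify(e->rhs);
    switch(e->k)
    {
    case kind::mul:
        if(is_const(l, 0) || is_const(r, 0)) return make_const(0); // 0*x, x*0
        if(is_const(r, 1)) return l;                               // x*1
        if(is_const(l, 1)) return r;                               // 1*x
        if(is_const(r, -1)) return make_neg(l);                    // x*(-1)
        if(is_const(l, -1)) return make_neg(r);                    // (-1)*x
        break;
    case kind::div:
        if(is_const(r, 1)) return l;             // x/1
        if(is_const(l, 0)) return make_const(0); // 0/x
        break;
    case kind::add:
        if(is_const(l, 0)) return r; // 0+x
        if(is_const(r, 0)) return l; // x+0
        break;
    case kind::sub:
        if(is_const(r, 0)) return l;           // x-0
        if(is_const(l, 0)) return make_neg(r); // 0-x
        break;
    default: break;
    }
    return make_binary(e->k, l, r);
}
```

Because simplification runs bottom-up, a nested case such as 0 + (x*0) collapses to the constant 0 in a single call.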

06 Oct, 2022 11 commits
- pad_calc works with 1D convolutions · 21ee3fc0
  charlie authored Oct 06, 2022
  
  21ee3fc0
- Update src/pad_calc.cpp · 63ada580
  Charlie Lin authored Oct 06, 2022
```
Co-authored-by: Umang Yadav <29876643+umangyadav@users.noreply.github.com>
```
  63ada580
- Move rest of stuff to dyn_output.hpp · b377012f
  charlie authored Oct 06, 2022
  
  b377012f
- Add pad_calc assert · c1caf40a
  charlie authored Oct 06, 2022
  
  c1caf40a
- Remove mistake pad_calc change · 558afddb
  charlie authored Oct 06, 2022
  
  558afddb
- Remove rest of elu and leaky_relu stuff · a459b2b8
  charlie authored Oct 06, 2022
  
  a459b2b8
- remove include<migraphx/operation.hpp> · f16e05af
  charlie authored Oct 06, 2022
  
  f16e05af
- remove gpu elu and leaky_relu · ef8d4b18
  charlie authored Oct 06, 2022
  
  ef8d4b18
- Use ${function:exp} for elu · 4ba2083d
  charlie authored Oct 06, 2022
  
  4ba2083d
- dyn_output header · dc79a00a
  charlie authored Oct 06, 2022
  
  dc79a00a
- add assert to num_spatial_dims · a61f5416
  charlie authored Oct 06, 2022
  
  a61f5416
04 Oct, 2022 2 commits
- Stream sync Changeset (#1358) · f7d987ba
  Ted Themistokleous authored Oct 04, 2022
```
Stream sync changes and associated API level changes
```
  f7d987ba
- Fast softmax (#1290) · a9a47402
  Paul Fultz II authored Oct 04, 2022
```
optimize the softmax operator
```
  a9a47402
03 Oct, 2022 5 commits

Add output_alias and runs_on_offload_target flags for the custom ops (#1309) · c9ffb38d

Umang Yadav authored Oct 03, 2022

Adds two methods for the custom_ops virtual class.

bool runs_on_offload_target(): should return true if the custom op runs directly on the GPU. In this case, the custom op expects its parameters to reside in GPU memory and writes its output to GPU memory. If it returns false, the custom op expects its parameters to reside on the host and puts the result back into host memory.

output_alias: indicates that the output of the custom op aliases the input buffer, i.e. it interprets the same input buffer with a different shape and strides.

Update as_vector() in the C++ API to handle non-standard shapes. This required exposing the element_index to space_index conversion method of the shape class.

c9ffb38d
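
For a non-standard shape (transposed or broadcast, for example), the position of an element in logical row-major order differs from its offset in the underlying buffer. The sketch below shows one way such a conversion can be written. The function name `element_to_space_index` and its signature are hypothetical, chosen for illustration, and are not the method exposed on the MIGraphX shape class.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Map a logical row-major element index to a buffer offset, given the
// dimension lengths and the (possibly non-standard) strides of the shape.
std::size_t element_to_space_index(std::size_t element,
                                   const std::vector<std::size_t>& lens,
                                   const std::vector<std::size_t>& strides)
{
    assert(lens.size() == strides.size());
    std::size_t offset = 0;
    // Peel off coordinates from the innermost dimension outward.
    for(std::size_t i = lens.size(); i > 0; --i)
    {
        const std::size_t dim = i - 1;
        offset += (element % lens[dim]) * strides[dim];
        element /= lens[dim];
    }
    return offset;
}
```

For lens {2,3} with transposed strides {1,2}, logical element 1 (coordinates (0,1)) maps to buffer offset 2, whereas with standard strides {3,1} every element index maps to itself.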

Revert "Update TODO comment" · c36d88f1
charlie authored Oct 03, 2022
```
This reverts commit cb0da1cb.
```
c36d88f1

Update comments · ed2acdc4
charlie authored Oct 03, 2022

ed2acdc4

padding_mode enum comment update · 58ef773a
charlie authored Oct 03, 2022

58ef773a

Update TODO comment · cb0da1cb
charlie authored Oct 03, 2022

cb0da1cb

30 Sep, 2022 1 commit
- Revert ref_conv changes · a9c0252a
  charlie authored Sep 30, 2022
```
Not needed, special case with dynamic padding
```
  a9c0252a
29 Sep, 2022 2 commits
- Remove shape.empty() function, wasn't used · 91f89fcc
  charlie authored Sep 29, 2022
  
  91f89fcc
- Fix elu and leaky_relu pointwise JIT · 48c7c810
  charlie authored Sep 29, 2022
  
  48c7c810