Commits · c4b6469ab8dfc7bb2d341650faa527da3023ab6a · gaoqiong / MIGraphX

06 Apr, 2022 1 commit

Python Binding for the Manual Graph Buidling (#1143) · c4b6469a

Umang Yadav authored Apr 06, 2022

Adds following API binding and tests to python :

add_return
add_instruction
add_parameter
create_module.

c4b6469a

31 Mar, 2022 1 commit
- Change the doc to mention only gpu or ref as targets (#1153) · c59f4079
  Umang Yadav authored Mar 31, 2022
```
Documentation update for valid targets
```
  c59f4079
29 Mar, 2022 1 commit

Refactor runtime compiled kernels to use the same compile_ops pipeline (#1125) · 661046c6

Paul Fultz II authored Mar 29, 2022

This adds the infrastructure so we can compile everything in parallel, whereas before only pointwise kernels were compiled in parallel. This will also directly integrate with lowering and the gpu-driver. The kernels for pointwise and roialign are using this infrastructure. Scatternd is not since it does require standard shape.

This also makes it easier to add new runtime compiled kernels in the future.

661046c6

28 Mar, 2022 2 commits

Use ifdef instead of comment for the auto-generated method declarations for... · 8e4d622f

Paul Fultz II authored Mar 28, 2022

Use ifdef instead of comment for the auto-generated method declarations for type erased classes (#1138)

It seems the formatting of comments are unreadable for larger methods, so instead just generate a struct with the methods in the interface and add a comment if its optional. It wraps this in #ifdef TYPE_ERASED_DECLARATION(assuming this would never be defined) instead of #if 0, so most editors can still provide syntax highlighting(although I think vscode with clangd will still gray it out unfortunately).

8e4d622f

Use ccache for runtime compilation (#1131) · ad056b1f
Paul Fultz II authored Mar 28, 2022
```
* Use ccache for runtime compilation
```
ad056b1f

25 Mar, 2022 1 commit
- Improve handling of string literals in value class (#1141) · c73c0dae
  Paul Fultz II authored Mar 25, 2022
```
* Handle string literal in construction
* Improve get_default with vector
```
  c73c0dae
24 Mar, 2022 1 commit
- Add initial experimental custom op (#1109) · 251cdd74
  Paul Fultz II authored Mar 24, 2022
```
This creates a custom op which has name() and compute_shape() methods. 
```
  251cdd74
22 Mar, 2022 1 commit
- Remove borrowed lifetime from operators that are no longer borrowing their lifetime (#1134) · cd165ebd
  Paul Fultz II authored Mar 22, 2022
```
Operators using arg.reshape() method the lifetime will be extended.
```
  cd165ebd
21 Mar, 2022 1 commit
- Lp normalization op (#1129) · 03225b57
  Charlie Lin authored Mar 21, 2022
```
* LpNormalization ONNX parser
```
  03225b57
18 Mar, 2022 2 commits

Complete GPU implementation of CumSum op (#1094) · 548783c8

turneram authored Mar 18, 2022

Add exclusive and reverse modes to gpu implementation of prefix_scan_sum, which completes support for ONNX op CumSum

548783c8

Make get_context experimental (#1137) · e521fa3f

Paul Fultz II authored Mar 18, 2022

The get_context may change in the future(when we support multi-targets) so make this experimental for now.

e521fa3f

15 Mar, 2022 2 commits

Expose APIs for the MIGraphX program (#1093) · 64e79a94

Umang Yadav authored Mar 15, 2022

API includes following
create_module,
get_main_module
add_instruction without module args
add_instruction with module args
add_parameter
add_return

64e79a94

Add iterators to kernels tensor_view and fix roialign to work with non-standard shape (#1126) · 31e63991

Paul Fultz II authored Mar 15, 2022

This adds iterators to tensor_view, which can allow kernels to work with non-standard shapes like for roialign.

To improve the performance of indexing when using the iterators, the shape class was updated to use integral_constants since the compiler doesn't always fold the const values. An integral_constant will at least enforce that in the AST.

Finally, since index calculations with single integers are improved, I also updated pointwise to use single index rather than multi index. There is about 4% improvement in some cases.

31e63991

14 Mar, 2022 2 commits
- Increase max groups in kernel (#1120) · d353641d
  Shucai Xiao authored Mar 14, 2022
```
change max number of groups in a kernel to 1B for greater performance
```
  d353641d
- Show the operator fields in the driver (#1103) · 9077db18
  Paul Fultz II authored Mar 14, 2022
```
* Show the operator fields in the driver
```
  9077db18
11 Mar, 2022 1 commit

Improve print ins (#1096) · b3b44f5d

Shucai Xiao authored Mar 11, 2022

The module::debug_print(ins) is very slow, which makes the trave_eval==1/2 very slow. The reason is printing an ins involves search the whole module to get the instruction, the print it.  This change is to fix that by calling module::print() to get names of all instructions of a program, then print the instruction by getting its name from a hash map.

b3b44f5d

09 Mar, 2022 3 commits
- Celu ONNX parser and tests (#1114) · 5b37c53c
  Charlie Lin authored Mar 09, 2022
```
Add Celu ONNX operator
```
  5b37c53c
- Add python API to construct shape class (#1128) · 4467c158
  Paul Fultz II authored Mar 09, 2022
```
Add python API to construct shape class
```
  4467c158
- Expose context in C++ API (#1118) · 0e6bd17c
  kahmed10 authored Mar 09, 2022
```
Add a callable C++ API to migraphx
```
  0e6bd17c
08 Mar, 2022 1 commit
- Size ONNX op (#1122) · d71a7b6a
  Charlie Lin authored Mar 08, 2022
```
* Implement size ONNX operator and tests
```
  d71a7b6a
07 Mar, 2022 1 commit
- Use `add_common_op` for handling types and broadcast in Clip Onnx parsing (#1121) · a0ae2f79
  Umang Yadav authored Mar 07, 2022
```
add_common_op for parse_clip
Should fix #1119
```
  a0ae2f79
04 Mar, 2022 2 commits

EyeLike Operator (#1087) · 8f184d4a
Charlie Lin authored Mar 04, 2022
```
Adds EyeLike ONNX parser and unit tests.
```
8f184d4a

Mode as enum for pooling and roi_align (#1091) · a2e90b5d

bpickrel authored Mar 04, 2022

Changed the pooling values for two structures from strings to specialized enum classes. Many test and operator parsing changes to support this. Introduces one new source file, op_enums.cpp.

a2e90b5d

03 Mar, 2022 3 commits
- Boost the max number of workgroups for pointwise ops (#1113) · d9d17a11
  Paul Fultz II authored Mar 03, 2022
```
Boost the max number of workgroups for pointwise ops by matching what we are doing in launch.hpp
```
  d9d17a11
- Use fp32 compute_type when calling rocBLAS API (#1085) · 36b01ba5
  kahmed10 authored Mar 03, 2022
```
better performance doing it this way
```
  36b01ba5
- Add ScatterND operator (#1074) · 832f28c6
  turneram authored Mar 02, 2022
```
Add onnx parser and ref and gpu implementations of ONNX op ScatterND
```
  832f28c6
02 Mar, 2022 2 commits
- isnan operator (#1100) · bfedcd45
  Charlie Lin authored Mar 02, 2022
```
Implements the IsNaN operator, ref, gpu, and onnx parser.
```
  bfedcd45
- Clang format ver10 (#1106) · 9852aaef
  bpickrel authored Mar 02, 2022
```
Update the base version of clang-format from 5.0 to 10.0
```
  9852aaef
25 Feb, 2022 3 commits
- Add with_type to shape class (#1102) · 85b0563c
  Paul Fultz II authored Feb 25, 2022
```
Add with_type to shape class
```
  85b0563c
- Add reverse lookup of c++ class to c class (#1099) · 40c087bd
  Paul Fultz II authored Feb 25, 2022
```
Needed for custom_op so we can generically convert the C type back to the C++ type in the function pointer.
```
  40c087bd
- Add get_queue to context to get the current stream (#1097) · e5242676
  Paul Fultz II authored Feb 24, 2022
```
wrapped in a any_ptr class so the type can be checked at runtime for a mismatch.
```
  e5242676
24 Feb, 2022 1 commit

Some cmake fixes and updates (#1088) · cd0a4aa5

Paul Fultz II authored Feb 23, 2022

Make doc/CMakeLists.txt standalone
Switch to use rocm-cmake modules for document generation
Add CONFIGURE_DEPENDS to file(GLOB) so it will update without an explicit cmake run
Add STRINGS property for build type to make it easier to switch build types with ccmake
Various fixes and improvements

cd0a4aa5

23 Feb, 2022 1 commit

Keep std shape (#1059) · 98dfdf15

Shucai Xiao authored Feb 23, 2022

This PR is the resolve two problems in the issue#999, i.e., non_standard_shape input to reshape and reduce_mean.
Three fixes:

Any operator that has a standard shape requirement will add a contiguous input for its input.
Eliminate_contiguous, when computing whether a contiguous can be removed, we should use all the updated args, not just the one that is being checked.
In two optimization in the simplify_reshape, we remove the contiguous in the reshaper name list, since eliminate_contiguous will remove the contiguous if it can be removed.
the solution is add an attribute to the operator that requires standard input shape, then in the auto_contiguous pass, add a contiguous to every input of such operators.

98dfdf15

16 Feb, 2022 2 commits
- Support nonstandard shapes for the UnSqueeze Op (#1071) · 4480eb79
  Umang Yadav authored Feb 16, 2022
```
Support nonstandard shapes like slice, broadcast and transpose for the unsqueeze op
```
  4480eb79
- Add assign_to method for C++ API (#1075) · ecb1545c
  kahmed10 authored Feb 16, 2022
  
  ecb1545c
11 Feb, 2022 1 commit
- Fix hang with CSE pass when using submodules (#1050) · 48585bad
  kahmed10 authored Feb 11, 2022
```
* add submodule test
* remove for loop
* simplify reshape test
```
  48585bad
09 Feb, 2022 2 commits
- Enable pointwise fusion by default (#1082) · c7419a9c
  Paul Fultz II authored Feb 09, 2022
```
There is now a MIGRAPHX_DISABLE_POINTWISE_FUSION to disable it
```
  c7419a9c
- Support nonstandard shapes for the Squeeze Op (#1068) · e64b773f
  Umang Yadav authored Feb 09, 2022
```
Support slice, broadcast and transpose shapes for the squeeze op.
```
  e64b773f
08 Feb, 2022 2 commits

Add missing output_alias to miopen_fusion op (#1076) · b304d97d

Paul Fultz II authored Feb 08, 2022

This causes incorrect memory coloring, which was causing the accuracy failures in the vision model when enabling the pointwise fusions. Resnet50, inceptionv3 and inceptionv4 do verify now in the driver.

b304d97d

Enforce types to avoid compilation error in pointwise fusions (#1077) · 73b8a773
Paul Fultz II authored Feb 08, 2022
```
Enforce types to avoid compilation error in pointwise fusions
This fixes compile failure: gpt-2, fp16 on Navi
```
73b8a773