Commits · 43c7d307918074984152cbb6cf579448ac4a7a14 · gaoqiong / MIGraphX

07 Feb, 2022 1 commit
- Add fuse mlir · 43c7d307
  Paul authored Feb 07, 2022
  
  43c7d307
05 Feb, 2022 2 commits
- Format · cea63d59
  Paul authored Feb 04, 2022
  
  cea63d59
- Add fuse_mlir pass · 6e7c8be8
  Paul authored Feb 04, 2022
  
  6e7c8be8
02 Feb, 2022 1 commit

Update trace_eval to preview the output buffers (#1073) · b20e3d4d

Paul Fultz II authored Feb 02, 2022

Currently, MIGRAPHX_TRACE_EVAL=2 prints out the entire output buffer, but this can produce a lot of output. To make it easier to inspect and debug, using MIGRAPHX_TRACE_EVAL=2 now only prints 10 elements from the buffer(the first 5 and last 5) and shows any fp classifications found in the buffer(ie nans, infinity, etc). The previous behavior can still be enabled with MIGRAPHX_TRACE_EVAL=3.

b20e3d4d

31 Jan, 2022 1 commit
- Parse upsample (#1060) · 7e7ef0b8
  Shucai Xiao authored Jan 31, 2022
```
* use the parse_resize to parse the upsample operator
```
  7e7ef0b8
28 Jan, 2022 2 commits

Add auto-vectorization of pointwise operators (#1047) · 78a3c9b7

Paul Fultz II authored Jan 28, 2022

* Enable auto vectorization
* Handle vector types with convert function
* Dont vectorize when it will cause problems with preload

78a3c9b7

Add Mean op ONNX parser (#1065) · b7218806

turneram authored Jan 28, 2022

* Add mean op onnx parser and unit tests
* Refactor parse_mean to use add_broadcastable_binary_op

b7218806

27 Jan, 2022 1 commit
- Remove Standard Shape requirement for ArgOps (#1042) · 332cb710
  Umang Yadav authored Jan 27, 2022
```
allow nonstd shape for the arg ops, non-standard shapes include broadcast, slice and transpose
```
  332cb710
26 Jan, 2022 2 commits

Add HardSwish op ONNX parser (#1066) · 7477aeb8

turneram authored Jan 26, 2022

Add HardSwish to HardSigmoid parser

HardSwish formula is y = x * HardSigmoid<alpha=1/6, beta=0.5>(x)
HardSigmoid parser sets alpha to 1/6 and adds the mul instruction if op name is HardSwish

Resolves #1062

7477aeb8

Updates · 1cc6c88c
Paul authored Jan 25, 2022

1cc6c88c

21 Jan, 2022 4 commits
- GreaterOrEqual ONNX parser (#1044) · 60aa1c85
  turneram authored Jan 21, 2022
```
Add onnx parser for operator GreaterOrEqual
```
  60aa1c85
- SoftSign ONNX parser (#1046) · ebb15dd3
  turneram authored Jan 21, 2022
```
Add onnx parser and unit tests for Softsign
```
  ebb15dd3
- SoftPlus ONNX parser (#1045) · 4c90e9a3
  turneram authored Jan 20, 2022
```
* Add onnx parser and unit test
```
  4c90e9a3
- Improve handling of generator expressions when getting the flags for hip (#1055) · 3f392a3b
  Paul Fultz II authored Jan 20, 2022
```
* Improve handling of generator expressions when getting the flags for hip
```
  3f392a3b
17 Jan, 2022 1 commit
- Make clip a pointwise op (#1043) · b0ece214
  Paul Fultz II authored Jan 17, 2022
```
Make clip a pointwise op
```
  b0ece214
11 Jan, 2022 1 commit

HardSigmoid ONNX parser (#1040) · fc42d852

turneram authored Jan 11, 2022

Add HardSigmoid onnx parser and unit tests
Produces mathematical equivalent to ONNX operator through combination of existing pointwise ops.
Resolves #1028

fc42d852

10 Jan, 2022 3 commits
- Handle miopen fusions when using pointwise fusions (#1019) · 534a05c1
  Paul Fultz II authored Jan 10, 2022
```
* Add matcher for conv_bias pointwise
* Add fusion op
```
  534a05c1
- Format · 467a7cb8
  Paul authored Jan 09, 2022
  
  467a7cb8
- Fix output arg · 88f549e2
  Paul authored Jan 09, 2022
  
  88f549e2
07 Jan, 2022 2 commits
- Formatting · b7aa8f2a
  Paul authored Jan 06, 2022
  
  b7aa8f2a
- Fix device name · a652e90c
  Paul authored Jan 06, 2022
  
  a652e90c
06 Jan, 2022 3 commits
- Format · 13418e23
  Paul authored Jan 05, 2022
  
  13418e23
- Set kernal name · 4ba8706f
  Paul authored Jan 05, 2022
  
  4ba8706f
- Disable eliminate_data_type · eda8df70
  Paul authored Jan 05, 2022
  
  eda8df70
05 Jan, 2022 1 commit
- Fix time seed bug in random sequence ops (#1027) · 594f2802
  turneram authored Jan 05, 2022
```
Fix bug caused by casting time seed to float
```
  594f2802
11 Dec, 2021 7 commits
- Enable pointwise_fusion · 8a251fec
  Paul authored Dec 10, 2021
  
  8a251fec
- Formatting · d0feb6b4
  Paul authored Dec 10, 2021
  
  d0feb6b4
- Add mlir verification · c83ee9f8
  Paul authored Dec 10, 2021
  
  c83ee9f8
- Format · e2967e04
  Paul authored Dec 10, 2021
  
  e2967e04
- Add code to insert memrefs · df3749cd
  Paul authored Dec 10, 2021
  
  df3749cd
- Format · 60ab44c7
  Paul authored Dec 10, 2021
  
  60ab44c7
- Dont provide output for return instruction · 2c952efd
  Paul authored Dec 10, 2021
  
  2c952efd
09 Dec, 2021 2 commits

Softmax perf optimization (#1014) · 2e337c7f

Shucai Xiao authored Dec 09, 2021

Changed the number of threads in a block from 256 to 128
Increased the max number of blocks in the kernel from 256 to 1M.
For the case that the axis is the last dimension, we removed the computation of index since it is not required.

With these change, we can get about 2x speedup compared to the develop branch for the softmax op used in the BertSquad model.

2e337c7f

Fuse last instruction in fuse_pointwise (#1015) · e758d457
Paul Fultz II authored Dec 09, 2021
```
Fuse last instruction in fuse_pointwise
This is also fixes a bug with using an invalid iterator.
```
e758d457

08 Dec, 2021 1 commit
- Fuse convert ops (#1020) · 00bfed4d
  Paul Fultz II authored Dec 08, 2021
  
  00bfed4d
07 Dec, 2021 1 commit
- Rename reduce_inputs to virtual_inputs (#1021) · 1793cc54
  Paul Fultz II authored Dec 07, 2021
```
simple variable rename
```
  1793cc54
02 Dec, 2021 1 commit
- Fix pointwise compile error with half sqrt (#1010) · 7b3e58a0
  Paul Fultz II authored Dec 02, 2021
```
Fix pointwise compile error with half sqrt 
```
  7b3e58a0
01 Dec, 2021 3 commits
- Handle unsinged integers · b406a418
  Paul authored Dec 01, 2021
  
  b406a418
- Register dialect · 1851e975
  Paul authored Dec 01, 2021
  
  1851e975
- Format · e6f8a2cf
  Paul authored Dec 01, 2021
  
  e6f8a2cf