- 22 Jun, 2023 1 commit
Zhuoran Yin authored
Add mlir quant_dot operator support
- 20 Jun, 2023 1 commit
github-actions[bot] authored
Co-authored-by: causten <causten@users.noreply.github.com>
Co-authored-by: Ted Themistokleous <107195283+TedThemistokleous@users.noreply.github.com>
- 17 Jun, 2023 1 commit
Umang Yadav authored
* Fix convert for the NaNs
* NaNs can't be compared; use std::isnan() (see the sketch below)
* formatting
* formatting
* formatting
* add extra tests
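A minimal standalone illustration of the point above (not the PR's code): every ordered comparison involving NaN is false, so only std::isnan() can detect one.

```cpp
#include <cmath>
#include <iostream>

int main()
{
    double x = std::nan("");
    // x == x is false for NaN, so equality tests can never identify it.
    std::cout << (x == x) << "\n";      // prints 0
    std::cout << std::isnan(x) << "\n"; // prints 1
}
```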
- 16 Jun, 2023 1 commit
Charlie Lin authored
* initial
* Added tests and new functionality
* Update optimals handling
* Simplify conditionals
* Ref test, update docs
* Remove comment, suggestion unclear

Co-authored-by: Umang Yadav <29876643+umangyadav@users.noreply.github.com>
- 15 Jun, 2023 1 commit
Brian Pickrell authored
* Fix parse_instancenorm to create broadcast and multibroadcast instructions with two dynamic shape arguments instead of one. Their make_op() functions don't support dynamic shapes when called with one input. This caused an error when parsing an ONNX 3D U-Net model.
* Use add_common_op() to create the multibroadcast op.
* Add verification and parsing test for instance_norm with dynamic input. Parse test doesn't pass.
* Fix for test; still doesn't pass.
* Another fix for test; still doesn't pass.
* Work in progress: instance_norm_dyn_batch_test works but instance_norm_test doesn't.
* Fix onnx instancenorm tests to match parser changes. Passes all check tests.
* Updated comments explaining usage of add_common_op().
* Hand-merged conflicts with develop.
* Fix instance_norm_half_test after merge.
* Add ONNX test instance_norm_dyn_batch_half_test.
* Add shape test cases broadcast_1in_dyn_error and multibroadcast_1in_dyn_error_0.
- 13 Jun, 2023 1 commit
Charlie Lin authored
- 12 Jun, 2023 1 commit
Paul Fultz II authored
- 05 Jun, 2023 1 commit
Charlie Lin authored
Changed the doc for find_permutation(shape) to make it clearer that it finds the permutation that would make the shape standard.
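For illustration, a simplified stand-in for the documented behavior (not MIGraphX's actual implementation): the permutation orders the axes by descending stride, which is the order a standard (packed, row-major) shape would have.

```cpp
#include <algorithm>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical sketch: return the axis permutation that would order the
// strides descending, i.e. the permutation that makes the shape standard.
std::vector<std::size_t> find_permutation_sketch(const std::vector<std::size_t>& strides)
{
    std::vector<std::size_t> perm(strides.size());
    std::iota(perm.begin(), perm.end(), 0);
    std::stable_sort(perm.begin(), perm.end(), [&](std::size_t a, std::size_t b) {
        return strides[a] > strides[b];
    });
    return perm;
}
```

For example, a transposed 2-D shape with lens {3, 2} and strides {1, 3} gives perm {1, 0}; permuting the axes yields lens {2, 3} with strides {3, 1}, which is standard.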
- 02 Jun, 2023 1 commit
Chris Austen authored
- 01 Jun, 2023 1 commit
Umang Yadav authored
By converting to fp32, the fp16 3d-unet model's accuracy comes out the same as the fp32 accuracy. By using the reduce_sum method in fp16, accuracy comes out ~0.9% lower compared to fp32 while keeping the entire model in fp16.
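A small illustration of the underlying narrow-accumulator effect, one step up the precision ladder (float standing in for fp16, double for fp32); not the model code:

```cpp
#include <iostream>

int main()
{
    float  sum_f = 0.0f;
    double sum_d = 0.0;
    for(long i = 0; i < 100000000; ++i)
    {
        // Once sum_f reaches 2^21, the ulp of a float exceeds 2 * 0.1, so the
        // additions round away to nothing and accumulation stalls at ~2.097e6.
        sum_f += 0.1f;
        sum_d += 0.1f; // the wider accumulator keeps the low-order bits
    }
    std::cout << sum_f << " vs " << sum_d << "\n"; // ~2.09715e6 vs ~1e7
}
```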
- 31 May, 2023 1 commit
Umang Yadav authored
Partially solves #1656. This PR only handles the compilation part of multitarget support.
- 30 May, 2023 1 commit
Paul Fultz II authored
* Use generate_argument instead of generate_literal for python output, as generate_literal doesn't exist
* Shorten the names for variables from the main module
* Use the prefix p_ for parameters
* Use the shorter variable m for the main module in python
- 25 May, 2023 1 commit
Ted Themistokleous authored
Use std::numeric_limits::min/max() functions plus the appropriate value to encode -inf/inf
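A minimal sketch of the encoding idea (hypothetical helper names, not the PR's code). Note that for floating-point types std::numeric_limits<T>::min() is the smallest positive value, so lowest() is the appropriate counterpart of max() for the negative end.

```cpp
#include <limits>

// Hypothetical helpers: stand in for -inf/inf with the most extreme
// finite values representable in T.
template <class T>
constexpr T encoded_neg_infinity()
{
    return std::numeric_limits<T>::lowest(); // not min(): for floats, min()
                                             // is the smallest positive value
}

template <class T>
constexpr T encoded_pos_infinity()
{
    return std::numeric_limits<T>::max();
}
```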
- 20 May, 2023 1 commit
Umang Yadav authored
* use half hip functions to compute max and min
* add verify test for min and max
- 19 May, 2023 1 commit
Zhuoran Yin authored
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
- 17 May, 2023 2 commits
Chris Austen authored
Move CI to support the rocm5.5 release
shivadbhavsar authored
Adds support for broadcasted scalars to the unsqueeze op. Specifying steps other than 1 is disallowed in this implementation since we want the output to always be a tensor. We could support varying step sizes if we allowed a broadcasted scalar output from this op.
- 11 May, 2023 1 commit
github-actions[bot] authored
Co-authored-by: causten <causten@users.noreply.github.com>
- 05 May, 2023 1 commit
Charlie Lin authored
Python API with documentation updates
- 04 May, 2023 2 commits
Paul Fultz II authored
When either the input or the output is multiplied across the K dimension, the multiplier can be applied to the constant weights, which can then be folded with propagate_const.
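The identity being exploited, checked numerically in a tiny 1-D stand-in (illustrative only, not the pass's code): scaling the output of a convolution per output channel k equals convolving with weights whose channel k is scaled, so the multiplier migrates onto the constant weights.

```cpp
#include <cassert>
#include <cmath>

int main()
{
    const int K = 2, C = 2, N = 5, F = 3, O = N - F + 1;
    double x[C][N]    = {{1, 2, 3, 4, 5}, {5, 4, 3, 2, 1}};
    double w[K][C][F] = {{{1, 0, -1}, {2, 1, 0}}, {{0, 1, 0}, {-1, 0, 1}}};
    double a[K]       = {0.5, 3.0}; // per-output-channel multiplier

    for(int k = 0; k < K; ++k)
        for(int i = 0; i < O; ++i)
        {
            double y = 0, y_folded = 0;
            for(int c = 0; c < C; ++c)
                for(int j = 0; j < F; ++j)
                {
                    y        += w[k][c][j] * x[c][i + j];
                    // fold the multiplier into the (constant) weights
                    y_folded += (a[k] * w[k][c][j]) * x[c][i + j];
                }
            // a[k] * conv(x, w)[k] == conv(x, a[k] * w[k])
            assert(std::fabs(a[k] * y - y_folded) < 1e-12);
        }
}
```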
Zhuoran Yin authored
* Exposed the mlir_enabled() call to decide the lowering pipeline's enablement
* Disabled the rewrite quantization pipeline in mlir compilation
* Added quant convolution as an anchor op
* Fixed the return type expectations
* Added the fallback hip implementation for quantizelinear and dequantizelinear
* Will need advice to improve the implementation for quantizelinear
- 03 May, 2023 1 commit
Charlie Lin authored
* Relies on "Removed split_single_dyn_dim compile flag" #1711
* Exposes dynamic_dimension as an opaque object with dynamic_dimensions and optimals
* Exposes ONNX dyn_input_dims and default_dyn_dim to run with a dynamic batch
* Updates api.py to be able to create objects from aggregate initialization (used for dynamic_dimension)
* Uses offload copy for now
- 02 May, 2023 1 commit
Paul Fultz II authored
Improves the constant propagation for bert models. Larger batch sizes no longer use constants as large. Also improves the speed of model compilation.
- 28 Apr, 2023 1 commit
Charlie Lin authored
- 24 Apr, 2023 2 commits
Paul Fultz II authored
This fixes #1700
Paul Fultz II authored
- 20 Apr, 2023 1 commit
Umang Yadav authored
Solves #1311
- 19 Apr, 2023 1 commit
shivadbhavsar authored
Expose the get_shape and get_operator methods for the instruction_ref object in the python API.
- 18 Apr, 2023 1 commit
Ted Themistokleous authored
Ensure that we don't have empty inputs when computing the shape for a pointwise function.
- 17 Apr, 2023 2 commits
Charlie Lin authored
Fixes the above behavior. This needs to be changed to allow setting static shapes with map_dyn_input_dims, since you cannot also use map_input_dims.
shivadbhavsar authored
Exposes the shape::type_t values to be used by the python api; this is required by torch_migraphx to support torchbench models.
- 13 Apr, 2023 1 commit
Zhuoran Yin authored
- 11 Apr, 2023 1 commit
github-actions[bot] authored
- 10 Apr, 2023 2 commits
Umang Yadav authored
Charlie Lin authored
Adds a matcher to split_single_dyn_dim that finds all broadcast or multibroadcast instructions with two static shape inputs and replaces each with the one-input version. Sorts the get_output_parameters() list to ensure the correct ordering (was getting an error for some models).
- 07 Apr, 2023 1 commit
Paul Fultz II authored
Converts can be inserted when the scales and input differ in the onnx file (we are already doing this implicit conversion in the ref implementation). This will also improve the compile time of quantizelinear.hpp since we can remove the nested visit method.
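For reference, a sketch of the ONNX QuantizeLinear semantics involved (simplified to int8; not MIGraphX's code). Bringing x to the scale's type before the division corresponds to the convert the pass now inserts explicitly.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>

// Simplified QuantizeLinear: y = saturate(round(x / scale) + zero_point).
std::int8_t quantize_linear(float x, float scale, std::int8_t zero_point)
{
    float q = std::nearbyint(x / scale) + zero_point; // round half to even
    return static_cast<std::int8_t>(std::clamp(q, -128.0f, 127.0f));
}
```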
- 06 Apr, 2023 2 commits
Charlie Lin authored
Examples:
bin/driver verify /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --batch 3 --dyn-input-dim @data "[{min:1, max:4}, 3, 224, 224]"
bin/driver compile /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --default-dyn-dim "{min:1, max:10}" --output resnet50_batch1-10.mxr
bin/driver perf resnet50_batch1-10.mxr --batch 4
Paul Fultz II authored
Automatically fuse multiple reductions and pointwise operations.
- 05 Apr, 2023 1 commit
Paul Fultz II authored
This will replace conv(x + a, w) with conv(x, w) + conv(a, w), where a is a constant, so that conv(a, w) can be replaced with a constant.
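The rewrite rests on convolution being linear in its input; a tiny 1-D check of conv(x + a, w) == conv(x, w) + conv(a, w) (illustrative only, not the pass's code):

```cpp
#include <cassert>
#include <cmath>

int main()
{
    const int N = 6, F = 3, O = N - F + 1;
    double x[N] = {1, -2, 3, 0, 2, 1};
    double a[N] = {0.5, 0.5, 0.5, 0.5, 0.5, 0.5}; // constant addend
    double w[F] = {1, 2, -1};

    for(int i = 0; i < O; ++i)
    {
        double lhs = 0, conv_x = 0, conv_a = 0;
        for(int j = 0; j < F; ++j)
        {
            lhs    += (x[i + j] + a[i + j]) * w[j];
            conv_x += x[i + j] * w[j];
            conv_a += a[i + j] * w[j]; // becomes a constant once a is constant
        }
        assert(std::fabs(lhs - (conv_x + conv_a)) < 1e-12);
    }
}
```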
- 04 Apr, 2023 1 commit
shivadbhavsar authored
Bug found due to a failing torch benchmark. Added a test case to reproduce the issue causing the model to error out on compile. The original logic results in the following error: AMDMIGraphX/src/include/migraphx/op/unsqueeze.hpp:128: normalize_compute_shape: UNSQUEEZE: Axis dimenstion is not divisible by step