Commits · 7f705c1deb93f4353d72a8a8aecc568957bc719d · gaoqiong / MIGraphX

17 Apr, 2023 6 commits

Change up huge tests to use static and random data · 7f705c1d
Ted Themistokleous authored Mar 27, 2023
```
Remove the need to use the GPU; switch this to ref.
Change names to reflect static vs. random data.
```
7f705c1d

Huge test cases that capture error state of NMS with single huge batch size · 5f61e85f

Ted Themistokleous authored Mar 27, 2023

In this case we have a batch size with no bound on the score threshold.
We end up evaluating a single huge batch on its own.

The concern here is that this should run all the way through without completely
stalling or running intractably in a single-threaded fashion, as it currently does.

5f61e85f

Suppress IOU based on box references instead of copying over boxes · 14b20fdf

Ted Themistokleous authored Mar 27, 2023

This saves us two copies of the entire box class per call; the function now
works on references to the objects that are created within the loops.

14b20fdf

Create next_box from next_top_score once during IOU suppression · aa3fba66

Ted Themistokleous authored Mar 27, 2023

We're continually creating and destroying a batch box in the while() check as we
run through the boxes_heap, because batch_box() is called on every iteration.

Make this next_box and only calculate it before we pop that box from the boxes_heap.

This should get rid of the function overhead of constant calls in the case of a large
batch size.

aa3fba66
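
The idea in aa3fba66 can be sketched in Python as follows. This is a minimal illustration and not the MIGraphX implementation: `make_box`, `iou`, and `select_boxes` are hypothetical names, and `make_box` only stands in for the role that `batch_box()` plays in the commit message. The point is that the selected box is built once per pop and then reused for every comparison.

```python
import heapq


def make_box(corners, idx):
    """Hypothetical stand-in for batch_box(): normalize corners to (x1, y1, x2, y2)."""
    xa, ya, xb, yb = corners[idx]
    return (min(xa, xb), min(ya, yb), max(xa, xb), max(ya, yb))


def iou(a, b):
    """Intersection over union of two normalized boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def select_boxes(corners, scores, iou_threshold):
    # heapq is a min-heap, so scores are negated to pop the best box first.
    heap = [(-score, idx) for idx, score in enumerate(scores)]
    heapq.heapify(heap)
    selected = []
    while heap:
        _, top_idx = heapq.heappop(heap)
        # Built once per selected box, not once per comparison.
        next_box = make_box(corners, top_idx)
        selected.append(top_idx)
        # Keep only the candidates that next_box does not suppress.
        heap = [
            entry for entry in heap
            if iou(make_box(corners, entry[1]), next_box) <= iou_threshold
        ]
        heapq.heapify(heap)
    return selected


corners = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(select_boxes(corners, scores, iou_threshold=0.5))
```

With a large batch, the number of comparisons against each selected box grows with the heap size, so rebuilding that box inside the loop condition multiplies the cost.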

Add early return for suppress function based on box area · 7b42f05c

Ted Themistokleous authored Mar 27, 2023

Return early if either box has zero area. Searching for the intersection
and union is logically irrelevant in that case.

7b42f05c
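
A rough Python sketch of the early return described in 7b42f05c, under the assumption that boxes are normalized `(x1, y1, x2, y2)` tuples. The function names are hypothetical and do not mirror the C++ code; the sketch only shows that a zero-area box can be rejected before any intersection or union is computed.

```python
def area(box):
    """Area of a normalized (x1, y1, x2, y2) box; degenerate boxes give 0."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def suppress_by_iou(box_a, box_b, iou_threshold):
    """Return True when box_a overlaps box_b by more than iou_threshold."""
    area_a = area(box_a)
    area_b = area(box_b)
    # Early return: a zero-area box cannot overlap anything, so the
    # intersection and union below would be wasted work.
    if area_a <= 0.0 or area_b <= 0.0:
        return False

    inter_w = min(box_a[2], box_b[2]) - max(box_a[0], box_b[0])
    inter_h = min(box_a[3], box_b[3]) - max(box_a[1], box_b[1])
    if inter_w <= 0.0 or inter_h <= 0.0:
        return False  # disjoint boxes

    intersection = inter_w * inter_h
    union = area_a + area_b - intersection
    return intersection / union > iou_threshold


print(suppress_by_iou((0, 0, 0, 5), (0, 0, 5, 5), 0.1))      # degenerate box
print(suppress_by_iou((0, 0, 10, 10), (1, 1, 11, 11), 0.5))  # heavy overlap
```

The guard also avoids a division by zero when both boxes are degenerate and the union would be 0.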

expose enum datatypes to python api (#1655) · 42685803

shivadbhavsar authored Apr 17, 2023

Expose the shape::type_t values for use by the Python API; this is required by torch_migraphx to support torchbench models.

42685803

13 Apr, 2023 1 commit
- [mlir] Adding quantizelinear, dequantizelinear and quant_convolution support (#1675) · 7b2a5ccf
  Zhuoran Yin authored Apr 13, 2023
  
  7b2a5ccf
12 Apr, 2023 3 commits
- Print out pass name when tracing passes (#1667) · 551b927c
  Paul Fultz II authored Apr 12, 2023
  
  551b927c
- Updates to README (#1671) · ec4b79c2
  Paul Fultz II authored Apr 12, 2023
```
This removes the --cxx flags from the rbuild commands since they are not necessary. Also adds a section about using rbuild to set up an environment for development.
```
  ec4b79c2
- Update workflow to support rocm image overwrite (#1662) · 851f8f3e
  Djordje Petrovic authored Apr 12, 2023
  
  851f8f3e
11 Apr, 2023 3 commits
- Onnxruntime Weekly Sync 2023-04-07 (#1676) · cc8dda73
  github-actions[bot] authored Apr 11, 2023
  
  cc8dda73
- Enable tidy on gpu driver (#1659) · 3385dcc8
  Paul Fultz II authored Apr 11, 2023
  
  3385dcc8
- Update name of github action script (#1624) · 744c6ab7
  Ted Themistokleous authored Apr 11, 2023
  
  744c6ab7
10 Apr, 2023 3 commits
- Always build ref target when building MIGraphX (#1636) · cce35871
  Umang Yadav authored Apr 10, 2023
  
  cce35871
- Fix 2 input broadcast bug for dynamic batch and output parameter ordering (#1669) · d3eb5609
  Charlie Lin authored Apr 10, 2023
```
Adds a matcher to split_single_dyn_dim to find all broadcast or multibroadcast with two static shape inputs and replaces the instruction with the one input version.
Sorts the get_output_parameters() list to ensure the correct ordering. (Was getting an error for some models.)
```
  d3eb5609
- Add dockerignore file (#1661) · 2e754cdd
  Paul Fultz II authored Apr 10, 2023
  
  2e754cdd
09 Apr, 2023 1 commit
- Enable hiprtc by default (#1658) · db6c75e7
  Paul Fultz II authored Apr 09, 2023
```
* Enable hiprtc by default
```
  db6c75e7
07 Apr, 2023 1 commit

Require the same type for the inputs and scales for QuantizeLinear (#1642) · f6e22d56

Paul Fultz II authored Apr 06, 2023

Converts can be inserted when the scales and input differ in the ONNX file (we are already doing this implicit conversion in the ref implementation). This will also improve the compile time of quantizelinear.hpp since we can remove the nested visit method.

f6e22d56
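
As a toy illustration of inserting a convert when the types differ (f6e22d56), the Python sketch below models a program as a list of tuples. Everything here is hypothetical: the `program` and `types` structures, the function name, and in particular the choice to convert the scale to the input's type, since the commit message does not say which side is converted.

```python
def add_quantizelinear(program, types, x, scale):
    """Append a quantizelinear instruction, converting the scale if needed.

    `program` is a list of (op, inputs, output) tuples and `types` maps each
    value name to a type string. Both are hypothetical, for illustration only.
    """
    if types[scale] != types[x]:
        # Assumption: the scale is converted to match the input type.
        converted = scale + "_converted"
        program.append(("convert", (scale,), converted))
        types[converted] = types[x]
        scale = converted
    out = x + "_quantized"
    # By this point both operands share one type, so the operator itself
    # never has to dispatch on a pair of types.
    program.append(("quantizelinear", (x, scale), out))
    return out


types = {"x": "half_type", "scale": "float_type"}
program = []
add_quantizelinear(program, types, "x", "scale")
for instruction in program:
    print(instruction)
```

Handling the mismatch once at parse time is what would let the operator assume a single type for both operands.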

06 Apr, 2023 2 commits

Driver dynamic batch update (#1652) · adccec52

Charlie Lin authored Apr 06, 2023

Examples:

bin/driver verify /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --batch 3 --dyn-input-dim @data "[{min:1, max:4}, 3, 224, 224]"

bin/driver compile /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --default-dyn-dim "{min:1, max:10}" --output resnet50_batch1-10.mxr

bin/driver perf resnet50_batch1-10.mxr --batch 4

adccec52

Add reduction fusion (#1614) · f201285c
Paul Fultz II authored Apr 05, 2023
```
Automatically fuse multiple reductions and pointwise operations.
```
f201285c

05 Apr, 2023 3 commits
- Add MIGRAPHX_VALIDATE_MATCHES env variable to validate each matcher (#1372) · a123cb2e
  Paul Fultz II authored Apr 05, 2023
```
* Add MIGRAPHX_VALIDATE_MATCHES env variable to validate each matcher
```
  a123cb2e
- Optimize add convolution (#1549) · df32040d
  Paul Fultz II authored Apr 05, 2023
```
This will replace conv(x+a, w) with conv(x, w) + conv(a, w) where a is a constant so conv(a, w) can be replaced with a constant.
```
  df32040d
- Add missing header for sles and centos (#1665) · 8beb6680
  Paul Fultz II authored Apr 04, 2023
  
  8beb6680
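
The rewrite in "Optimize add convolution" (df32040d) relies on convolution being linear in its input. The following Python sketch checks the identity on a small case; it assumes a 1-D convolution with no padding and a constant `a` that has the same shape as `x`, and `conv1d` is a hypothetical helper, not a MIGraphX function.

```python
def conv1d(x, w):
    """1-D 'valid' convolution (cross-correlation form, no padding)."""
    n = len(x) - len(w) + 1
    return [sum(x[i + k] * w[k] for k in range(len(w))) for i in range(n)]


x = [1, 2, 3, 4, 5]
a = [10, 20, 30, 40, 50]  # the constant added to the input
w = [1, 0, -1]

# Original form: conv(x + a, w)
lhs = conv1d([xi + ai for xi, ai in zip(x, a)], w)

# Rewritten form: conv(x, w) + conv(a, w), where conv(a, w) depends only
# on constants and can therefore be evaluated ahead of time.
folded_constant = conv1d(a, w)
rhs = [p + q for p, q in zip(conv1d(x, w), folded_constant)]

print(lhs == rhs)
```

Integer inputs are used so the comparison is exact; with floating point the two forms can differ by rounding error.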
04 Apr, 2023 2 commits

fix bug in transpose_slice simplification (#1660) · 30af1697

shivadbhavsar authored Apr 04, 2023

Bug found due to failing torch benchmark. Added test case to reproduce issue causing the model to error out on compile.
Original logic results in the following error:
AMDMIGraphX/src/include/migraphx/op/unsqueeze.hpp:128: normalize_compute_shape: UNSQUEEZE: Axis dimenstion is not divisible by step

30af1697

Refactor dynamic_dimension to have multiple optimals (#1625) · e7ec374f

Charlie Lin authored Apr 04, 2023

Makes the optimals into a std::set<std::size_t>
Changes shape object functions to handle the opts change
Changes convolution, flatten, and pooling so that they no longer calculate the output optimal dimensions and instead return empty opts. This will need to change in the future if we want to support dynamic shapes fully.
Many changes to tests and shape calls with respect to the new optimals

e7ec374f

03 Apr, 2023 2 commits

fix stable diffusion decoder non standard shape issue (#1594) · 1329b9be
shivadbhavsar authored Apr 03, 2023

1329b9be

promote_literals pass (#1593) · e3fb3a0d

Charlie Lin authored Apr 03, 2023

Adds the promote_literals compiler pass that moves literals from the submodules to the main module.
With the eliminate_common_subexpression pass, it will remove copies of literals created during split_single_dyn_dim.
Pass is enabled with the split_single_dyn_dim compile option.

e3fb3a0d

01 Apr, 2023 1 commit
- Enable header tests for FPGA and CPU backend (#1634) · 6a0a5ffe
  Umang Yadav authored Apr 01, 2023
  
  6a0a5ffe
31 Mar, 2023 1 commit

Split single dynamic dimension compiler pass (#1580) · e9e3eacc

Charlie Lin authored Mar 30, 2023

Adds a new GPU compiler pass, split_single_dyn_dim, that handles the case where one input parameter has a single non-fixed dynamic_dimension.
This commonly occurs for dynamic batch or BERT sequence length.
Splits the dynamic shape into several submodules with static input parameters to handle all of the cases in the dynamic_dimension range.
Essentially does what I manually did for the select_module verify tests.
Adds a compile option split_single_dyn_dim that toggles the pass on/off. Defaults to false.
Updates verify_program.hpp and run_verify.cpp to allow the tests to change the compile_options.

e9e3eacc
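
The splitting step described for split_single_dyn_dim (e9e3eacc) can be pictured with a small Python sketch. It is not the pass itself: the function below is a hypothetical model in which a dynamic dimension is written as a `(min, max)` tuple, and each "submodule" is reduced to the static shape it would be compiled for.

```python
def split_single_dyn_dim(dims):
    """Enumerate static shapes for a shape with exactly one dynamic dimension.

    `dims` mixes ints (fixed sizes) with one (min, max) tuple (dynamic size).
    Returns {size: static_shape}, or None when the pattern does not apply.
    """
    dynamic_axes = [i for i, d in enumerate(dims) if isinstance(d, tuple)]
    if len(dynamic_axes) != 1:
        return None  # zero or several dynamic dimensions: nothing to split
    axis = dynamic_axes[0]
    low, high = dims[axis]
    # One static shape per value in the dynamic range, keyed by that value
    # so the matching variant can be looked up at run time.
    return {
        size: dims[:axis] + [size] + dims[axis + 1:]
        for size in range(low, high + 1)
    }


variants = split_single_dyn_dim([(1, 4), 3, 224, 224])
for size, shape in variants.items():
    print(size, shape)
```

At run time, the actual batch size selects one entry, which is the role the commit message assigns to the submodules.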

30 Mar, 2023 1 commit
- Enable parallel compilation with hiprtc (#1647) · 32b9fd08
  Paul Fultz II authored Mar 30, 2023
```
* Add hiprtc driver
```
  32b9fd08
29 Mar, 2023 2 commits
- Fix bug when concatting with the vectorization axis (#1653) · b1506c73
  Paul Fultz II authored Mar 29, 2023
  
  b1506c73
- Add organization for history workflow (#1648) · 7d26eb9d
  Pavle Jacovic authored Mar 29, 2023
  
  7d26eb9d
28 Mar, 2023 2 commits
- Add model timeout (#1649) · ab76c2fa
  Pavle Jacovic authored Mar 28, 2023
  
  ab76c2fa
- Remove version name from check_context (#1639) · 49fc6138
  Umang Yadav authored Mar 28, 2023
```
* Remove version from check_context and bump program version
```
  49fc6138
27 Mar, 2023 1 commit

[MLIR] add dot offloads with manual tuning support (#1631) · 7c4dc99a

Manupa Karunaratne authored Mar 27, 2023

* [MLIR] add dot offloads with manual tuning support
* This commit adds dot + pointwise fusion support along with manual tuning using rocMLIR.

7c4dc99a

26 Mar, 2023 1 commit
- enable onnx 1.12 to run (#1431) · c614588b
  Chris Austen authored Mar 26, 2023
  
  c614588b
25 Mar, 2023 1 commit
- remove /opt/rocm (#1623) · 018e5318
  Umang Yadav authored Mar 24, 2023
```
Co-authored-by: Chris Austen <causten@users.noreply.github.com>
```
  018e5318
24 Mar, 2023 1 commit

Add Additional flags to accuracy_checker.py (#1637) · 6c8b978d

Ted Themistokleous authored Mar 24, 2023

Useful to get more insight into Onnxruntime. Allows us to reuse the accuracy checker code while also capturing Execution Provider output with the --ort_run and --ort_logging flags.

Also added the --target flag to allow us to force a specific target for the accuracy checking. Originally this defaulted to the GPU. This now allows us to use ref, fpga, etc. to quickly change targets.

6c8b978d

22 Mar, 2023 1 commit
- Use version number as part of internal namespace symbol (#1633) · 09aaa63e
  Umang Yadav authored Mar 21, 2023
```
Prevents dynamically loading a target library that was not compiled with the same version of the MIGraphX core lib.
```
  09aaa63e
21 Mar, 2023 1 commit

select_module refactor (#1615) · 94a7f6ee

Charlie Lin authored Mar 21, 2023

Refactor to have select_module use output parameters
Disable select_module verify tests on cpu

94a7f6ee