Commits · rocm-5.6.1 · gaoqiong / MIGraphX

14 Jun, 2023 2 commits
- Revert "Revert "Handle broadcasts across dot and concat (#1689) (#1731)"" (#1837) · 17c03174
  Umang Yadav authored Jun 14, 2023
  
  17c03174
- Merge pull request #1814 from ROCmSoftwarePlatform/revert_1689_handle_broadcasts_dot_concat · 7b82fc39
  Umang Yadav authored Jun 14, 2023
```
Revert "Handle broadcasts across dot and concat (#1689) (#1731)"
```
  7b82fc39
06 Jun, 2023 2 commits
- Convert Fp16 instance-norm to FP32 temporarily (#1779) (#1799) · 20b423e0
  Chris Austen authored Jun 06, 2023
```
* Convert Fp16 instance-norm to FP32 temporarily (#1779)
* Conditionally enable GeLU approximation  (#1810)
```
  20b423e0
- Revert "Handle broadcasts across dot and concat (#1689) (#1731)" · af6d2ec9
  Ted Themistokleous authored Jun 06, 2023
```
This reverts commit a46f378e.
```
  af6d2ec9
25 May, 2023 2 commits

Update cpp generator to handle inf from float (#1758) (#1781) · 6a7de283

Chris Austen authored May 25, 2023



Use std::numeric_limits::min/max() functions plus the appropriate value to encode -inf/inf
Co-authored-by: Ted Themistokleous <107195283+TedThemistokleous@users.noreply.github.com>

6a7de283

Documentation updates for 5.6 (#1773) · f9ddbc8e

Chris Austen authored May 25, 2023

* Use action to free space which uses apt  remove to remove all the dependencies as well (#1756)
* Docsupdate (#1748)
* adjust docker files to support new rocm 5.5 (#1729)
* update to v0.11.0 of rocm-docs-core (#1763)

f9ddbc8e

06 May, 2023 1 commit

Rocm56 dynbatch (#1737) · 2e128d9d

Chris Austen authored May 06, 2023



* Removed split_single_dyn_dim compile flag (#1711)
* Update C/C++ API for dynamic batch (#1712)
* Python API update for dynamic batch (#1723)
* Dynamic batch C++ API example #1728
* Optimize file space of github runners (#1743)
Co-authored-by: Charlie Lin <charlie.lin@amd.com>

2e128d9d

05 May, 2023 1 commit

Handle broadcasts across dot and concat (#1689) (#1731) · a46f378e

Chris Austen authored May 05, 2023



Improves the constant propagation for bert models. Larger batch size no longer use as large of constants.  Also improves the speed of model compilation
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>

a46f378e

25 Apr, 2023 3 commits

update rocBLAS version check to support 3.0 and above (#1716) · ed6542ee
kahmed10 authored Apr 25, 2023
```
update rocBLAS version check to support 3.0 and above with simplified logic
```
ed6542ee

Bump tensorflow from 2.9.3 to 2.11.1 in /examples/nlp/python_bert_squad (#1646) · b4cba0b8

dependabot[bot] authored Apr 25, 2023

Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.9.3 to 2.11.1.
- [Release notes](https://github.com/tensorflow/tensorflow/releases)
- [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/tensorflow/compare/v2.9.3...v2.11.1)

---
updated-dependencies:
- dependency-name: tensorflow
  dependency-type: direct:production
...

b4cba0b8

Disable hipRTC revert to hipClang (#1714) · eb69b36c
Chris Austen authored Apr 24, 2023

eb69b36c

24 Apr, 2023 3 commits
- Dynamic shape hip::copy_to_gpu and hip::copy_from_gpu (#1694) · 84acaea0
  Charlie Lin authored Apr 24, 2023
```
Updates the hip::copy_to_gpu and hip::copy_from_gpu operators to work with dynamic shapes

Allows for offload_copy to be used with dynamic batch

Changed assert in select_module because the argument might now be smaller with how offload_copy will work with dynamic batch. (maximum buffer size will be used)
```
  84acaea0
- Fix compile failure in reduction fusion of instance norm (#1702) · 08360e83
  Paul Fultz II authored Apr 24, 2023
```
This fixes #1700
```
  08360e83
- Fix incorrect assertion in vec_packed_at (#1704) · 4339af75
  Paul Fultz II authored Apr 23, 2023
  
  4339af75
21 Apr, 2023 1 commit
- disable fusion only but create pointwise modules (#1706) · 2a44dfe9
  Umang Yadav authored Apr 21, 2023
  
  2a44dfe9
20 Apr, 2023 1 commit
- Update multi() to work with non-std shapes (#1690) · 71c8181c
  Umang Yadav authored Apr 19, 2023
```
Solves #1311
```
  71c8181c
19 Apr, 2023 1 commit
- Expose instruction shape and operator through python api (#1696) · f92e7994
  shivadbhavsar authored Apr 19, 2023
```
Expose get_shape and get_operator methods for instruction_ref object in the python API.
```
  f92e7994
18 Apr, 2023 3 commits

Use hash key for docker layer computed from Dockerfile (#1691) · 3e8d7196

Umang Yadav authored Apr 18, 2023

* Use hash for docker layer
* Remove `layer-` prefix. it gets added by action automatically
* Add requirements file to docker key hash

3e8d7196

Add trace flag for propagate_constant (#1686) · 16675681

Paul Fultz II authored Apr 18, 2023

This will show whats being replaced with a constant. This is useful for debugging where a literal comes from.

16675681

Make JIT and pointwise work with zero input args (#1587) · 177eb1b0
Ted Themistokleous authored Apr 17, 2023
```
Ensure that we don't have empty inputs when computing shape for pointwise function
```
177eb1b0

17 Apr, 2023 3 commits
- Add clean up for CI cache and change keys for caching (#1680) · 1a41c9e9
  Umang Yadav authored Apr 17, 2023
```
CI changes to improve github cache management 
```
  1a41c9e9
- Convert a fully fixed map_dyn_input_dims value to a static shape when parsing ONNX (#1682) · c5eee1a3
  Charlie Lin authored Apr 17, 2023
```
Fixes the above behavior
This needs to be changed to allow for setting static shapes with map_dyn_input_dims since you cannot also use map_input_dims
```
  c5eee1a3
- expose enum datatypes to python api (#1655) · 42685803
  shivadbhavsar authored Apr 17, 2023
```
Expose the shape::type_t values to be used by the python api and is required by torch_migraphx to support torchbench models.
```
  42685803
13 Apr, 2023 1 commit
- [mlir] Adding quantizelinear, dequantizelinear and quant_convolution support (#1675) · 7b2a5ccf
  Zhuoran Yin authored Apr 13, 2023
  
  7b2a5ccf
12 Apr, 2023 3 commits
- Print out pass name when tracing passes (#1667) · 551b927c
  Paul Fultz II authored Apr 12, 2023
  
  551b927c
- Updates to README (#1671) · ec4b79c2
  Paul Fultz II authored Apr 12, 2023
```
This removes the --cxx flags from the rbuild commands since it is not necessary. Also added a section about using rbuild to set up an environment for development.
```
  ec4b79c2
- Update workflow to support rocm image overwrite (#1662) · 851f8f3e
  Djordje Petrovic authored Apr 12, 2023
  
  851f8f3e
11 Apr, 2023 3 commits
- Onnxruntime Weekly Sync 2023-04-07 (#1676) · cc8dda73
  github-actions[bot] authored Apr 11, 2023
  
  cc8dda73
- Enable tidy on gpu driver (#1659) · 3385dcc8
  Paul Fultz II authored Apr 11, 2023
  
  3385dcc8
- Update name of github action script (#1624) · 744c6ab7
  Ted Themistokleous authored Apr 11, 2023
  
  744c6ab7
10 Apr, 2023 3 commits
- Always build ref target when building MIGraphX (#1636) · cce35871
  Umang Yadav authored Apr 10, 2023
  
  cce35871
- Fix 2 input broadcast bug for dynamic batch and output parameter ordering (#1669) · d3eb5609
  Charlie Lin authored Apr 10, 2023
```
Adds a matcher to split_single_dyn_dim to find all broadcast or multibroadcast with two static shape inputs and replaces the instruction with the one input version.
Sorts the get_output_parameters() list to ensure the correct ordering. (Was getting an error for some models.)
```
  d3eb5609
- Add dockerignore file (#1661) · 2e754cdd
  Paul Fultz II authored Apr 10, 2023
  
  2e754cdd
09 Apr, 2023 1 commit
- Enable hiprtc by default (#1658) · db6c75e7
  Paul Fultz II authored Apr 09, 2023
```
* Enable hiprtc by default
```
  db6c75e7
07 Apr, 2023 1 commit

Require the same type for the inputs and scales for QuantizeLinear (#1642) · f6e22d56

Paul Fultz II authored Apr 06, 2023

Converts can be inserted when the scales and input differ in the onnx file(we are already doing this implicit conversion in the ref implementation). This will also improve the compile-time of quantizelinear.hpp since we can remove the nested visit method.

f6e22d56

06 Apr, 2023 2 commits

Driver dynamic batch update (#1652) · adccec52

Charlie Lin authored Apr 06, 2023

Examples..

bin/driver verify /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --batch 3 --dyn-input-dim @data "[{min:1, max:4}, 3, 224, 224]"

bin/driver compile /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --default-dyn-dim "{min:1, max:10}" --output resnet50_batch1-10.mxr

bin/driver perf resnet50_batch1-10.mxr --batch 4

adccec52

Add reduction fusion (#1614) · f201285c
Paul Fultz II authored Apr 05, 2023
```
Automatically fuse multiple reductions and pointwise operations.
```
f201285c

05 Apr, 2023 3 commits
- Add MIGRAPHX_VALIDATE_MATCHES env variable to validate each matcher (#1372) · a123cb2e
  Paul Fultz II authored Apr 05, 2023
```
* Add MIGRAPHX_VALIDATE_MATCHES env variable to validate each matcher
```
  a123cb2e
- Optimize add convolution (#1549) · df32040d
  Paul Fultz II authored Apr 05, 2023
```
This will replace conv(x+a, w) with conv(x, w) + conv(a, w) where a is a constant so conv(a, w) can be replaced with a constant.
```
  df32040d
- Add missing header for sles and centos (#1665) · 8beb6680
  Paul Fultz II authored Apr 04, 2023
  
  8beb6680