1. 24 Jan, 2023 1 commit
  2. 21 Jan, 2023 7 commits
  3. 17 Jan, 2023 4 commits
  4. 16 Jan, 2023 1 commit
  5. 13 Jan, 2023 2 commits
  6. 11 Jan, 2023 3 commits
  7. 09 Jan, 2023 2 commits
  8. 06 Jan, 2023 2 commits
  9. 04 Jan, 2023 1 commit
  10. 29 Dec, 2022 1 commit
  11. 14 Dec, 2022 2 commits
  12. 13 Dec, 2022 2 commits
  13. 11 Dec, 2022 1 commit
    • change target flag (#1488) · b41c1f01
      Umang Yadav authored
      HIP changed in previous ROCm releases to use --offload-arch instead of --cuda-gpu-arch.
      
      This should be backwards compatible; hipRTC also supports --offload-arch.
  14. 08 Dec, 2022 4 commits
    • Dynamic ref dot operator (#1457) · d411aa69
      Charlie Lin authored
      Extends the MIGraphX dot operator to handle dynamic input shapes
      Only allows dot between two dynamic shapes whose outer dimensions match exactly
      Inner dimensions must also match correspondingly
      Updates dot-related tests
      Changes check_shapes to use shape.ndim()
      ONNX parsers for Gemm and MatMul will be updated in a separate PR
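The outer/inner dimension rule described in #1457 above can be sketched as follows. This is a minimal illustration, not the MIGraphX API: `dyn_dim` and `dot_shapes_compatible` are hypothetical names, assuming each dynamic dimension carries min/max/opt bounds.

```cpp
#include <cstddef>
#include <vector>

// Illustrative dynamic dimension carrying min/max/opt bounds.
struct dyn_dim
{
    std::size_t min, max, opt;
};

inline bool operator==(const dyn_dim& a, const dyn_dim& b)
{
    return a.min == b.min && a.max == b.max && a.opt == b.opt;
}

// Two dynamic shapes are dot-compatible when every outer (batch)
// dimension matches exactly and the contracted dimensions
// (A's last, B's second-to-last) also match exactly.
inline bool dot_shapes_compatible(const std::vector<dyn_dim>& a,
                                  const std::vector<dyn_dim>& b)
{
    if(a.size() != b.size() || a.size() < 2)
        return false;
    const std::size_t n = a.size();
    for(std::size_t i = 0; i + 2 < n; ++i)
        if(!(a[i] == b[i]))
            return false;
    return a[n - 1] == b[n - 2];
}
```

Requiring exact `{min, max, opt}` equality, rather than range overlap, mirrors the commit's "exactly matching" wording.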
    • Dynamic reference Softmax (#1475) · 8e7d2efe
      Charlie Lin authored
      No major changes required: uses dyn_output and passes the dynamic shape when calling compute_shape()
      Adds dynamic shape tests
    • Dynamic ref flatten (#1482) · 4c32afcc
      Charlie Lin authored
      Changes flatten's compute_shape() to handle dynamic shapes
      Calculates the flattened shape from the min, max, and opt bounds of each dimension
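The flatten calculation described in #1482 above can be sketched like this: collapsing a run of dynamic dimensions multiplies the corresponding min, max, and opt bounds, just as a static flatten multiplies fixed lengths. `dyn_dim` and `flatten_dims` are illustrative names, not the MIGraphX API.

```cpp
#include <cstddef>
#include <vector>

// Illustrative dynamic dimension carrying min/max/opt bounds.
struct dyn_dim
{
    std::size_t min, max, opt;
};

// Collapse a range of dynamic dimensions into a single dimension by
// multiplying each bound independently.
dyn_dim flatten_dims(const std::vector<dyn_dim>& dims)
{
    dyn_dim out{1, 1, 1};
    for(const auto& d : dims)
    {
        out.min *= d.min;
        out.max *= d.max;
        out.opt *= d.opt;
    }
    return out;
}
```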
    • fix issues with compiling lstm ops in fp16 mode (#1450) · 352c2465
      shivadbhavsar authored
      Currently, quantizing a program with rnn layers to fp16 results in segmentation faults due to a "convert" operation being applied to an "undefined" instruction.
      
      The following changes are implemented to fix this issue:
      
      Added an is_undefined method to the instruction class that returns true if all inputs to the instruction come from an undefined op.
      Updated the rewrite_rnn pass to use the new is_undefined method rather than checking ins->name()
      Updated the dead_code_elimination pass to also use this method rather than only checking the instruction name
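The is_undefined check described in #1450 above could look roughly like the sketch below, under one plausible recursive reading of "all inputs are from an undefined op". The `instruction` struct here is a hypothetical stand-in, not the real MIGraphX instruction class.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Minimal stand-in for an IR instruction node; op_name and inputs
// are illustrative, not the real MIGraphX members.
struct instruction
{
    std::string op_name;
    std::vector<const instruction*> inputs;

    // True if this instruction is itself an undefined op, or if every
    // one of its inputs ultimately comes from an undefined op.
    bool is_undefined() const
    {
        if(op_name == "undefined")
            return true;
        if(inputs.empty())
            return false;
        return std::all_of(inputs.begin(), inputs.end(),
                           [](const instruction* i) { return i->is_undefined(); });
    }
};
```

With a check like this, a "convert" whose only input is an "undefined" op is itself treated as undefined, which is exactly the case that segfaulted during fp16 quantization of rnn layers.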
  15. 07 Dec, 2022 2 commits
  16. 06 Dec, 2022 3 commits
  17. 02 Dec, 2022 2 commits
    • Refactor non-standard literal construction (#1443) · fdc3f00a
      Charlie Lin authored
      Fixes a problem with the contiguous operator constructing non-standard shape literals. A non-standard literal will almost never be used, since a literal is known at compile time. Added some comments on the intended behavior:
      
      - literal{shape, vector} constructor with a non-standard shape is intended to keep the same ordering as the given vector. The data buffer is populated so that indexing with the non-standard shape yields the elements in the given order.
      - literal{shape, argument} constructor directly copies the data buffer from the argument
      - Changed non-standard literal fill() to use tensor_view iterators, since they now handle non-standard shapes
      - Changed the contiguous ref_ops_test to be more helpful
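The literal{shape, vector} ordering rule from #1443 above can be illustrated for the 2-D case: the buffer is filled so that strided (non-standard) indexing reads back the input elements in their original order. `fill_strided` is a hypothetical helper, not the MIGraphX literal constructor.

```cpp
#include <cstddef>
#include <vector>

// Fill a buffer for a non-standard (strided) rows x cols shape so that
// buf[i * stride0 + j * stride1] == data[i * cols + j], i.e. indexing
// with the strided shape reproduces the input ordering.
std::vector<int> fill_strided(const std::vector<int>& data,
                              std::size_t rows, std::size_t cols,
                              std::size_t stride0, std::size_t stride1)
{
    std::vector<int> buf(data.size());
    for(std::size_t i = 0; i < rows; ++i)
        for(std::size_t j = 0; j < cols; ++j)
            buf[i * stride0 + j * stride1] = data[i * cols + j];
    return buf;
}
```

For a 2x3 shape with transposed strides {1, 2}, the buffer ends up physically permuted even though logical traversal matches the input vector.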
    • Dynamic ref pooling (#1449) · 0e40ebaa
      Charlie Lin authored
      Extends the pooling operators for dynamic shape inputs
      
      AveragePooling
      GlobalAveragePooling
      MaxPooling
      GlobalMaxPooling
      LpNormPooling
      GlobalLpNormPooling