Commits · 1eb5a1d48037bb03c4dc7347837c5a548c2d1036 · gaoqiong / MIGraphX

13 Jan, 2023 1 commit

Charlie Lin authored Jan 13, 2023

Extends parse_matmul.hpp to handle dynamic input shapes
Does not support broadcasting of the outer dimensions for dynamic shapes at this time

1eb5a1d4

11 Jan, 2023 2 commits
- Use cosine to compute half sin (#1508) · 3fb5c0ef
  Paul Fultz II authored Jan 11, 2023
```
* Use cosine to compute half sin
```
  3fb5c0ef
- Dynamic Conv bias fix (#1502) · 8497e9dc
  Charlie Lin authored Jan 11, 2023
```
Fixes ONNX parsing of convolution to handle dynamic broadcasting of bias input
```
  8497e9dc
09 Jan, 2023 1 commit

Add JIT Gather Operator (#1492) · 054364cd

Ted Themistokleous authored Jan 09, 2023

JIT implementation of the gather operator
Added a few more unit tests to this one as well since I saw some odd behavior during bring up.

054364cd

04 Jan, 2023 1 commit

Dynamic reduce (#1432) · 0d197f27

Brian Pickrell authored Jan 04, 2023

Implements dynamic shapes in reduce_op and all its child operator classes (reduce_max etc.)

0d197f27

14 Dec, 2022 1 commit
- Print program as python (#1490) · 56c43445
  Paul Fultz II authored Dec 14, 2022
```
* Print python code
```
  56c43445
13 Dec, 2022 2 commits
- Update file reading function to fix external data loading (#1460) · b8c8d09b
  kahmed10 authored Dec 14, 2022
  
  b8c8d09b
- Refactor dynamic_dimension fixed compare (#1470) · a9d6071a
  Charlie Lin authored Dec 13, 2022
```
Implements the operator==(dynamic_dimension, size_t) functions
```
  a9d6071a
11 Dec, 2022 1 commit

change target flag (#1488) · b41c1f01

Umang Yadav authored Dec 11, 2022

HIP had change in previous rocm releases to use --offload-arch instead of --cuda-gpu-arch.

This should be backwards compatbile. hipRTC also supports --offload-arch.

b41c1f01

08 Dec, 2022 4 commits

Dynamic ref dot operator (#1457) · d411aa69

Charlie Lin authored Dec 08, 2022

Extends dot MIGX operator to handle dynamic input shapes
Only allow dot between two dynamic shapes that have exactly matching outer dimensions
Inner dimensions must also match correspondingly
Updates dot related tests
Change check_shapes to use shape.ndim()
ONNX parsers for GEMM and MatMult will be updated in a separate PR

d411aa69

Dynamic reference Softmax (#1475) · 8e7d2efe

Charlie Lin authored Dec 08, 2022

No major changes required, use dyn_output and pass dynamic shape when calling compute_shape()
Adds dynamic shape tests

8e7d2efe

Dynamic ref flatten (#1482) · 4c32afcc

Charlie Lin authored Dec 08, 2022

Changes flatten's compute_shape() to handle dynamic shapes
Calculates the flattened shape with the min, max, and opt

4c32afcc

fix issues with compiling lstm ops in fp16 mode (#1450) · 352c2465

shivadbhavsar authored Dec 07, 2022

Currently, quantizing a program with rnn layers to fp16 results in segmentation faults due to a "convert" operation being applied to an "undefined" instruction.

The following changes are implemented to fix this issue:

Added is_undefined method to the instruction class that returns true if all inputs to the instruction are from an undefined op.
Updated rewrite_rnn pass to use the new is_undefined method rather than checking ins->name()
Updated the dead_code_elimination pass to also use this new method rather than only checking the instruction name

352c2465

07 Dec, 2022 2 commits
- Fix conversion issue in layernorm fusion (#1483) · 37c3c4a9
  Paul Fultz II authored Dec 07, 2022
```
* Add implicit_conversion
```
  37c3c4a9
- Dynamic ref Argmax (#1478) · 231d60a2
  Charlie Lin authored Dec 07, 2022
```
Extends the Argmax operator to handle dynamic input shapes.
Only shape function changes
```
  231d60a2
06 Dec, 2022 3 commits

Add tupleVisitor for from_gpu (#1465) · a4c2b889

Ted Themistokleous authored Dec 06, 2022

Need this for when we debug and use MIGRAPHX_TRACE_EVAL() to show tuples
Without this we break when reading our buffer due to the use of visit()
This came up as part of #1283 debugging.

a4c2b889

Dynamic ref squeeze and unsqueeze (#1426) · 48cc33e4

Charlie Lin authored Dec 06, 2022

Extends unsqueeze and squeeze to work for dynamic input shapes
Does not handle the steps parameter
Adds some additional negative axes shape tests

48cc33e4

Update MLIR integration (#1451) · be70702d

jungpark-mlir authored Dec 06, 2022

Update dialect registration interface
Update 2nd build pipeline call and use full arch name

be70702d

02 Dec, 2022 2 commits

Refactor non-standard literal construction (#1443) · fdc3f00a

Charlie Lin authored Dec 02, 2022

Fix problem with the contiguous operator constructing non-standard shape literals.  A non-standard literal will almost never be used, since a literal is known at compile time.  Added some comments on the intended behavior:

- literal{shape, vector} constructor with a non-standard shape is intended to keep the same ordering as the given vector. The data buffer will be populated such that when the non-standard indexing is used the original order is as given.
- literal{shape, argument} constructor directly copies the data buffer from the argument
- Changed non-standard literal fill() to use tensor_view iterators as it handles non-standard shapes now
- Changed the contiguous ref_ops_test to be more helpful

fdc3f00a

Dynamic ref pooling (#1449) · 0e40ebaa

Charlie Lin authored Dec 02, 2022

Extends the pooling operators for dynamic shape inputs

AveragePooling
GlobalAveragePooling
MaxPooling
GlobalMaxPooling
LpNormPooling
GlobalLpNormPooling
y.github.com>

0e40ebaa

29 Nov, 2022 1 commit

remove extra adjust allocation pass (#1477) · 5a2a83a4

kahmed10 authored Nov 30, 2022

Merging #1391 caused an extra adjust allocation pass for GPU targets. This removes that merge error.

5a2a83a4

28 Nov, 2022 1 commit

Dynamic ref transpose (#1438) · 32b08891

Charlie Lin authored Nov 28, 2022

Extends ref transpose operator for dynamic shapes
Make dynamic tests more consistent naming

32b08891

20 Nov, 2022 1 commit
- Make a cmake variable to enable find 2.0 (#1463) · 9f50b860
  Paul Fultz II authored Nov 20, 2022
  
  9f50b860
18 Nov, 2022 1 commit

Disable Find2.0 for now (#1462) · 493bb8d5

Umang Yadav authored Nov 18, 2022

Disabling it untill int8 fix is in mainline from MIOpen and also so that QA tests could run migraphx-driver and unittests from MIGraphX.

493bb8d5

17 Nov, 2022 2 commits

Fix logical_xor type checking (#1458) · af7e6eaa
Ted Themistokleous authored Nov 17, 2022
```
Fix to stop types failing for logical_xor during our fusions. 
```
af7e6eaa

Dynamic ref contiguous (#1445) · 95d82a51

Charlie Lin authored Nov 17, 2022

Extends the ref contiguous operator to handle dynamic shapes
Updates the eliminate_contiguous pass to use the dyn_output struct

95d82a51

14 Nov, 2022 1 commit
- Include timestamp while tracing (#1442) · 4a7af806
  Chris Austen authored Nov 14, 2022
```
* Include timestamp while tracing
```
  4a7af806
13 Nov, 2022 1 commit

Dyn ref multibroadcast; dyn binary (#1423) · d73c6d7c

Charlie Lin authored Nov 13, 2022

Updated Multibroadcast op to have a two input version for dynamic shapes
Current dynamic shape broadcasting logic
dynamic_dimensions must be the same or one of them is {1, 1, 0} or {1, 1, 1}
Works for dyn-dyn, dyn-static, and static-static shape combinations
Changed common.cpp for multibroadcasting for binary ops with dynamic shapes
Extended binary.hpp for dynamic shapes to test the new common.cpp stuff

d73c6d7c

07 Nov, 2022 1 commit
- Update rocblas header include path (#1444) · df2e7635
  arvindcheru authored Nov 07, 2022
  
  df2e7635
06 Nov, 2022 1 commit
- fix overflow for workspace size (#1446) · 18234a58
  Umang Yadav authored Nov 06, 2022
  
  18234a58
02 Nov, 2022 2 commits
- Add nhwc layout to gpu backend (#1391) · 1820198e
  Paul Fultz II authored Nov 02, 2022
```
Can be enabled via environment variable MIGRAPHX_ENABLE_NHWC
```
  1820198e
- Concat pointwise fusions (#1388) · 2f48b11a
  Paul Fultz II authored Nov 02, 2022
  
  2f48b11a
01 Nov, 2022 2 commits
- Add opset-13 support for parse_split (#1429) · 70d0e816
  Ted Themistokleous authored Nov 01, 2022
```
Newer split moves the split attribute to an input. In this case we check the
number of input args then.
```
  70d0e816
- Include array header for compatibility with GCC 12 (#1435) · ba0913b1
  Torsten Keßler authored Nov 01, 2022
  
  ba0913b1
28 Oct, 2022 1 commit

Use minimum block size of 64 threads (#1427) · 25a0e433

Umang Yadav authored Oct 28, 2022

Local Threads of multiples 32 were introduced in #1348
But LocalThreads that are not multiple of 64 are causing correctness issues.

25a0e433

27 Oct, 2022 2 commits

Upgrade CI environment to 5.3.0 (#1198) · 4b1c1c41

Chris Austen authored Oct 27, 2022

Upgraded Dockerfiles and fixed tidy issues to make Ubuntu 20.04 and ROCm 5.3.0 the default

4b1c1c41

Add JIT pad (#1411) · 0d841ded

kahmed10 authored Oct 27, 2022

updated GPU pad to now use JIT version.
added range functions for JIT kernels.

0d841ded

26 Oct, 2022 2 commits

rearrange default pass list; adjust_allocation must be run after rep… (#1418) · 7b9ce460
Brian Pickrell authored Oct 26, 2022
```
Fixes an observed regression error on certain Frozen Protobuf models due to PR 1280
```
7b9ce460

Regenerate driver models (#1422) · d8756a4e

kahmed10 authored Oct 26, 2022

use_dynamic_same_auto_pad was removed from convolution, but the driver models still retain the fields. This PR regenerates the files so that they are compatible again.

d8756a4e

24 Oct, 2022 1 commit

Add relaxed standard shape assertion (#1416) · f1ecad75

jungpark-mlir authored Oct 24, 2022

Reiterate the assertion on the standard shape but relax it for the multibroadcast ops deliberately inserted to explicit the broadcast.

f1ecad75