Commits · 555d07a2421bad0b871bbfd302a0fb45466bc0bf · gaoqiong / MIGraphX

17 Jun, 2022 2 commits

Remove failed test · 555d07a2
Paul authored Jun 17, 2022

555d07a2

Create allocate op and replace_allocate pass (#1183) · add6fb3b

kahmed10 authored Jun 17, 2022



* add allocate op header

* formatting

* add replace_allocate pass

* formatting

* move output param to remove_allocate pass

* formatting

* fix bugs in replace_allocate pass

* formatting

* fix verify if tests

* formatting

* move if op logic

* formatting

* cleanup lowering

* cleanup lowering

* formatting

* fix tidy

* formatting

* fix tidy

* add cpu allocate check

* formatting

* change cpu allocate in pass

* formatting

* add some tests for replace_allocate pass

* formatting

* pass by ref

* fix run_pass

* formatting

* update variable name for module

* update dce to use contains() and fix tidy

* formatting

* update cppcheck

* add if test

* formatting

* add if test

* rename var to mod_output_names

* formatting

* remove conditional

* update allocate op and tests

* formatting

* update replace_allocate tests

* update create_output_names() and conditional in replace_allocate

* formatting

* remove extra variable in replace_allocate

* update tools script for allocation_model

Co-authored-by: Umang Yadav <29876643+umangyadav@users.noreply.github.com>
Co-authored-by: Chris Austen <causten@users.noreply.github.com>
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>

add6fb3b

13 Jun, 2022 1 commit
- Update failed tests · 2e19d280
  Paul authored Jun 13, 2022
  
  2e19d280
09 Jun, 2022 1 commit
- Move mlir compile to jit pipeline · 02b0095c
  Paul authored Jun 09, 2022
  
  02b0095c
25 May, 2022 3 commits
- List failed tests · 3325ac9c
  Paul authored May 25, 2022
  
  3325ac9c
- Format · 79ffac9f
  Paul authored May 24, 2022
  
  79ffac9f
- Cleanup debug output · b7f31df5
  Paul authored May 24, 2022
  
  b7f31df5
24 May, 2022 2 commits
- Format · 9dcbd52b
  Paul authored May 24, 2022
  
  9dcbd52b
- Handle symmetrical padding · 4272fff1
  Paul authored May 24, 2022
  
  4272fff1
18 May, 2022 3 commits
- Add comment · e872b1b7
  Paul authored May 18, 2022
  
  e872b1b7
- Format · 44e41db9
  Paul authored May 18, 2022
  
  44e41db9
- Use func.return · 56a6b232
  Paul authored May 18, 2022
  
  56a6b232
06 May, 2022 2 commits
- Update triple · 722d5f5c
  Paul authored May 06, 2022
  
  722d5f5c
- Add compile tests for gpu math functions (#1182) · 6a5cda96
  Paul Fultz II authored May 06, 2022
```
Add compile tests for gpu math functions
```
  6a5cda96
29 Mar, 2022 1 commit

Refactor runtime compiled kernels to use the same compile_ops pipeline (#1125) · 661046c6

Paul Fultz II authored Mar 29, 2022

This adds the infrastructure so we can compile everything in parallel, whereas before only pointwise kernels were compiled in parallel. This will also directly integrate with lowering and the gpu-driver. The kernels for pointwise and roialign use this infrastructure. Scatternd does not, since it requires a standard shape.

This also makes it easier to add new runtime compiled kernels in the future.

661046c6

25 Feb, 2022 1 commit
- Add get_queue to context to get the current stream (#1097) · e5242676
  Paul Fultz II authored Feb 24, 2022
```
The queue is wrapped in an any_ptr class so the type can be checked at runtime for a mismatch.
```
  e5242676
09 Feb, 2022 1 commit
- Enable pointwise fusion by default (#1082) · c7419a9c
  Paul Fultz II authored Feb 09, 2022
```
There is now a MIGRAPHX_DISABLE_POINTWISE_FUSION environment variable to disable it
```
  c7419a9c
26 Jan, 2022 1 commit
- Updates · 1cc6c88c
  Paul authored Jan 25, 2022
  
  1cc6c88c
10 Jan, 2022 1 commit
- Fix output arg · 88f549e2
  Paul authored Jan 09, 2022
  
  88f549e2
11 Dec, 2021 3 commits
- Formatting · d0feb6b4
  Paul authored Dec 10, 2021
  
  d0feb6b4
- Add mlir verification · c83ee9f8
  Paul authored Dec 10, 2021
  
  c83ee9f8
- Don't provide output for return instruction · 2c952efd
  Paul authored Dec 10, 2021
  
  2c952efd
01 Dec, 2021 3 commits
- Handle unsigned integers · b406a418
  Paul authored Dec 01, 2021
  
  b406a418
- Register dialect · 1851e975
  Paul authored Dec 01, 2021
  
  1851e975
- Add mlir_compile · 812cd5c8
  Paul authored Dec 01, 2021
  
  812cd5c8
24 Nov, 2021 2 commits
- Format · ee382ad9
  Paul authored Nov 24, 2021
  
  ee382ad9
- Add return · 2a0ff223
  Paul authored Nov 24, 2021
  
  2a0ff223
17 Nov, 2021 1 commit

Handle removing contiguous on operators that use modules (#1005) · 785307c3

Paul Fultz II authored Nov 17, 2021

Currently, eliminate_contiguous never removes contiguous for operators that use module inputs, because it doesn't pass the module inputs to compute_shape.

- Update to pass the module inputs correctly to compute_shape
- Fix the overloads of compute_shape so that when passed an empty vector of module inputs it will call the overload without module inputs
- Add tests with contiguous and pointwise module function.
- Move add_pointwise function to a separate header to reuse across different tests

785307c3

16 Nov, 2021 2 commits
- Format · 15177ac0
  Paul authored Nov 16, 2021
  
  15177ac0
- Fix bug when appending module · f7f61d7a
  Paul authored Nov 16, 2021
  
  f7f61d7a
09 Nov, 2021 2 commits
- Formatting · cf4642cd
  Paul authored Nov 09, 2021
  
  cf4642cd
- Move mlir to the gpu and update the test · 0ad547aa
  Paul authored Nov 09, 2021
  
  0ad547aa
08 Oct, 2021 1 commit

Remove alpha and beta from `dot` and `quant_dot` (#961) · 21193e87

Umang Yadav authored Oct 08, 2021

Previously, the dot operator was defined as C = alpha * A . B + beta * C, where * is scalar multiplication and . is the dot product or matrix multiplication, depending on the dimensions of the inputs.

The aim is to define the dot operator as C = A . B, without alpha or beta.

To achieve the same effect as alpha and beta: (1) one of the inputs to the dot operator is multiplied by the alpha value; (2) if beta is present, C is multiplied by beta and then added to the output of step 1.
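
The equivalence between the old fused definition and the two-step decomposition can be checked with a small sketch. This is illustrative plain Python, not MIGraphX code: the helper names (`matmul`, `scale`, `add`, `fused_dot`, `decomposed_dot`) are hypothetical, and small integer matrices are assumed so that the comparison is exact.

```python
def matmul(a, b):
    """Plain matrix product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def scale(m, s):
    """Multiply every element of matrix m by the scalar s."""
    return [[s * x for x in row] for row in m]


def add(m, n):
    """Element-wise sum of two matrices of the same shape."""
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(m, n)]


def fused_dot(a, b, c, alpha, beta):
    """Old definition: alpha * (A . B) + beta * C."""
    return add(scale(matmul(a, b), alpha), scale(c, beta))


def decomposed_dot(a, b, c=None, alpha=1, beta=0):
    """New form: a plain dot plus explicit scaling steps."""
    # Step 1: multiply one input by alpha, then take the plain dot.
    out = matmul(scale(a, alpha), b)
    # Step 2: if C (and beta) are present, scale C by beta and add it.
    if c is not None:
        out = add(out, scale(c, beta))
    return out


A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]
C = [[1, 1], [1, 1]]

old = fused_dot(A, B, C, alpha=2, beta=3)
new = decomposed_dot(A, B, C, alpha=2, beta=3)
print(old == new)
```

Because scalar multiplication commutes with the matrix product, scaling either input by alpha before the dot gives the same result as scaling the product afterwards. With floating-point data the two forms may differ by rounding error.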

21193e87

07 Sep, 2021 1 commit

qdq for quantization and include subgraph (#891) · b45f7239

Shucai Xiao authored Sep 07, 2021



Add operators, refactor parsers, add rewrite passes, add tests
Add ref implementations
Move broadcasting of scales and zero points to onnx parser
Allow for x and zero_point to have different types in quantizelinear; fix zero_point default type
fp16 and fp8 quantization to include subgraph and parameters
fix unit test to use qdq operators for int8 quantization

Co-authored-by: turneram <alturner@amd.com>

b45f7239

31 Aug, 2021 1 commit

Fix debug assert (#930) · bd85a76c

Shucai Xiao authored Aug 31, 2021

* fix two asserts for debug build

* add unit test for copy parameters

* clang format

* add a unit test for reorder_dims

* change transpose to always require a non-empty perm

* clang format

* remove an unnecessary line

* fix tidy error

* fix review comments

bd85a76c

24 Aug, 2021 1 commit

Change attributes names to be more consistent and reflect better meaning (#916) · 0d2606bb

Umang Yadav authored Aug 24, 2021

* rename broadcast and multibroadcast output_lens attribute to out_lens attribute, and change tests and source code to reflect the same

* change the reshape attribute from dims to out_lens

* change transpose attribute's name from dims to perm to reflect better meaning

* use permutation instead of perm for transpose

clang formatting

* use dims instead of out_lens for reshape

clang formatting

0d2606bb

19 Aug, 2021 1 commit
- Enable warnings when jit compiling (#913) · ccff6beb
  Paul Fultz II authored Aug 19, 2021
```
* Enable warnings when jit compiling

* Formatting
```
  ccff6beb
10 Aug, 2021 1 commit

Add option to compile with hiprtc (#892) · 91c9ebbc

Paul Fultz II authored Aug 10, 2021

* Add hiprtc compile option
* Add cross compile test
* Update error reporting
* Add tests for errors and warnings
* Fix tidy warning
* Add comment to ifdefs
* Skip null character at end of log
* Assert there is null at the end

91c9ebbc

05 Aug, 2021 1 commit

Add gpu driver and improvements to pointwise codegen (#851) · 29fa2666

Paul Fultz II authored Aug 05, 2021



* Add method to compile pointwise

* Formatting

* Add lambda

* Add semicolon

* Rename variable

* Add driver to run jit kernels

* Formatting

* Add context

* Formatting

* Make separate driver folder

* Add more general gpu driver

* Formatting

* Print out wall time

* Formatting

* Run multiple times and skip first run

* Formatting

* Separate time_op

* Run an op for comparison

* Formatting

* Add debug asserts

* Formatting

* Change parameter name

* Formatting

* Fix argument order

* Formatting

* Add preloading

* Formatting

* Allow a different data type

* Formatting

* Pipeline transformations

* Formatting

* Add vectorization

* Formatting

* Reduce dims

* Formatting

* Compile with launch params as constant

* Formatting

* Make sure buffer can be vectorized

* Formatting

* Enable vectorization and preloading

* Formatting

* Add print header

* Formatting

* Avoid allocating too much LDS

* Formatting

* Add some vec functions to a separate header

* Formatting

* Add stride loops

* Formatting

* Improve the transform pipeline

* Formatting

* Add const

* Fix shape check

* Formatting

* Just check stride axis is zero

* Remove extra finc_vector_axis overload

* Simplify some more functions

* Formatting

* Remove some more extra functions

* Formatting

* Simplify more decltypes

* Add another const

* Fix test

* Get buffer pointer differently for older compilers

Co-authored-by: Shucai Xiao <shucai@gmail.com>
Co-authored-by: Chris Austen <causten@users.noreply.github.com>

29fa2666

14 Jul, 2021 1 commit

Use the same device name function in the unit tests (#881) · 0b04fc80

Paul Fultz II authored Jul 14, 2021



* Unify device_name function

* Formatting

Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>

0b04fc80