Commits · b35f6188b5c23e818f2d7d2967d520c42cf96c33 · gaoqiong / MIGraphX

03 Jul, 2022 4 commits
- Fix compile error · b35f6188
  Paul authored Jul 02, 2022
  
  b35f6188
- Format · 0e64d364
  Paul authored Jul 02, 2022
  
  0e64d364
- Fix query string · a86f411b
  Paul authored Jul 02, 2022
  
  a86f411b
- Fix function name · 595532cd
  Paul authored Jul 02, 2022
  
  595532cd
29 Jun, 2022 5 commits
- Format · 8431f8cc
  Paul authored Jun 28, 2022
  
  8431f8cc
- Add missing license files · 098ef858
  Paul authored Jun 28, 2022
  
  098ef858
- Remove comments · 811ee921
  Paul authored Jun 28, 2022
  
  811ee921
- Format · cfbd91a9
  Paul authored Jun 28, 2022
  
  cfbd91a9
- Query to perfdb · c37b1636
  Paul authored Jun 28, 2022
  
  c37b1636
25 Jun, 2022 2 commits
- bug fix: register the miopen_fusion op. (#1267) · 3b0a9116
  Brian Pickrell authored Jun 24, 2022
```
One-line fix to register the op miopen_fusion. This error was causing loading of compiled model files (*.mxr) to fail.
```
  3b0a9116
- Use jit for contiguous operator (#1217) · b75c83d8
  Paul Fultz II authored Jun 24, 2022
```
* Jit contiguous
```
  b75c83d8
23 Jun, 2022 1 commit
- remove eliminate_workspace pass (#1254) · f5760e21
  kahmed10 authored Jun 23, 2022
```
* remove eliminate workspace
* remove sync device and other tags
```
  f5760e21
22 Jun, 2022 4 commits
- Update license files (#1248) · e44cecbc
  Ted Themistokleous authored Jun 22, 2022
```
Updated each source file in the repo with the existing license.
```
  e44cecbc
- Format · 5fc228f6
  Paul authored Jun 22, 2022
  
  5fc228f6
- Other tidy fix · b664b50c
  Paul authored Jun 22, 2022
  
  b664b50c
- Fix tidy · f058aa18
  Paul authored Jun 22, 2022
  
  f058aa18
20 Jun, 2022 3 commits
- Fixing misspelled macro to enable MIOpen hidden find mode API (#1250) · c0398ded
  Zhuoran Yin authored Jun 20, 2022
```
* Fixing misspelled macro
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
```
  c0398ded
- Use another ifdef · e977ab07
  Paul authored Jun 20, 2022
  
  e977ab07
- Fix tidy warning · fd5a68a7
  Paul authored Jun 20, 2022
  
  fd5a68a7
17 Jun, 2022 11 commits

- Fix const · 7d023b6e
  Paul authored Jun 17, 2022
  
  7d023b6e
- Format · f791188a
  Paul authored Jun 17, 2022
  
  f791188a
- Tidy fixes · 33423f8c
  Paul authored Jun 17, 2022
  
  33423f8c
- Format · 390586c5
  Paul authored Jun 17, 2022
  
  390586c5
- Tidy fixes · f374143f
  Paul authored Jun 17, 2022
  
  f374143f
- Format · bf3e958d
  Paul authored Jun 17, 2022
  
  bf3e958d
- Check type for fp32 · 2bba1c7c
  Paul authored Jun 17, 2022
  
  2bba1c7c
- Format · d97b3111
  Paul authored Jun 17, 2022
  
  d97b3111
- Fix failures when mlir is disabled · 6f768f82
  Paul authored Jun 17, 2022
  
  6f768f82

- Update lowering of Dot operator (#1247) · c99be32c
  Umang Yadav authored Jun 17, 2022
  
  * remove code for allocation of C param in dot lowering
  * formatting
  Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
  
  c99be32c

- Create allocate op and replace_allocate pass (#1183) · add6fb3b
  kahmed10 authored Jun 17, 2022
  
  * add allocate op header
  * formatting
  * add replace_allocate pass
  * formatting
  * move output param to remove_allocate pass
  * formatting
  * fix bugs in replace_allocate pass
  * formatting
  * fix verify if tests
  * formatting
  * move if op logic
  * formatting
  * cleanup lowering
  * cleanup lowering
  * formatting
  * fix tidy
  * formatting
  * fix tidy
  * add cpu allocate check
  * formatting
  * change cpu allocate in pass
  * formatting
  * add some tests for replace_allocate pass
  * formatting
  * pass by ref
  * fix run_pass
  * formatting
  * update variable name for module
  * update dce to use contains() and fix tidy
  * formatting
  * update cppcheck
  * add if test
  * formatting
  * add if test
  * rename var to mod_output_names
  * formatting
  * remove conditional
  * update allocate op and tests
  * formatting
  * update replace_allocate tests
  * update create_output_names() and conditional in replace_allocate
  * formatting
  * remove extra variable in replace_allocate
  * update tools script for allocation_model
  Co-authored-by: Umang Yadav <29876643+umangyadav@users.noreply.github.com>
  Co-authored-by: Chris Austen <causten@users.noreply.github.com>
  Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
  
  add6fb3b

13 Jun, 2022 4 commits
- Format · 1770a342
  Paul authored Jun 13, 2022
  
  1770a342
- Correctly add module · aeb60bce
  Paul authored Jun 13, 2022
  
  aeb60bce
- Format · f75c5a38
  Paul authored Jun 12, 2022
  
  f75c5a38
- Add source locations · af09c35f
  Paul authored Jun 12, 2022
  
  af09c35f
10 Jun, 2022 1 commit

- Add vectorized reduce (#1202) · aa7ff911
  Paul Fultz II authored Jun 09, 2022
  
  Consolidate the vectorize and preload
  Add vectorization to reduction
  Co-authored-by: kahmed10 <15948690+kahmed10@users.noreply.github.com>
  
  aa7ff911
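
For readers unfamiliar with the term, the general idea behind a vectorized reduction can be sketched as follows. This is a conceptual illustration only, not the MIGraphX kernel code: the function name `vectorized_sum` and the `width` parameter are hypothetical, and plain Python lists stand in for GPU vector loads.

```python
def vectorized_sum(data, width=4):
    """Sum `data` by processing `width` elements per step (hypothetical sketch)."""
    # One partial accumulator per vector lane.
    lanes = [0] * width
    # Largest prefix whose length is a multiple of the vector width.
    n = len(data) - len(data) % width
    for i in range(0, n, width):
        # Conceptually a single vector load followed by a vector add.
        for lane in range(width):
            lanes[lane] += data[i + lane]
    # Combine the per-lane partial sums.
    total = sum(lanes)
    # Scalar tail for elements that do not fill a whole vector.
    for x in data[n:]:
        total += x
    return total
```

The tail loop is what keeps the result correct when the input length is not a multiple of the assumed vector width.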

09 Jun, 2022 2 commits
- Format · 6b5c64ff
  Paul authored Jun 09, 2022
  
  6b5c64ff
- Move mlir compile to jit pipeline · 02b0095c
  Paul authored Jun 09, 2022
  
  02b0095c
07 Jun, 2022 1 commit

- Prioritizing int8 over int8x4 when it is applicable (#1218) · 37c47504
  Zhuoran Yin authored Jun 07, 2022
  
  prioritizing int8 over int8x4 when it is applicable
  Amend return to continue in apply loop
  Adding error handling in case int8x4 compilation failed
  Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
  
  37c47504

03 Jun, 2022 1 commit

- Group code objects by kernel name in perf report summary (#1234) · 7271ddbc
  Paul Fultz II authored Jun 02, 2022
  
  Break up the gpu::code_object print to show the actual kernels...
  
  gpu::code_object::add_kernel: 0.646121ms, 5%
  gpu::code_object::mul_kernel: 0.623822ms, 5%
  gpu::code_object::add_mul_erf_add_mul_mul_kernel: 0.498902ms, 4%
  gpu::code_object::mul_add_kernel: 0.478352ms, 4%
  
  7271ddbc
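
The grouping described in this commit message can be illustrated with a small sketch. It is not the MIGraphX implementation: `summarize_by_kernel`, `format_rows`, and the `(op, kernel, ms)` tuple layout are assumptions made for illustration, and only the output line format is taken from the sample above.

```python
from collections import defaultdict

def summarize_by_kernel(timings):
    """Group timings by operator and kernel name (hypothetical sketch).

    `timings` is an iterable of (op_name, kernel_name_or_None, milliseconds).
    Returns (name, total_ms, percent) rows sorted by total time, descending.
    """
    totals = defaultdict(float)
    for op, kernel, ms in timings:
        # Append the kernel name so each kernel gets its own summary row.
        key = f"{op}::{kernel}" if kernel else op
        totals[key] += ms
    grand = sum(totals.values())
    rows = sorted(totals.items(), key=lambda kv: kv[1], reverse=True)
    return [(name, ms, 100.0 * ms / grand if grand else 0.0) for name, ms in rows]

def format_rows(rows):
    """Render rows in the 'name: time ms, percent' style shown in the commit message."""
    return [f"{name}: {ms:.6f}ms, {pct:.0f}%" for name, ms, pct in rows]
```

Without the kernel name in the key, every `gpu::code_object` entry would collapse into a single row, which is the behavior the commit message says it breaks up.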

02 Jun, 2022 1 commit
- Fix dangling reference with gemm add fusion (#1233) · 1339ba35
  Paul Fultz II authored Jun 01, 2022
  
  1339ba35