Commits · 6f032fc0841f60b5c02b64e7a10ea367507c5c8a · gaoqiong / MIGraphX

22 Nov, 2023 4 commits
- Benchmark precompile op in gpu driver (#2340) · 6f032fc0
  Paul Fultz II authored Nov 22, 2023
  
  6f032fc0
- fix tidy (#2459) · 89215595
  Umang Yadav authored Nov 22, 2023
  
  89215595
- Use double buffer for block_scan (#2436) · ee257d99
  Paul Fultz II authored Nov 22, 2023
  
  ee257d99
- Add support for the dilations attribute to Pooling ops (#2105) · 19bd9c49
  Mirza Halilčević authored Nov 22, 2023
```
Introduce the dilations attribute to the reference implementation of the pooling operators.
```
  19bd9c49
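
As a rough illustration of what a dilations attribute changes for a pooling op, the sketch below computes the output length along one spatial axis. It assumes floor rounding and symmetric padding, following the usual ONNX-style formula; the function names `dilated_kernel_extent` and `pooled_output_size` are hypothetical and are not taken from the MIGraphX sources.

```cpp
#include <cassert>
#include <cstddef>

// Span covered by a kernel of `kernel` taps spaced `dilation` elements apart.
// (Hypothetical helper, not a MIGraphX function.)
inline std::size_t dilated_kernel_extent(std::size_t kernel, std::size_t dilation)
{
    assert(kernel >= 1 and dilation >= 1);
    return dilation * (kernel - 1) + 1;
}

// Output length along one axis, assuming floor rounding and the same
// padding on both sides of the input.
inline std::size_t pooled_output_size(std::size_t input,
                                      std::size_t kernel,
                                      std::size_t stride,
                                      std::size_t padding,
                                      std::size_t dilation)
{
    assert(stride >= 1);
    const std::size_t extent = dilated_kernel_extent(kernel, dilation);
    const std::size_t padded = input + 2 * padding;
    // No window fits if the dilated kernel is wider than the padded input.
    if(padded < extent)
        return 0;
    return (padded - extent) / stride + 1;
}
```

With `dilation = 1` this reduces to the familiar undilated pooling formula, so existing models would be unaffected under these assumptions.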
21 Nov, 2023 2 commits

embed.cmake: add support for Windows resource file (#2330) · 24148857

Artur Wojcik authored Nov 21, 2023

This PR adds support for Windows resource files to Embed.cmake. It is ON by default on Windows; when it is OFF, *.cpp files are used. The same applies on Linux: ON -> *.o (LD), OFF -> *.cpp.

This PR fixes building resources on Linux with ld and objcopy commands.

24148857

enable compilation on Windows (#2402) · 624f8ef5
Artur Wojcik authored Nov 21, 2023

624f8ef5

17 Nov, 2023 1 commit

Ref implementation of FP8 (#2438) · 7f93a818

Umang Yadav authored Nov 17, 2023

Handles all 4 FP8 dtypes listed here: https://onnx.ai/onnx/technical/float8.html
Follows the saturation/clipping logic from the table there as well: https://onnx.ai/onnx/technical/float8.html#cast
Only fp8e4m3fnuz is added to the MIGraphX IR for now.

7f93a818
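
To make the fp8e4m3fnuz layout concrete, here is a minimal decoding sketch based on the format as described on the ONNX float8 page linked above: 1 sign bit, 4 exponent bits, 3 mantissa bits, exponent bias 8, no infinities, no negative zero, and a single NaN encoding at 0x80. The function name `decode_fp8e4m3fnuz` is hypothetical; this is not the code from the PR.

```cpp
#include <cmath>
#include <cstdint>
#include <limits>

// Decode one fp8e4m3fnuz byte into a float.
// (Hypothetical helper for illustration, not the MIGraphX implementation.)
inline float decode_fp8e4m3fnuz(std::uint8_t bits)
{
    // The bit pattern that would be "negative zero" is the only NaN.
    if(bits == 0x80)
        return std::numeric_limits<float>::quiet_NaN();

    const bool negative = (bits & 0x80) != 0;
    const int exponent  = (bits >> 3) & 0xF;
    const int mantissa  = bits & 0x7;

    float magnitude = 0.0f;
    if(exponent == 0)
    {
        // Subnormal: (mantissa / 8) * 2^(1 - 8) = mantissa * 2^-10
        magnitude = std::ldexp(static_cast<float>(mantissa), -10);
    }
    else
    {
        // Normal: (1 + mantissa / 8) * 2^(exponent - 8)
        magnitude = std::ldexp(1.0f + static_cast<float>(mantissa) / 8.0f, exponent - 8);
    }
    return negative ? -magnitude : magnitude;
}
```

Under this layout the largest finite magnitude is 240 (byte 0x7F), which is the value a saturating cast would clip to.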

16 Nov, 2023 1 commit
- fix detection of hiprtc driver on Windows (#2401) · 5488b443
  Artur Wojcik authored Nov 16, 2023
  
  5488b443
08 Nov, 2023 2 commits

Fix Round operator inaccuracy (#2244) · 48c4453c

Zakor Gyula authored Nov 08, 2023

The inaccuracy arose because ONNX Round requires halfway (0.5) cases to be rounded to the nearest even integer.
std::round rounds halfway cases away from zero, and so gives wrong results for them.
Replaced std::round with std::nearbyint, which uses the correct rounding by default.

48c4453c
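
The difference between the two standard-library functions can be seen in a small sketch. The wrapper names `round_half_away_from_zero` and `round_half_to_even` are hypothetical; the second one relies on the floating-point rounding mode being the default FE_TONEAREST, which is the assumption behind "by default" in the description above.

```cpp
#include <cassert>
#include <cfenv>
#include <cmath>

// std::round always rounds halfway cases away from zero: 2.5 -> 3, -2.5 -> -3.
inline double round_half_away_from_zero(double x) { return std::round(x); }

// std::nearbyint follows the current rounding mode. In the default
// FE_TONEAREST mode, halfway cases go to the nearest even integer:
// 2.5 -> 2, 3.5 -> 4.
inline double round_half_to_even(double x)
{
    // Assumption: nobody has changed the rounding mode via std::fesetround.
    assert(std::fegetround() == FE_TONEAREST);
    return std::nearbyint(x);
}
```

For non-halfway inputs both functions agree; they differ only on values such as 0.5, 2.5, or -2.5.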

Blas auto-tuning for GEMMs (#1668) · d7c8b66f
Brian Pickrell authored Nov 07, 2023

d7c8b66f

07 Nov, 2023 2 commits
- Update tracing of benchmark only when env var is set (#2409) · 3c160a3f
  Paul Fultz II authored Nov 07, 2023
  
  3c160a3f
- Add support for IsInf ONNX operator (#2289) · b206ed76
  Zakor Gyula authored Nov 07, 2023
  
  b206ed76
04 Nov, 2023 1 commit
- fix quotation marks in CMake to accommodate Windows (#2328) · b0798343
  Artur Wojcik authored Nov 04, 2023
  
  b0798343
30 Oct, 2023 2 commits
- Remove int8x4 format completely (#2373) · 22bb777f
  Umang Yadav authored Oct 30, 2023
  
  22bb777f
- make CK JIT optional (#2383) · 728bea34
  Artur Wojcik authored Oct 30, 2023
  
  728bea34
24 Oct, 2023 1 commit
- Ensure unique module name for MLIR standalone ops (#2360) · d1abf06f
  Paul Fultz II authored Oct 24, 2023
  
  d1abf06f
21 Oct, 2023 1 commit

Add benchmark tracing (#2354) · 8f9ccb9a

Paul Fultz II authored Oct 20, 2023

Add tracing to benchmark to show which kernels are running and the time of every kernel

8f9ccb9a

20 Oct, 2023 2 commits
- Add support for select_last_index attribute for ArgMax & ArgMin (#2235) · 6ae4227a
  Zakor Gyula authored Oct 20, 2023
  
  6ae4227a
- CK GEMM Int8 Bug Fixes (#2229) · f47e0b5b
  turneram authored Oct 19, 2023
```
Adds workarounds to avoid passing capture ops and scalar literals from quantization as arguments to ck_gemm.
```
  f47e0b5b
19 Oct, 2023 2 commits
- Make argument constructor explicit (#2346) · 07848b28
  Paul Fultz II authored Oct 19, 2023
  
  07848b28
- Add flag to accept non-uniform WG sizes (#2167) · 581b1b5f
  Umang Yadav authored Oct 19, 2023
```
* Disable -Wunsafe-buffer-usage when compiling gpu code
```
  581b1b5f
16 Oct, 2023 1 commit

Enable MLIR by default for more cases (#2274) · 650ba45f

Paul Fultz II authored Oct 15, 2023

This will enable MLIR by default for these cases:

- Any convolution fusion
- Any int8 gemm fusion
- All Navi3 standalone convolutions

With a flag (i.e. MIGRAPHX_ENABLE_MLIR) to enable MLIR for floating-point gemm fusions.

Except:

- 3x3 Winograd convolution fusions (except on Navi)
- K > 2048 on gemm (as CK)

Also there is MIGRAPHX_DISABLE_MLIR to disable MLIR completely.

650ba45f

14 Oct, 2023 1 commit
- fix missing exports on Windows (#2327) · 68161431
  Artur Wojcik authored Oct 14, 2023
  
  68161431
13 Oct, 2023 1 commit
- Add CK GEMM-Softmax-GEMM fusion (#2250) · 05e36598
  turneram authored Oct 13, 2023
  
  05e36598
12 Oct, 2023 1 commit

Fix MLIR input fusion non-std shapes from squeeze, flatten and unsqueeze (#2313) · 1a1c1b42

Manupa Karunaratne authored Oct 12, 2023

Currently, MLIR partition candidates receive non-standard shapes because squeeze, flatten and unsqueeze ops are not fused in. These ops can be canonicalized to reshape without introducing additional ops as far as the MLIR backend is concerned.

1a1c1b42

11 Oct, 2023 2 commits
- Update time op to more accurately get device time (#2104) · 34b68ee4
  Paul Fultz II authored Oct 11, 2023
  
  34b68ee4
- a few c++ fixes to allow compilation on Windows (#2282) · a50cb302
  Artur Wojcik authored Oct 11, 2023
  
  a50cb302
06 Oct, 2023 3 commits
- [mlir] Add assertion to prevent sending MLIR dynamic shapes (#2299) · a3cf9951
  Krzysztof Drewniak authored Oct 06, 2023
  
  a3cf9951
- add missing DLL symbols exports (#2281) · 1082f667
  Artur Wojcik authored Oct 07, 2023
  
  1082f667
- prepare for Windows resources with resource script files (#1999) · 9d8331b4
  Artur Wojcik authored Oct 06, 2023
  
  9d8331b4
03 Oct, 2023 1 commit
- just use one flush call (#2272) · 36eaf9e5
  Umang Yadav authored Oct 02, 2023
  
  36eaf9e5
29 Sep, 2023 2 commits
- Changes for the CK + HIPRTC (#2251) · 4188c38e
  Umang Yadav authored Sep 29, 2023
```
Add flags for CK and enable CK with hipRTC. CK can be used with MIGRAPHX_ENABLE_CK=1 and MIGRAPHX_TUNE_CK=1.
```
  4188c38e
- Enable MLIR to be built with MIGraphX (#2184) · 33382894
  Chris Austen authored Sep 29, 2023
```
Enable MLIR performance enhancements with MIGRAPHX_ENABLE_MLIR=1
```
  33382894
28 Sep, 2023 2 commits
- ROCm 5.7 CI update (#2201) · dcc7b0a5
  Ted Themistokleous authored Sep 28, 2023
  
  dcc7b0a5
- Add an error message when gpu_targets is not set (#2234) · d88d8735
  Paul Fultz II authored Sep 28, 2023
  
  d88d8735
27 Sep, 2023 5 commits

Modify reshapes (#2099) · 7e5ccd4b

Ted Themistokleous authored Sep 27, 2023

Modify reshapes to use reshape_lazy for aliasing and reshape for a copying reshape operation, in order to eliminate contiguous.

7e5ccd4b

[mlir] Apply is_mlir_conv predicate in standalone MLIR offloading (#2249) · a761ffaa

Krzysztof Drewniak authored Sep 27, 2023

Previously, the is_mlir_conv predicate was not being used when
offloading standalone convolutions to MLIR on Navi3x, which caused
failures relating to being unable to construct the MLIR program when a
3D convolution was passed in.

This commit amends the standalone lowering to use said predicate, as
well as to include quant_convolution and quant_dot into the set of
operations that get a standalone lowering.

a761ffaa

Fixed MIGraphX+rocMLIR integration regarding fast tuning (#2257) · 75a73214
ravil-mobile authored Sep 27, 2023

75a73214

[MLIR] Capture diagnostics, handle compile failure (#2001) · 48af0bcf

Krzysztof Drewniak authored Sep 27, 2023

Add mlir_logger, which registers a MLIR diagnostic handler that
captures any information generated by a MLIR compile and saves it to a
string.

This will be useful during tuning, where some such errors may be the
result of an inapplicable tuning configuration and should be
suppressed.

48af0bcf

fix order in layernorm matcher and add test for the same (#2189) · 03d8a250
Umang Yadav authored Sep 27, 2023

03d8a250