- 22 Nov, 2023 4 commits
-
-
Paul Fultz II authored
-
Umang Yadav authored
-
Paul Fultz II authored
-
Mirza Halilčević authored
Introduce dilations attribute to pooling operators reference implementation.
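As a rough illustration of what a dilations attribute means for pooling, here is a minimal 1-D max-pooling sketch without padding. The function name `max_pool_1d` and its parameters are hypothetical and are not taken from the MIGraphX reference implementation. With dilation `d`, a window of `kernel` elements covers `d * (kernel - 1) + 1` input positions.

```cpp
#include <algorithm>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical 1-D max pooling with dilation and no padding.
// Each window reads `kernel` elements spaced `dilation` apart.
std::vector<float> max_pool_1d(const std::vector<float>& in,
                               std::size_t kernel,
                               std::size_t stride,
                               std::size_t dilation)
{
    std::vector<float> out;
    if(kernel == 0 || stride == 0 || dilation == 0)
        return out;
    // Number of input positions covered by one dilated window.
    const std::size_t span = dilation * (kernel - 1) + 1;
    for(std::size_t start = 0; start + span <= in.size(); start += stride)
    {
        float m = std::numeric_limits<float>::lowest();
        for(std::size_t k = 0; k < kernel; ++k)
            m = std::max(m, in[start + k * dilation]);
        out.push_back(m);
    }
    return out;
}
```

With `dilation = 1` this reduces to ordinary max pooling.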
-
- 21 Nov, 2023 2 commits
-
-
Artur Wojcik authored
This PR adds support for Windows resource files to Embed.cmake. The option is ON by default on Windows; when it is OFF, *.cpp files are used instead. The same applies on Linux: ON produces *.o files (LD) and OFF produces *.cpp files. This PR also fixes building resources on Linux with the ld and objcopy commands.
-
Artur Wojcik authored
-
- 17 Nov, 2023 1 commit
-
-
Umang Yadav authored
Handles all 4 FP8 dtypes listed in the ONNX documentation (https://onnx.ai/onnx/technical/float8.html) and follows the saturation/clipping logic from the table there (https://onnx.ai/onnx/technical/float8.html#cast). Only fp8e4m3fnuz is added to the MIGraphX IR for now.
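The saturating part of such a cast can be sketched as a clamp to the finite range of the target type. The helper below is hypothetical and is not the MIGraphX implementation. It models only the saturate-enabled clipping step for fp8e4m3fnuz, whose largest finite magnitude is 2^7 * (1 + 7/8) = 240, and it does not round to 8 bits.

```cpp
#include <algorithm>
#include <cmath>

// Largest finite magnitude representable in fp8e4m3fnuz.
constexpr float fp8e4m3fnuz_max = 240.0f;

// Hypothetical helper: clip a float into the finite fp8e4m3fnuz range
// before narrowing. Assumes saturation is enabled; NaN is passed through.
inline float saturate_fp8e4m3fnuz(float x)
{
    if(std::isnan(x))
        return x;
    return std::max(-fp8e4m3fnuz_max, std::min(x, fp8e4m3fnuz_max));
}
```

Out-of-range values, including infinities, end up at the largest finite magnitude with the sign preserved.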
-
- 16 Nov, 2023 1 commit
-
-
Artur Wojcik authored
-
- 08 Nov, 2023 2 commits
-
-
Zakor Gyula authored
The inaccuracy arose because ONNX Round requires halfway (0.5) cases to be rounded to the nearest even integer. std::round rounds halfway cases away from zero and therefore gave wrong results for them. Replaced std::round with std::nearbyint, which uses the correct rounding by default.
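The difference between the two functions can be seen in a small sketch. The wrapper names are hypothetical, and the code assumes the default floating-point rounding mode (FE_TONEAREST) is in effect, since std::nearbyint follows the current rounding mode.

```cpp
#include <cmath>

// Hypothetical wrapper: halfway cases go to the nearest even integer,
// assuming the default rounding mode (FE_TONEAREST) has not been changed.
inline double round_half_even(double x)
{
    return std::nearbyint(x);
}

// Hypothetical wrapper: std::round sends halfway cases away from zero.
inline double round_half_away(double x)
{
    return std::round(x);
}
```

For example, 2.5 becomes 2 under round_half_even but 3 under round_half_away, while 3.5 becomes 4 under both.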
-
Brian Pickrell authored
-
- 07 Nov, 2023 2 commits
-
-
Paul Fultz II authored
-
Zakor Gyula authored
-
- 04 Nov, 2023 1 commit
-
-
Artur Wojcik authored
-
- 30 Oct, 2023 2 commits
-
-
Umang Yadav authored
-
Artur Wojcik authored
-
- 24 Oct, 2023 1 commit
-
-
Paul Fultz II authored
-
- 21 Oct, 2023 1 commit
-
-
Paul Fultz II authored
Add tracing to benchmark to show which kernels are running and the time of every kernel
-
- 20 Oct, 2023 2 commits
-
-
Zakor Gyula authored
-
turneram authored
Adds workarounds to avoid passing capture ops and scalar literals from quantization as arguments to ck_gemm.
-
- 19 Oct, 2023 2 commits
-
-
Paul Fultz II authored
-
Umang Yadav authored
Disable -Wunsafe-buffer-usage when compiling GPU code
-
- 16 Oct, 2023 1 commit
-
-
Paul Fultz II authored
This enables MLIR by default for the following cases: any convolution fusion; any int8 gemm fusion; all Navi3 standalone convolutions. A flag (i.e. MIGRAPHX_ENABLE_MLIR) enables MLIR for floating-point gemm fusions. Exceptions: 3x3 Winograd convolution fusions (except on Navi) and K > 2048 on gemm (as CK). There is also MIGRAPHX_DISABLE_MLIR to disable MLIR completely.
-
- 14 Oct, 2023 1 commit
-
-
Artur Wojcik authored
-
- 13 Oct, 2023 1 commit
-
-
turneram authored
-
- 12 Oct, 2023 1 commit
-
-
Manupa Karunaratne authored
Currently, MLIR partition candidates receive non-standard shapes because squeeze, flatten and unsqueeze ops are not fused in. These ops can be canonicalized to reshape without introducing additional ops as far as the MLIR backend is concerned.
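The reason these three ops can be rewritten as reshape is that each one only changes the dimension list while keeping the element count. A minimal sketch of the target dimensions follows. The function names and the `dims` alias are hypothetical and do not come from the MIGraphX sources, and the axes are assumed to be non-negative and valid.

```cpp
#include <algorithm>
#include <cstddef>
#include <functional>
#include <numeric>
#include <vector>

using dims = std::vector<std::size_t>;

// flatten(axis): collapse everything before `axis` and everything from it on.
dims flatten_dims(const dims& d, std::size_t axis)
{
    auto mid = d.begin() + static_cast<std::ptrdiff_t>(axis);
    std::size_t outer = std::accumulate(d.begin(), mid, std::size_t{1}, std::multiplies<>{});
    std::size_t inner = std::accumulate(mid, d.end(), std::size_t{1}, std::multiplies<>{});
    return {outer, inner};
}

// squeeze(axes): drop the listed axes, which are assumed to have extent 1.
dims squeeze_dims(const dims& d, const std::vector<std::size_t>& axes)
{
    dims out;
    for(std::size_t i = 0; i < d.size(); ++i)
        if(std::find(axes.begin(), axes.end(), i) == axes.end())
            out.push_back(d[i]);
    return out;
}

// unsqueeze(axes): insert extent-1 axes at the listed output positions.
dims unsqueeze_dims(const dims& d, const std::vector<std::size_t>& axes)
{
    dims out(d.size() + axes.size(), 1);
    std::size_t src = 0;
    for(std::size_t i = 0; i < out.size(); ++i)
        if(std::find(axes.begin(), axes.end(), i) == axes.end())
            out[i] = d[src++];
    return out;
}
```

Each result could then serve as the target dimensions of a single reshape.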
-
- 11 Oct, 2023 2 commits
-
-
Paul Fultz II authored
-
Artur Wojcik authored
-
- 06 Oct, 2023 3 commits
-
-
Krzysztof Drewniak authored
-
Artur Wojcik authored
-
Artur Wojcik authored
-
- 03 Oct, 2023 1 commit
-
-
Umang Yadav authored
-
- 29 Sep, 2023 2 commits
-
-
Umang Yadav authored
Add flags for CK and enable CK with hipRTC. CK can be used by setting MIGRAPHX_ENABLE_CK=1 and MIGRAPHX_TUNE_CK=1.
-
Chris Austen authored
Enable MLIR performance enhancements with MIGRAPHX_ENABLE_MLIR=1
-
- 28 Sep, 2023 2 commits
-
-
Ted Themistokleous authored
-
Paul Fultz II authored
-
- 27 Sep, 2023 5 commits
-
-
Ted Themistokleous authored
Modify reshapes to use reshape_lazy for aliasing and reshape for a copying reshape operation, in order to eliminate contiguous operations.
-
Krzysztof Drewniak authored
Previously, the is_mlir_conv predicate was not used when offloading standalone convolutions to MLIR on Navi3x, which caused failures to construct the MLIR program when a 3D convolution was passed in. This commit amends the standalone lowering to use that predicate and adds quant_convolution and quant_dot to the set of operations that get a standalone lowering.
-
ravil-mobile authored
-
Krzysztof Drewniak authored
Add mlir_logger, which registers an MLIR diagnostic handler that captures any information generated by an MLIR compile and saves it to a string. This will be useful during tuning, where some such errors may result from an inapplicable tuning configuration and should be suppressed.
-
Umang Yadav authored
-