- 10 Nov, 2023 3 commits
-
-
Khalique Ahmed authored
-
Artur Wojcik authored
-
Artur Wojcik authored
-
- 09 Nov, 2023 3 commits
-
-
Krzysztof Drewniak authored
This is a companion PR to https://github.com/ROCmSoftwarePlatform/rocMLIR/pull/1312 . The updated commit hash points onto that PR branch, so coordinated merges are advised. With the above rocMLIR changes, the MLIR MIGraphX dialect now represents both the dimensions and strides of tensors inside MLIR, thus allowing NHWC convolutions to be correctly offloaded. In this PR, we: - Remove special handling for the case where non-standard shapes become input to MLIR modules - Fold broadcast and multibroadcast operations into the input size od MLIR modules - Update tests - Add an extra TRACE_MLIR print to help debug crashes in the high-level pipeline
-
Charlie Lin authored
-
Artur Wojcik authored
-
- 08 Nov, 2023 6 commits
-
-
Zakor Gyula authored
The inaccuracy was caused by ONNX round requires nearest integer rounding for halway (0.5) cases. std::round rounds away from zero, thus giving wrong results with halfway cases. Replaced std::round with std::nearbyint which uses the correct rounding by default.
-
Charlie Lin authored
Fixes an issue that comes up for variable input slice with steps set manually in ONNX to default 1's.
-
Attila Dusnoki authored
-
Brian Pickrell authored
-
Charlie Lin authored
Simplifies dimensions_of instructions to a literal when possible. Intended to be used after the split_single_dyn_dim pass.
-
kahmed10 authored
-
- 07 Nov, 2023 3 commits
-
-
Paul Fultz II authored
-
Zakor Gyula authored
-
Zakor Gyula authored
-
- 04 Nov, 2023 2 commits
-
-
Artur Wojcik authored
-
Mirza Halilčević authored
Since opset version 18, the Split operator allows splitting into unevenly sized outputs when the split input is not present, and a num_outputs attribute has been introduced.
-
- 03 Nov, 2023 1 commit
-
-
Brian Pickrell authored
-
- 31 Oct, 2023 2 commits
-
-
Paul Fultz II authored
-
Charlie Lin authored
-
- 30 Oct, 2023 4 commits
-
-
Umang Yadav authored
-
Charlie Lin authored
-
Artur Wojcik authored
-
Artur Wojcik authored
-
- 27 Oct, 2023 1 commit
-
-
Paul Fultz II authored
-
- 25 Oct, 2023 1 commit
-
-
Attila Dusnoki authored
-
- 24 Oct, 2023 1 commit
-
-
Paul Fultz II authored
-
- 23 Oct, 2023 1 commit
-
-
pramenku authored
-
- 21 Oct, 2023 1 commit
-
-
Paul Fultz II authored
Add tracing to benchmark to show which kernels are running and the time of every kernel
-
- 20 Oct, 2023 4 commits
-
-
Zakor Gyula authored
-
turneram authored
Adds workarounds to avoid passing capture ops and scalar literals from quantization as arguments to ck_gemm.
-
nives-vukovic authored
-
music-dino authored
-
- 19 Oct, 2023 2 commits
-
-
Paul Fultz II authored
-
Umang Yadav authored
* Disable -Wunsafe-buffer-usage when compiling gpu code
-
- 18 Oct, 2023 2 commits
-
-
Zakor Gyula authored
-
Charlie Lin authored
-
- 17 Oct, 2023 3 commits
-
-
Attila Dusnoki authored
-
Attila Dusnoki authored
-
Charlie Lin authored
-