Commits · dc8edad80d64be0781f47f8c7dedcc54acfbb8bb · gaoqiong / MIGraphX

30 Nov, 2023 1 commit
- [6.1] Add support for dot-(mul)-softmax-dot offloads to MLIR (#2345) · ff485c7a
  Manupa Karunaratne authored Nov 30, 2023
  
  ff485c7a
23 Nov, 2023 1 commit
- Use parallel STL for parallel execution (#2165) · 6aa6c954
  Paul Fultz II authored Nov 23, 2023
  
  6aa6c954
30 Oct, 2023 1 commit
- Add flag to control virtual environments (#2331) · 409fd18c
  Ahsan Saghir authored Oct 30, 2023
  
  409fd18c
16 Oct, 2023 1 commit

Enable MLIR by default for more cases (#2274) · 650ba45f

Paul Fultz II authored Oct 15, 2023

This will enable MLIR by default for these cases:

- Any convolution fusion
- Any int8 gemm fusion
- All Navi3 standalone convolutions
- Floating-point gemm fusions, when a flag (i.e. MIGRAPHX_ENABLE_MLIR) is set

Except:

- 3x3 Winograd convolution fusions (except on Navi)
- K > 2048 on gemm (as CK)

Also, MIGRAPHX_DISABLE_MLIR disables MLIR completely.

650ba45f
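
The selection rules described in #2274 can be read as a small decision function. The following is only a sketch of that reading, not the MIGraphX implementation: the function name `use_mlir`, its parameters, the `gpu` strings, and the choice to treat any non-empty environment value as "set" are all assumptions made for illustration. It also assumes the K > 2048 exception applies to every gemm fusion.

```python
import os


def use_mlir(op_kind, *, dtype="float", gpu="other", winograd_3x3=False,
             k=0, env=None):
    """Hypothetical helper: would the rules listed in #2274 pick MLIR?"""
    env = os.environ if env is None else env
    is_navi = gpu.startswith("navi")

    # MIGRAPHX_DISABLE_MLIR turns MLIR off completely.
    if env.get("MIGRAPHX_DISABLE_MLIR"):
        return False

    if op_kind == "conv_fusion":
        # Exception: 3x3 Winograd convolution fusions, except on Navi.
        return not (winograd_3x3 and not is_navi)

    if op_kind == "standalone_conv":
        # Only Navi3 standalone convolutions are listed.
        return gpu == "navi3"

    if op_kind == "gemm_fusion":
        # Exception: K > 2048 on gemm (assumed to cover all gemm fusions).
        if k > 2048:
            return False
        if dtype == "int8":
            return True
        # Floating-point gemm fusions need the opt-in flag.
        return bool(env.get("MIGRAPHX_ENABLE_MLIR"))

    return False
```

Passing `env` explicitly keeps the sketch easy to test without touching the real process environment.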

12 Oct, 2023 1 commit
- clang-format needed for build (#2320) · 50c5984a
  Chris Austen authored Oct 12, 2023
  
  50c5984a
02 Oct, 2023 1 commit
- Add gfx906 target to Jenkins (#2269) · ae5cc13e
  Chris Austen authored Oct 02, 2023
  
  ae5cc13e
01 Oct, 2023 1 commit
- Fix Jenkinsfile failure (#2265) · f3939b99
  Chris Austen authored Oct 01, 2023
  
  f3939b99
29 Sep, 2023 1 commit

Changes for the CK + HIPRTC (#2251) · 4188c38e

Umang Yadav authored Sep 29, 2023

Add flags for CK and enable CK with hipRTC. CK can be used by setting MIGRAPHX_ENABLE_CK=1 and MIGRAPHX_TUNE_CK=1.

4188c38e
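
A minimal sketch of how the two CK variables named in #2251 might be set for a child process, assuming they are read from the process environment as their names suggest. The helper `ck_env` is hypothetical and not part of MIGraphX; the command to run is left to the caller.

```python
import os


def ck_env(base=None, *, enable=True, tune=True):
    """Hypothetical helper: copy an environment and add the CK variables."""
    env = dict(os.environ if base is None else base)
    if enable:
        env["MIGRAPHX_ENABLE_CK"] = "1"
    if tune:
        env["MIGRAPHX_TUNE_CK"] = "1"
    return env
```

The result can be passed as the `env` argument of `subprocess.run` together with whatever command is being launched.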

28 Sep, 2023 2 commits
- ROCm 5.7 CI update (#2201) · dcc7b0a5
  Ted Themistokleous authored Sep 28, 2023
  
  dcc7b0a5
- Force Onnxruntime build pipe to use mi100+ and no cdna (#2248) · 28614abd
  Ted Themistokleous authored Sep 28, 2023
```
Avoid the vega cards for the ORT build runs.
```
  28614abd
18 Aug, 2023 1 commit
- Disable hidden symbols for now (#2085) · 7ed4954a
  Paul Fultz II authored Aug 17, 2023
  
  7ed4954a
09 Aug, 2023 1 commit
- Add a job to check for export macros (#2016) · ff877d8f
  Paul Fultz II authored Aug 09, 2023
  
  ff877d8f
28 Jul, 2023 1 commit

Load python files in the driver (#1793) · b164ceef

Paul Fultz II authored Jul 28, 2023

The --py output can be loaded back in the driver. This will embed the migraphx interpreter so we can execute the Python directly. There is a migraphx_py library which will dynamically load the version of the library for the Python version that is available on the system.

b164ceef

27 Jul, 2023 1 commit
- rename function 'near' to 'within_abs' (#1995) · 9cd9f1d8
  Artur Wojcik authored Jul 27, 2023
```
* rename function 'near' to 'make_near'

* try disabling vega10 machine
```
  9cd9f1d8
21 Jul, 2023 2 commits

Add back clamping and add tests (#1969) · 6957243c

Umang Yadav authored Jul 21, 2023

Fixes #1957

Clamping was removed in #1853.

It turns out clamping is necessary to handle overflow/underflow cases: during downcasting, a value that overflowed returned infinity without clamping.

6957243c
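
To illustrate why clamping matters when downcasting, here is a small sketch that clamps a value to the finite half-precision range before converting it. It is not the MIGraphX code: the names `HALF_MAX`, `clamp_to_half` and `to_half` are made up for this example, and half precision is only an assumed target type. The round trip uses Python's `struct` format `"e"` (IEEE 754 binary16); in this sketch the clamp also keeps `struct.pack` from rejecting out-of-range values.

```python
import struct

# Largest finite IEEE 754 binary16 (half precision) value.
HALF_MAX = 65504.0


def clamp_to_half(x):
    """Clamp a finite float into the finite half-precision range."""
    return max(-HALF_MAX, min(HALF_MAX, x))


def to_half(x):
    """Clamp, then round-trip through binary16 to get the downcast value."""
    packed = struct.pack("<e", clamp_to_half(x))
    return struct.unpack("<e", packed)[0]
```

With the clamp in place, an input far outside the range maps to the largest finite half value instead of overflowing.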

Make global workitems multiple of local workitems (#1976) · 3216fe52

Umang Yadav authored Jul 20, 2023

HIP requires the number of global work items to be a multiple of the number of local work items. If it is not, correct results are not guaranteed.
Fixes #1977
Fixes #1644
MIGraphX CI has moved to rocm-5.6 which doesn't require hipRTC workarounds

3216fe52
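
The usual way to satisfy this requirement is to round the global size up to the next multiple of the local size. A minimal sketch of that arithmetic follows; the function name `round_up_global` is an assumption for illustration, and whether #1976 uses exactly this formula is not stated in the commit message.

```python
def round_up_global(global_size, local_size):
    """Round global_size up to the nearest multiple of local_size."""
    if local_size <= 0:
        raise ValueError("local_size must be positive")
    # Integer ceiling division, then scale back up to a multiple.
    return ((global_size + local_size - 1) // local_size) * local_size
```

Because the rounded size can exceed the number of elements, a kernel launched this way typically needs a bounds check so the extra work items do nothing.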

18 Jul, 2023 1 commit
- Remove extra stages on jenkins (#1933) · 75e6618c
  Paul Fultz II authored Jul 18, 2023
  
  75e6618c
17 Jul, 2023 1 commit

Enable threading in MLIR (#1899) · 5f5356cc

Krzysztof Drewniak authored Jul 17, 2023

This commit removes the build options to disable threading and removes the mutex in compile_mlir.
The commit being tested is a draft PR on rocMLIR that'll get merged if this passes

5f5356cc

02 Jul, 2023 1 commit

Improvement to ck integration (#1859) · 3c9df3b4

Paul Fultz II authored Jul 02, 2023

Add a CI job to test CK
Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK
Continue tuning even when there are invalid configs
Fix a bug with parallel compilation not using all available threads
Add additional test for gemms using half types
Removed int32 as a supported type since it doesn't pass our test suite

3c9df3b4

31 May, 2023 2 commits
- Check if generate files are different (#1789) · 37711924
  Paul Fultz II authored May 31, 2023
  
  37711924
- Update pass manager to handle multi-target compilation (#1672) · 9473e3a2
  Umang Yadav authored May 31, 2023
```
partially solves #1656
This PR only handles compilation part of multitarget.
```
  9473e3a2
29 May, 2023 1 commit
- Ensure CI labels map correctly (#1780) · 3ea6ff7b
  Chris Austen authored May 29, 2023
  
  3ea6ff7b
19 May, 2023 1 commit

Docsupdate (#1748) · 3557ce90

Chris Austen authored May 18, 2023


Co-authored-by: Sam Wu <sam.wu2@amd.com>
Co-authored-by: Paul <pfultz2@yahoo.com>

3557ce90

22 Mar, 2023 1 commit
- Use version number as part of internal namespace symbol (#1633) · 09aaa63e
  Umang Yadav authored Mar 21, 2023
```
prevent dynamically loading the target library that is not compiled with the same version of MIGraphX core lib.
```
  09aaa63e
13 Mar, 2023 1 commit
- [MLIR] Adds a runtime switch to trigger MLIR (#1610) · 2db587ea
  Manupa Karunaratne authored Mar 13, 2023
```
* [MLIR] Adds a runtime switch to trigger MLIR
```
  2db587ea
16 Feb, 2023 1 commit
- Remove HCC (#1546) · bfd77388
  Umang Yadav authored Feb 16, 2023
```
* deprecate HCC
```
  bfd77388
31 Jan, 2023 1 commit

hipRTC fixes (#1531) · 91cc7242

Umang Yadav authored Jan 31, 2023

Added a CMake flag for hipRTC: MIGRAPHX_USE_HIPRTC.
Added stages in Jenkins for hipRTC.
Fixes for some of the pending issues from hipRTC.

91cc7242

06 Jan, 2023 1 commit
- Add gpu debug builds (#1377) · 863bdfbf
  Paul Fultz II authored Jan 06, 2023
```
Run a stage using MIGRAPHX_GPU_DEBUG=1.
```
  863bdfbf
26 Sep, 2022 1 commit

Rewrite ONNX parse batch norm (#1362) · c00f8202

Charlie Lin authored Sep 26, 2022

Rewrites the BatchNormalization ONNX operator into other MIGX operators
- Added handling of 1D input tensor case (edge case in ONNX spec)
Removes the spatial and per_activation functionality (not in the ONNX spec)
- Did not remove the batch_norm_inference related code as the TensorFlow parser still uses it
- Can remove that code when the TF version is updated

c00f8202
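
For reference, the inference form of BatchNormalization in the ONNX spec is `Y = (X - mean) / sqrt(var + epsilon) * scale + B`, which decomposes into elementwise subtract, add, square root, divide and multiply steps. The sketch below shows that decomposition in plain Python. It does not reproduce the operators actually emitted by #1362, and the function name and the list-of-channels data layout are assumptions chosen to keep the example self-contained.

```python
import math


def batch_norm_inference(x, scale, bias, mean, var, eps=1e-5):
    """Apply per-channel batch norm; x is a list of channels of floats."""
    out = []
    for c, channel in enumerate(x):
        # 1 / sqrt(var + eps), computed once per channel.
        inv_std = 1.0 / math.sqrt(var[c] + eps)
        # (v - mean) * inv_std * scale + bias for every element.
        out.append([(v - mean[c]) * inv_std * scale[c] + bias[c]
                    for v in channel])
    return out
```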

12 Jul, 2022 1 commit

Add tests for C API (#1266) · a7a32a9e

Paul Fultz II authored Jul 12, 2022

This will ensure that migraphx.h can be included from a C compiler, and check that the C API can be called. This includes stdbool.h which is needed when using bool from C.

a7a32a9e

16 Jun, 2022 1 commit
- Use env var for creds · e16faac2
  Paul authored Jun 16, 2022
  
  e16faac2
29 Mar, 2022 1 commit
- Remove Navi from CI temporarily (#1147) · 024b4abc
  Chris Austen authored Mar 29, 2022
```
modify CI temporarily to stop using Navi hardware
```
  024b4abc
05 Nov, 2021 1 commit
- Update Docker to ROCm 4.5 and support Navi on Jenkins (#994) · 04e17804
  kahmed10 authored Nov 05, 2021
```
Moving our Docker file from ROCm 4.3 to 4.5
Add Navi-based GPUs to the CI infrastructure
```
  04e17804
28 Sep, 2021 1 commit
- Remove force depends so we can check for valid dependencies (#965) · 14556631
  Paul Fultz II authored Sep 28, 2021
```
No longer avoid dependency problems and install the half package
```
  14556631
26 Jul, 2021 1 commit
- Remove unknown flag · 3fb74986
  Paul authored Jul 25, 2021
  
  3fb74986
25 Jul, 2021 1 commit
- Move asan to jenkins · 8870e875
  Paul authored Jul 25, 2021
  
  8870e875
29 Apr, 2021 1 commit

MLIR MIOpen Dialect integration (phase 1) (#768) (#769) · 56584fa2

SJW authored Apr 29, 2021



* MLIR MIOpen Dialect integration (phase 1) (#768)

* Added Findmlir.cmake (using environment variables to import)

* Added mlir_conv pass to GPU target

  * Apply to any gpu::convolution if supported by MLIR

  * Call MLIR C-API to generate iGEMM kernel with configuration from gpu::convolution

  * Capture binary in dictionary for matching convolutions

  * Build a code_object_op with the binary and execution dimensions

  * Substitute for the gpu::convolution

* Changed the parameters for the code_object to reflect the generated MLIR kernel

* Expanded out MemRefDescriptor fields in param list

* Also updated for MLIR C-API changes

* fixed global_size calculation

* Added command line option: --enable_mlir

* fixed command line switch

* updated for new MLIR API changes

* Added cget llvm-project-mlir to import MIIR API libraries into Dockerfile
  * removed cmake Findmlir

* updated for changes in MIIR C-API

* updated CMakeLists.txt to allow disable of MLIR import

* fixed memory leaks and removed copies

* updated for 5D memrefs

* formatting

* fixed review comments

* fixed merge issues

* hip gcnDeviceName now includes specifiers at the end
  * use major/minor values instead

* disable MLIR by default

* removed command-line switch --enable-mlir

* fix unused when MLIR disabled

* enable jenkins enable/test MLIR

* format

* fixed clang-tidy

* added new type
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>

56584fa2

09 Apr, 2021 1 commit

Upgrade docker to rocm 4.1 and drop hcc (#795) · 6d937d80

Paul Fultz II authored Apr 09, 2021

* Fix tidy warnings for 4.1

* Formatting

* Upgrade to 4.1 in docker

* Remove hcc build and enable ubsan on clang debug

* Add missing openmp package

* Construct directly

* Construct directly

* Upgrade rocm-cmake version

6d937d80

08 Jan, 2021 1 commit

Revamp CI infrastructure (#706) · ceb4ca09

Paul Fultz II authored Jan 08, 2021



* Add build and test github workflow

* Fix cget command

* Remove def-requirements.txt

* Add tmate session to debug workflow

* Run tmate session after installing dependencies

* Print date periodically

* Add clang tidy action

* Separate build and run container in two different jobs

* Run bash script

* Remove interactive flag

* Try to mount the files

* Try to use the github workspace

* Without double braces

* Use env variable

* Pipe bash script in

* Run using hip-clang

* Use correct path

* Add verbose

* Remove j flag

* Only run for onnx file to debug

* Manually run clang-tidy

* Remove quiet flag

* Print header file

* Printout environment

* Remove extra defines

* Remove fixits and config flag

* Show ldd

* Add tmate session

* Run onnx protobuf first

* Generate proto for tensorflow

* Update cppcheck version

* Fix some cppcheck issues

* Add const

* Cppcheck fixes

* Formatting

* Fix more cppcheck issues

* Run two jobs

* Cache analysis and run format checking

* Fix yaml issues

* Fix yaml issues

* Fix indentation

* Switch to hip-clang for main docker file

* Use hip-clang in the readme

* Fixes for jenkins

* Use ccache to build

* Combine file

* Set restore keys

* Change stage name

* Build with ccache

* Add missing dependency for ccache

* Build debug with codecov

* Fix workflow syntax

* Fix list

* Use quotes

* Go to correct build path

* Install lcov

* Use sudo

* Echo all commands

* Setup tmate

* Add verbose output

* Build with cmake directly

* Add pthread flag

* Remove python config

* Continue on error

* Use on or off for cmake flag

* Use always upload cache

* Verbose output

* Verbose output from build

* Build one target

* Reduce debug symbols

* Increase garbage collection

* Remove dmesg

* Increase it to 20

* Update rocm cmake version

* Remove jobs from jenkins

* Run on all 3 ubuntus

* Remove gcc 5 jobs

* Dont add flag on 16.04

* Only upload coverage on 18.04

* Dont build for ubuntu 20.04

* Use matrix.os

* Use O2 for hip-clang since lower optimizations are broken

* Use rocm 3.0

* Pass ccache as cmake variable instead of env variable

* Build miopen from source

* Show ccache statistics

* Print log information

* Set compression level

* Use hash dir

* Set hashdir

* Install clang ocl from system

* Up compression level

* Add locale

* Increase cache size to 1G

* Lower compression level to 9

* Remove split dwarf

* Remove Og

* Add back Og

* Separate debug and codecov

* Add missing backslash

* Garbage collect more often

* Add missing locales package

* Use Os

* Install onednn in docker and run tests

* Include target headers in tests

* Increase timeout

* Remove if condition

* Make flag public

* Suppress memory leaks in onednn

* Use equal

* Add gh annotations

* Update rocm-cmake version

* Add ldconfig
Co-authored-by: Shucai Xiao <shucai@gmail.com>

ceb4ca09

14 Dec, 2020 1 commit

Use dnnl for cpu backend (#688) · 406afeb8

Paul Fultz II authored Dec 14, 2020



* Add flag to enable cpu backend

* Make buffers shared

* Enable optimizations

* Add onednn

* Formatting

* Formatting

* Add dnnl header

* Formatting

* Rewrite rnn first

* Formatting

* Call reference implementation

* Formatting

* Make literal data shared

* Formatting

* Add convolution

* Formatting

* Compensate for dilation

* Formatting

* Use name/make_op instead

* Formatting

* Rename gemm header

* Formatting

* Add dnnl convolution/gemm operators

* Formatting

* Add eliminate_contiguous

* Add faster pointwise operators

* Formatting

* Formatting

* Formatting

* Add dnnl op class

* Formatting

* Add add op

* Formatting

* Add concat operator

* Formatting

* Add more ops

* Create descriptor during finalization

* Formatting

* Dont rewrite pooling

* Enable memory coloring

* Formatting

* Add output aliases

* Formatting

* Fix errors

* Formatting

* Convert literals

* Add missing file

* Remove batch_norm

* Formatting

* Use strides

* Formatting

* Add some debug checks

* Formatting

* Fix bug in adjusting shape for gemm

* Formatting

* Fix fallback dot operator

* Zero initialize buffers

* Add support for group convolutions

* Formatting

* Make adjust allocation target independent

* Formatting

* Enable adjust_allocation for gpu/cpu

* Formatting

* Add copy to allocation model

* Formatting

* Add copy operator

* Formatting

* Better handling of output parameters in adjust_allocation

* Formatting

* Build with dnnl

* Make dnnl required

* Fix compile error

* Tidy fixes

* Formatting

* Tidy fixes

* Formatting

* Fix more tidy issues

* Formatting

* Add mul op

* Add mul op

* Set c compiler to clang as well

* Compensate for normalized compute shape

* Formatting

* Fix cppcheck errors

* Formatting

* Add onednn library to hcc

* Guard clang pragmas

* Disable cpu mode for gcc for now

* Leave it enabled for gcc 7

* Fix cppcheck suppression

* Fix compile error on gcc 5

* Remove unused code
Co-authored-by: Shucai Xiao <shucai.xiao@amd.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>

406afeb8