Commits · 78eaf2b80d39277a59ff600573949740439259d3 · gaoqiong / MIGraphX

26 Feb, 2021 1 commit

Add more supported operators and optimizations for the cpu backend (#746) · a0b570b2

Paul Fultz II authored Feb 26, 2021



* Add eliminate_data_type pass

* Formatting

* Auto convert quant ops

* Formatting

* Flip the order of decompose

* Compute max size differently

* Formatting

* Clamp values in convert

* Formatting

* Fix loss of precision in reduce

* Formatting

* Fix bugs in reduction

* Fix accumulator type in reference softmax implementation

* Formatting

* Update convert test

* Remove unused variables

* Remove unnecessary quant_dot check

* Formatting

* Add tests

* Formatting

* Remove unused code

* Remove duplicate ops

* Remove blaze dependency

* Use set since shape::type_t is no hashable on gcc 5

* Formatting

* Add dnnl binary op

* Formatting

* Add binary and eltwise

* Formatting

* Add softmax

* Formatting

* Remove unused operators

* Add missing files

* Formatting

* Add lrn

* Formatting

* Add deconvolution

* Formatting

* Change allocate default

* Add reorder

* Formatting

* Add reductions

* Formatting

* Sort lines

* Change literals in another loop

* Add pow operator

* Formatting

* Add pow operator

* Formatting

* Make sure shapes are packed

* Allow broadcasted inputs

* Remove unused operators

* Simplify functions

* Remove softmax

* Add sub and erf functions

* Formatting

* Fix bug

* Formatting

* Improve parallism

* Formatting

* Allow multiple batch dimensions

* Formatting

* Move literal transforms out of lowering

* Formatting

* Add gather operator

* Sort lines

* Add early exit for carry

* Formatting

* Add missing concat

* Rename macro

* Fix deep nesting

* Formatting

* Fix cppcheck issues

* Remov else

* Move attribute to typedef

* Formatting

* Disable maybe-uninitialized warning since its broken on gcc

* Add constexpr default constructor

* Formatting

* Fix compiler warnings

* Fix adjust_allocation test
Co-authored-by: Shucai Xiao <shucai@gmail.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>

a0b570b2

08 Feb, 2021 1 commit

Add a pass to remove unsupported data types (#738) · 3d24a21c

Paul Fultz II authored Feb 07, 2021



* Add eliminate_data_type pass

* Formatting

* Auto convert quant ops

* Formatting

* Flip the order of decompose

* Compute max size differently

* Formatting

* Clamp values in convert

* Formatting

* Fix loss of precision in reduce

* Formatting

* Fix bugs in reduction

* Fix accumulator type in reference softmax implementation

* Formatting

* Update convert test

* Remove unused variables

* Remove unnecessary quant_dot check

* Formatting

* Add tests

* Formatting

* Remove unused code

* Remove duplicate ops

* Remove blaze dependency

* Use set since shape::type_t is no hashable on gcc 5

* Formatting
Co-authored-by: Shucai Xiao <shucai@gmail.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>

3d24a21c

14 Dec, 2020 1 commit

Use dnnl for cpu backend (#688) · 406afeb8

Paul Fultz II authored Dec 14, 2020



* Add flag to enable cpu backend

* Make buffers shared

* Enable optimizations

* Add onednn

* Formatting

* Formatting

* Add dnnl header

* Formatting

* Rewrite rnn first

* Formatting

* Call reference implementation

* Formatting

* Make literal data shared

* Formatting

* Add convolution

* Formatting

* Compensate for dilation

* Formatting

* Use name/make_op instead

* Formatting

* Rename gemm header

* Formatting

* Add dnnl convolution/gemm operators

* Formatting

* Add eliminate_contiguous

* Add faster pointwise operators

* Formatting

* Formatting

* Formatting

* Add dnnl op class

* Formatting

* Add add op

* Formatting

* Add concat operator

* Formatting

* Add more ops

* Create descriptor during finalization

* Formatting

* Dont rewrite pooling

* Enable memory coloring

* Formatting

* Add output aliases

* Formatting

* Fix errors

* Formatting

* Convert literals

* Add missing file

* Remove batch_norm

* Formatting

* Use strides

* Formatting

* Add some debug checks

* Formatting

* Fix big in adjusting shape for gemm

* Formatting

* Fix fallback dot operator

* Zero initialize buffers

* Add suport for group convolutions

* Formatting

* Make adjust allocation target independent

* Formatting

* Enable adjust_allocation for gpu/cpu

* Formatting

* Add copy to allocation model

* Formatting

* Add copy operator

* Formatting

* Better handling of output parameters in adjust_allocation

* Formatting

* Build with dnnl

* Make dnnl required

* Fix compile error

* Tidy fixes

* Formatting

* Tidy fixes

* Formatting

* Fix more tidy issues

* Formatting

* Add mul op

* Add mul op

* Set c compiler to clang as well

* Compensate for normalized compute shape

* Formatting

* Fix cppcheck errors

* Formatting

* Add onednn library to hcc

* Guard clang pragmas

* Disable cpu mode for gcc for now

* Leave it enabled it for gcc 7

* Fix cppcheck suppresion

* Fix compile error on gcc 5

* Remove unused code
Co-authored-by: Shucai Xiao <shucai.xiao@amd.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>

406afeb8

16 Nov, 2020 1 commit

Normalize ops (#667) · 8443ecd1

Shucai Xiao authored Nov 16, 2020



* add a pass to normalize ops

* clang format

* add unit tests

* clang format

* code backup

* clang format

* code backup

* clang format

* add support for slice in the normalize_op function

* clang format

* add operation method api for whether we need to call normalize_op

* clang format

* fix review comments

* clang format

* rename a function namejJ

* clang format

* change compute_shape to normalize_compute_shape for corresponding operators

* clang format

* remove unnecessary code

* fix various issues

* clang format

* add attributes to operators having axis attributes

* clang format

* fixed jenkins build error

* clang format

* fix a bug related to slice

* clang format

* code backup

* clang format

* code backup

* clang format

* rename a file

* fix cppcheck error

* some code refinement

* clang format

* change attributes to enum

* clang format

* refine the enum

* clang format

* remove unnecessary code

* add unit tests for more code coverage and fixed a bug

* clang format

* remove unnecessary changes

* change normalize_axes to normalize

* clang format

* revert back the changes in broadcast.hpp

* rename normalize_axes to normalize

* fix review comments

* clang format

* Add flag to enable cpu backend

* Make buffers shared

* Enable optimizations

* Formatting

* Try to avoid ambiguous assign in value class

* fixed a build error

* clang format

* add the normalize_ops pass to the ref target

* refactor program to module to normalize_ops pass
Co-authored-by: Paul <pfultz2@yahoo.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>

8443ecd1

10 Nov, 2020 1 commit

Add flag to enable cpu backend (#680) · d39e51ed

Paul Fultz II authored Nov 10, 2020

* Add flag to enable cpu backend

* Make buffers shared

* Enable optimizations

* Formatting

* Enable cpu backend for gcc builds

d39e51ed

10 Sep, 2020 1 commit

Add load/save function for program (#623) · 63c5582a

Paul Fultz II authored Sep 09, 2020



* Add save/load functions

* Formatting

* Add loading and saving to the driver

* Formatting

* Add return

* Serialize the context with the program

* Formatting

* Add python API

* Formatting

* Add c/c++ apis

* Formatting

* Add tests

* Formatting

* Fix tidy error

* Fix python doc

* Restore python code

* Add function name to errors

* Formatting

* Use lvalue for writing

* Serialize context

* Fix convolution and pooling operator for miopen

* Formatting

* Add const ref

* Set target name to gpu

* Add target tests

* Formatting

* Move register target to cpp file

* Fix target test

* Use make_target in driver

* Formatting

* Use make_target for the API

* Formatting

* Add cpu include

* Increase timeout

* Add more tests

* Formatting
Co-authored-by: Shucai Xiao <shucai.xiao@amd.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>

63c5582a

15 Nov, 2019 1 commit

Add option to do offload copying automatically (#403) · 81b0ff5d

Paul Fultz II authored Nov 15, 2019

* Add compiler options

* Add copy operators

* Formatting

* Use run_passes in tests

* Formatting

* Use run_pass in schedule test

* Formatting

* Add compile_options to get_passes in target

* Formatting

* Offload copy option

* Formatting

* Copy using pinned memory

* Formatting

* Improve performance of gpu copying

* Formatting

* Dont copy

* Formatting

* Always make an extra copy

* Formatting

* Remove unused write op

* Add missing include

* Remove copy_to_gpu function in python api

* Make offload copy disabled by default on C++

* Formatting

* Fix tidy issues

* Formatting

* Fix namespace

* Fix python tests

* Turn clang format off since its broken

* Fix compile error on gcc 5

* Remove commented code

81b0ff5d

26 Aug, 2019 2 commits
- clang format · 8fbd2874
  Shucai Xiao authored Aug 26, 2019
  
  8fbd2874
- refine int8 quantization interface · 12ff93ac
  Shucai Xiao authored Aug 26, 2019
  
  12ff93ac
05 Mar, 2019 1 commit
- clang format · dbcf2dec
  Shucai Xiao authored Mar 05, 2019
  
  dbcf2dec
21 Feb, 2019 1 commit
- Reduce pass include · e8009f7b
  Paul authored Feb 20, 2019
  
  e8009f7b
07 Feb, 2019 1 commit
- merge rnn operator rewritting into one file, so only one pass is needed · 3c7b6d27
  Shucai Xiao authored Feb 07, 2019
  
  3c7b6d27
04 Feb, 2019 6 commits
- add the rewrite_gru pass. · e3df7b35
  Shucai Xiao authored Feb 04, 2019
  
  e3df7b35
- clang format. · 76a4fbbc
  Shucai Xiao authored Feb 04, 2019
  
  76a4fbbc
- fix a merge conflict · 75f3d056
  Shucai Xiao authored Feb 04, 2019
  
  75f3d056
- clang format · b76e438a
  Shucai Xiao authored Feb 04, 2019
  
  b76e438a
- clang format. · 0cc5b80e
  Shucai Xiao authored Feb 04, 2019
  
  0cc5b80e
- fixed a bug and added one more tests. · 1b4ebf8e
  Shucai Xiao authored Feb 04, 2019
  
  1b4ebf8e
23 Jan, 2019 2 commits
- clang format · f792097f
  Shucai Xiao authored Jan 23, 2019
  
  f792097f
- fixed build error. · d4594903
  Shucai Xiao authored Jan 23, 2019
  
  d4594903
22 Jan, 2019 1 commit
- add a pass for rnn operator. · a2ea4ecd
  Shucai Xiao authored Jan 22, 2019
  
  a2ea4ecd
27 Nov, 2018 1 commit
- Rename more things to migraphx · 0b217041
  Paul authored Nov 27, 2018
  
  0b217041
14 Nov, 2018 1 commit
- Rename to migraphx · 96358e41
  Paul authored Nov 14, 2018
  
  96358e41
06 Nov, 2018 9 commits
- fixed the format error related to clang-tidy. · d918b57f
  Shucai Xiao authored Nov 06, 2018
  
  d918b57f
- change the macro MIGRAPH_ININE_NS back to version_1 in .cpp files. · 1aaccf06
  Shucai Xiao authored Nov 05, 2018
  
  1aaccf06
- clang format of changed files. · 801ca743
  Shucai Xiao authored Nov 05, 2018
  
  801ca743
- fixed the warnings related to inline namespace. · 80246f67
  Shucai Xiao authored Nov 05, 2018
  
  80246f67
- Added inline namespace for all .hpp and .cpp file. · d79346d1
  Shucai Xiao authored Nov 05, 2018
  
  d79346d1
- change the macro MIGRAPH_ININE_NS back to version_1 in .cpp files. · c05e72e2
  Shucai Xiao authored Nov 05, 2018
  
  c05e72e2
- clang format of changed files. · d6d386f7
  Shucai Xiao authored Nov 05, 2018
  
  d6d386f7
- fixed the warnings related to inline namespace. · 7f9d85cc
  Shucai Xiao authored Nov 05, 2018
  
  7f9d85cc
- Added inline namespace for all .hpp and .cpp file. · e1ef1e17
  Shucai Xiao authored Nov 05, 2018
  
  e1ef1e17
02 Nov, 2018 1 commit

Remove cpu from names (#102) · 0d0778b7

Shucai Xiao authored Nov 02, 2018

* add the slice test example on gpu.

* change the gpu slice test according to comments.

* rename cpu_lowering to lowering, rename cpu_target to target, so consistent with gpu side.

* fix the format of a file CMakeLists.txt.

* Revert "change the gpu slice test according to comments."

This reverts commit 721bbb180d11811dc914d60fd8a1c91926e3f947.

* Revert "add the slice test example on gpu."

This reverts commit 68dabb05adffd429e5e5d10c3a1def2b06489f63.

* fix a format for the file doc/src/reference/targets.rst

0d0778b7