- 14 Dec, 2020 1 commit
-
-
Paul Fultz II authored
* Add flag to enable cpu backend
* Make buffers shared
* Enable optimizations
* Add onednn
* Formatting
* Formatting
* Add dnnl header
* Formatting
* Rewrite rnn first
* Formatting
* Call reference implementation
* Formatting
* Make literal data shared
* Formatting
* Add convolution
* Formatting
* Compensate for dilation
* Formatting
* Use name/make_op instead
* Formatting
* Rename gemm header
* Formatting
* Add dnnl convolution/gemm operators
* Formatting
* Add eliminate_contiguous
* Add faster pointwise operators
* Formatting
* Formatting
* Formatting
* Add dnnl op class
* Formatting
* Add add op
* Formatting
* Add concat operator
* Formatting
* Add more ops
* Create descriptor during finalization
* Formatting
* Don't rewrite pooling
* Enable memory coloring
* Formatting
* Add output aliases
* Formatting
* Fix errors
* Formatting
* Convert literals
* Add missing file
* Remove batch_norm
* Formatting
* Use strides
* Formatting
* Add some debug checks
* Formatting
* Fix bug in adjusting shape for gemm
* Formatting
* Fix fallback dot operator
* Zero initialize buffers
* Add support for group convolutions
* Formatting
* Make adjust allocation target independent
* Formatting
* Enable adjust_allocation for gpu/cpu
* Formatting
* Add copy to allocation model
* Formatting
* Add copy operator
* Formatting
* Better handling of output parameters in adjust_allocation
* Formatting
* Build with dnnl
* Make dnnl required
* Fix compile error
* Tidy fixes
* Formatting
* Tidy fixes
* Formatting
* Fix more tidy issues
* Formatting
* Add mul op
* Add mul op
* Set c compiler to clang as well
* Compensate for normalized compute shape
* Formatting
* Fix cppcheck errors
* Formatting
* Add onednn library to hcc
* Guard clang pragmas
* Disable cpu mode for gcc for now
* Leave it enabled for gcc 7
* Fix cppcheck suppression
* Fix compile error on gcc 5
* Remove unused code

Co-authored-by: Shucai Xiao <shucai.xiao@amd.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 04 Nov, 2020 1 commit
-
-
Paul Fultz II authored
* Add all_targets cmake target
* Rename target
* Add ref target
* Rename tests
* Refactor compiler target
* Formatting
* Verify for every target
* Formatting
* Add verify test suite
* Formatting
* Add initial test programs
* Formatting
* Add rnn tests
* Formatting
* Validate gpu
* Formatting
* Remove old gpu tests
* Fix gpu tests
* Fix ref error
* Fix tidy issues
* Formatting
* Tidy fixes
* Fix header in python api
* Rename to ref
* Use ref in verify_onnx
* Fix tidy issue
* Build with verbose on
* Fix typo
* Remove verbose
* rename some cpu prefix to ref

Co-authored-by: Shucai Xiao <Shucai.Xiao@amd.com>
-
- 17 Nov, 2019 1 commit
-
-
Paul authored
-
- 04 Sep, 2019 1 commit
-
-
Paul authored
-
- 14 Nov, 2018 1 commit
-
-
Paul authored
-
- 09 Nov, 2018 1 commit
-
-
Paul authored
-
- 08 Nov, 2018 1 commit
-
-
Paul authored
-
- 02 Nov, 2018 1 commit
-
-
Shucai Xiao authored
* add the slice test example on gpu.
* change the gpu slice test according to comments.
* rename cpu_lowering to lowering, rename cpu_target to target, so consistent with gpu side.
* fix the format of a file CMakeLists.txt.
* Revert "change the gpu slice test according to comments." This reverts commit 721bbb180d11811dc914d60fd8a1c91926e3f947.
* Revert "add the slice test example on gpu." This reverts commit 68dabb05adffd429e5e5d10c3a1def2b06489f63.
* fix a format for the file doc/src/reference/targets.rst
-
- 18 Sep, 2018 2 commits
- 13 Sep, 2018 1 commit
-
-
mei-ye authored
-
- 11 Sep, 2018 1 commit
-
-
mei-ye authored
-
- 01 Sep, 2018 1 commit
-
-
Paul Fultz II authored
-
- 31 Aug, 2018 1 commit
-
-
mei-ye authored
-
- 08 Aug, 2018 1 commit
-
-
mei-ye authored
-
- 05 Aug, 2018 1 commit
-
-
Paul authored
-
- 04 Aug, 2018 4 commits
- 16 Jul, 2018 1 commit
-
-
Paul authored
-
- 02 Jul, 2018 1 commit
-
-
Paul authored
-
- 21 May, 2018 1 commit
-
-
Paul authored
-