"vscode:/vscode.git/clone" did not exist on "35d644628fd31b1df00e2bc5b601b89c5fd335c6"
- 05 Sep, 2021 2 commits
- 31 Aug, 2021 1 commit
-
-
ltqin authored
* start * modify transformat * modify device convolutiion * modify host * added host conv bwd and wrw * remove bwd, seperate wrw * clean * hacall k to zero * out log * fixed * fixed * change to (out in wei) * input hack * hack to out * format * fix by comments * change wei hacks(wei transform has not merge) * fix program once issue * fix review comment * fix vector load issue * tweak Co-authored-by:
ltqin <letaoqin@amd.com> Co-authored-by:
Jing Zhang <jizhan@amd.com> Co-authored-by:
Chao Liu <chao.liu2@amd.com>
-
- 27 Aug, 2021 1 commit
-
-
Chao Liu authored
* use cast_pointer_to_generic_address_space() in v6r1 kernel wrapper, DynamcBuffer and buffer_load take customized invalid-element-value, add buffer_load/store for fp64 * use remove_cvref_t
-
- 25 Aug, 2021 1 commit
-
-
zjing14 authored
* add f32/i32 atomicAdd support into dynamicBuffer, and enable it in v1r3 * fixed * fixed * update comment Co-authored-by:Chao Liu <chao.liu2@amd.com>
-
- 23 Aug, 2021 2 commits
- 19 Aug, 2021 2 commits
-
-
zjing14 authored
* xdlops refactor * fixed commnt * clean xdlops_gemm * add make c into xldops-gemm * change mfma_info * refactor xdlops, hide c desc * clean * clean * clean * apply hacks changes to v4r4r4_nhwc * rename hacks and use single stage adapter * enable fp16 mfma
-
zjing14 authored
* added host conv wrw
-
- 16 Aug, 2021 4 commits
- 13 Aug, 2021 3 commits
- 11 Aug, 2021 2 commits
- 10 Aug, 2021 8 commits
- 09 Aug, 2021 8 commits
- 08 Aug, 2021 1 commit
-
-
Chao Liu authored
-
- 07 Aug, 2021 2 commits
- 06 Aug, 2021 3 commits