- 23 Sep, 2021 4 commits
- 22 Sep, 2021 4 commits
- 10 Sep, 2021 1 commit
-
-
ltqin authored
-
- 09 Sep, 2021 2 commits
- 08 Sep, 2021 4 commits
- 07 Sep, 2021 8 commits
- 06 Sep, 2021 1 commit
-
-
ltqin authored
-
- 03 Sep, 2021 3 commits
- 02 Sep, 2021 5 commits
- 01 Sep, 2021 4 commits
- 31 Aug, 2021 3 commits
-
-
ltqin authored
-
ltqin authored
-
ltqin authored
* start
* modify transformat
* modify device convolution
* modify host
* added host conv bwd and wrw
* remove bwd, separate wrw
* clean
* hacall k to zero
* out log
* fixed
* fixed
* change to (out in wei)
* input hack
* hack to out
* format
* fix by comments
* change wei hacks (wei transform has not merged)
* fix program once issue
* fix review comment
* fix vector load issue
* tweak

Co-authored-by: ltqin <letaoqin@amd.com>
Co-authored-by: Jing Zhang <jizhan@amd.com>
Co-authored-by: Chao Liu <chao.liu2@amd.com>
-
- 27 Aug, 2021 1 commit
-
-
Chao Liu authored
* use cast_pointer_to_generic_address_space() in v6r1 kernel wrapper, DynamicBuffer and buffer_load take customized invalid-element-value, add buffer_load/store for fp64
* use remove_cvref_t
-