"vscode:/vscode.git/clone" did not exist on "e6dc8a144a6d16a4c2c68fb530238b439aea1a97"
- 13 Oct, 2022 1 commit
-
-
Adam Osewski authored
* Move kernel implementation files under impl directory. * Update examples paths. * Update device kernel impl include paths. * Update tensor operation instances include paths. * Update profiler and tests include paths. * Clang-format * Update include paths for batched gemm reduce * Refactor UnitTest ConvNDBwdWeight. * Refactor fwd and bwd data convND UT. * Fix used test macro. * Fix include path. * Fix include paths. * Fix include paths in profiler and tests. * Fix include paths. Co-authored-by:Adam Osewski <aosewski@amd.com>
-
- 25 Aug, 2022 1 commit
-
-
Qianfeng authored
* Add int4 reduction examples * Contain all using of int4_t inside the pre-compiling condition checking
-
- 13 Aug, 2022 1 commit
-
-
Qianfeng authored
* Update the reduce_blockwise example to support user specified data type and input+reducing dimensions * Add examples for using reduce_multiblock_atomic_add * Add more running examples to the default command-line * Remove un-necessary header including * Update to the example README.md
-