- 16 Sep, 2024 1 commit
-
-
carlushuang authored
-
- 14 Sep, 2024 1 commit
-
-
carlushuang authored
-
- 13 Sep, 2024 7 commits
-
-
carlushuang authored
-
carlushuang authored
-
carlushuang authored
-
carlushuang authored
-
carlushuang authored
-
carlushuang authored
-
carlushuang authored
-
- 12 Sep, 2024 2 commits
-
-
Illia Silin authored
-
Mateusz Ozga authored
* Add pool2d instance BWD AVG * Add pool2d instance BWD MAX * Fix: avg review * Fix review: part2 * Fix - enable test when type is compiled * Fix review part3
-
- 11 Sep, 2024 2 commits
-
-
jakpiase authored
* added pool2d fwd * add tests * add reviewers changes * Revert "Merge remote-tracking branch 'origin/develop' into jakpiase/pool2d_fwd_new" This reverts commit 6b2ba7ff8960b0a6ddbe30d8dac53eeb55a8597e, reversing changes made to 22c82bea0caf3e0f29399100c1bb67b8003fc042. * Revert "add reviewers changes" This reverts commit 22c82bea0caf3e0f29399100c1bb67b8003fc042. * added reviewers comments * revert some old files * add reviewers requests --------- Co-authored-by:Adam Osewski <19374865+aosewski@users.noreply.github.com>
-
jakpiase authored
* Implemented smfmac xdlops * Added smfmac blockwise xdlops * fixes * add reviewers suggestions --------- Co-authored-by:Adam Osewski <19374865+aosewski@users.noreply.github.com>
-
- 10 Sep, 2024 1 commit
-
-
Dan Yao authored
* fix fa bwd * revert kernelBlockSize in gemm_kernel.hpp
-
- 09 Sep, 2024 1 commit
-
-
Thomas Ning authored
-
- 08 Sep, 2024 2 commits
-
-
carlushuang authored
-
carlushuang authored
-
- 07 Sep, 2024 2 commits
-
-
carlushuang authored
-
Thomas Ning authored
* Checkpoint: Finished with the tile example & kernel verification, working on the different matrix layout * Finished the Matrix Layout feature set up. Note: Need to modify the inner block to solve the shuffle problem in the future. * Fix: Clang Format, API fixed from fmha * fix with better naming convention * revert back the pipeline code of fmha * Fixed: Addressed the comments and merge the GEMM shape of GEMM Operator and FMHA Operator to one. * clang format with the reference_gemm file * convert the clang format with the remod.py * Changed the format and variable name of the kernel gemm_shape and partitioner --------- Co-authored-by:thomasning <thomasning@banff-cyxtera-s70-4.ctr.dcgpu>
-
- 06 Sep, 2024 2 commits
-
-
carlushuang authored
-
carlushuang authored
-
- 05 Sep, 2024 2 commits
-
-
M.Emin Ozturk authored
* issue fix, one line changed for tmp * clang --------- Co-authored-by:
Emin Ozturk <emin.ozturk@utah.edu> Co-authored-by:
Harisankar Sadasivan <135730918+hsadasiv@users.noreply.github.com>
-
Haocong WANG authored
* revert ckprofiler change * temp save * Add test and test pass * test pass * Fix bug inside rotating buffer when tensor is not packed * bug fix * clang format --------- Co-authored-by:Illia Silin <98187287+illsilin@users.noreply.github.com>
-
- 04 Sep, 2024 3 commits
-
-
Rostyslav Geyyer authored
-
Illia Silin authored
* copy all fmha headers when building library * fix the rocm_install call for mha headers
-
Illia Silin authored
* locate a newwer version of python when -DRHEL=ON flag is set * allow setting python version on cmake command line
-
- 03 Sep, 2024 6 commits
-
-
carlushuang authored
-
carlushuang authored
-
carlushuang authored
-
carlushuang authored
-
carlushuang authored
-
Bartłomiej Kocot authored
* Add support for NGCHW in grouped conv bwd wei * Comments fixes * navi fixes * Update function names
-
- 02 Sep, 2024 1 commit
-
-
Bartłomiej Kocot authored
Revert "Revert "Revert Revert Support access per groups and filter2x3 in grouped conv fwd (#1382) (#1406) (#1415)" (#1455)" (#1490) This reverts commit 5ff8eeeb.
-
- 01 Sep, 2024 4 commits
-
-
carlushuang authored
-
carlushuang authored
-
carlushuang authored
-
carlushuang authored
-
- 30 Aug, 2024 2 commits
-
-
Dan Yao authored
* asm rtn * add asm rtn macro * reorder macro --------- Co-authored-by:carlushuang <carlus.huang@amd.com>
-
Po Yen Chen authored
Co-authored-by:carlushuang <carlus.huang@amd.com>
-
- 29 Aug, 2024 1 commit
-
-
Po Yen Chen authored
-