"src/include/blockwise_4d_tensor_op.hpp" did not exist on "a0584426ff5b6b8b448c971b97c9b1a4d86ba010"
  1. 02 Dec, 2024 1 commit
  2. 22 Nov, 2024 1 commit
  3. 21 Nov, 2024 2 commits
  4. 20 Nov, 2024 1 commit
  5. 15 Nov, 2024 1 commit
  6. 14 Nov, 2024 1 commit
  7. 12 Nov, 2024 1 commit
  8. 07 Nov, 2024 1 commit
  9. 06 Nov, 2024 1 commit
  10. 05 Nov, 2024 2 commits
  11. 04 Nov, 2024 1 commit
  12. 01 Nov, 2024 1 commit
    • Illia Silin's avatar
      Reduce build time. (#1621) · 03c6448b
      Illia Silin authored
      * disable fp8 gemm_universal on gfx90a and gfx908 by default
      
      * fix cmake syntax
      
      * fix clang format
      
      * add ifdefs in amd_xdlops
      
      * disable fp8 gemm instances on gfx90a by default
      
      * update readme
      03c6448b
  13. 30 Oct, 2024 3 commits
    • Andriy Roshchenko's avatar
    • Andriy Roshchenko's avatar
    • Adam Osewski's avatar
      [CK-Tile] Universal gemm memory bound pipeline (#1558) · 24d996aa
      Adam Osewski authored
      * CK-Tile GEMM with memory bound pipeline.
      
      * Memory bound gemm pipeline.
      
      * Fix not closed namespace.
      
      * Block gemm mem pipeline draft.
      
      * Do not use ck_tile:: within ck_tile namespace.
      
      * Refactoring & Move Layout info to pipeline problem.
      
      * Get hot loop and TailNum information before lunching kernel.
      
      * Fixes in pipeline.
      
      * Add comment to load_tile_raw and change variable naming style.
      
      * Few small changes & formatting.
      
      * Do not use macro.
      
      * Add gtests.
      
      * Use AccDataType for Output of MFMA instruction.
      
      * Formatting.
      
      * Refactor gemm examples.
      
      * Switch over to current block gemm.
      
      * Use currently available pipeline policy.
      
      * Refactoring and review comment.s
      
      * Fixes after merge.
      
      * Add missing include.
      
      * Add load tile overload which accepts output tensor as parameter.
      
      * This give 8% perf boost at the cost of using more registers.
      
      * Rename example.
      
      * Small changes.
      
      * Fix compilation err and lower K.
      
      * Support different layouts for A/B
      
      * Fix vector size for different layouts.
      
      * Rename Alignment into VectorSize
      
      * Unblock tests.
      24d996aa
  14. 29 Oct, 2024 1 commit
  15. 21 Oct, 2024 1 commit
  16. 16 Oct, 2024 1 commit
  17. 15 Oct, 2024 3 commits
  18. 14 Oct, 2024 1 commit
  19. 11 Oct, 2024 2 commits
  20. 10 Oct, 2024 1 commit
  21. 07 Oct, 2024 2 commits
  22. 27 Sep, 2024 1 commit
  23. 20 Sep, 2024 1 commit
  24. 17 Sep, 2024 1 commit
  25. 16 Sep, 2024 1 commit
  26. 13 Sep, 2024 1 commit
  27. 12 Sep, 2024 1 commit
  28. 11 Sep, 2024 1 commit
    • jakpiase's avatar
      Rewrite pool2d fwd (#1462) · e8d2887c
      jakpiase authored
      
      
      * added pool2d fwd
      
      * add tests
      
      * add reviewers changes
      
      * Revert "Merge remote-tracking branch 'origin/develop' into jakpiase/pool2d_fwd_new"
      
      This reverts commit 6b2ba7ff8960b0a6ddbe30d8dac53eeb55a8597e, reversing
      changes made to 22c82bea0caf3e0f29399100c1bb67b8003fc042.
      
      * Revert "add reviewers changes"
      
      This reverts commit 22c82bea0caf3e0f29399100c1bb67b8003fc042.
      
      * added reviewers comments
      
      * revert some old files
      
      * add reviewers requests
      
      ---------
      Co-authored-by: default avatarAdam Osewski <19374865+aosewski@users.noreply.github.com>
      e8d2887c
  29. 05 Sep, 2024 1 commit
  30. 04 Sep, 2024 1 commit
  31. 03 Sep, 2024 1 commit
  32. 26 Aug, 2024 1 commit
    • Illia Silin's avatar
      Enable daily ninja build traces. (#1487) · 19d22e60
      Illia Silin authored
      * add ninja trace to CI builds
      
      * fix ninja trace logic
      
      * update the ninja trace logic in jenkins file
      
      * limit the number of threads to run ninja build
      
      * use ninja for installation after build
      
      * update the path to ninjatracing tool
      
      * use ninja to run check when using build trace
      
      * fix jenkins logic
      
      * fix typos
      
      * set proper setup_args for all stages
      
      * fix ninja syntax
      
      * replace ninja check with ninja test
      
      * enable ninja tracing with mainline and staging compilers
      19d22e60