"...composable_kernel_rocm.git" did not exist on "0ab48d621547f47957702e46fc210b298edd1bdc"
  1. 11 Nov, 2024 1 commit
  2. 29 Oct, 2024 4 commits
  3. 28 Oct, 2024 2 commits
  4. 27 Oct, 2024 1 commit
  5. 24 Oct, 2024 2 commits
  6. 23 Oct, 2024 7 commits
  7. 22 Oct, 2024 5 commits
  8. 21 Oct, 2024 3 commits
  9. 20 Oct, 2024 3 commits
  10. 18 Oct, 2024 2 commits
  11. 16 Oct, 2024 1 commit
  12. 15 Oct, 2024 3 commits
  13. 14 Oct, 2024 1 commit
  14. 13 Oct, 2024 1 commit
  15. 11 Oct, 2024 1 commit
  16. 09 Oct, 2024 2 commits
  17. 08 Oct, 2024 1 commit
    • Rostyslav Geyyer's avatar
      Add a gpu gemm reference kernel (#1528) · aa932445
      Rostyslav Geyyer authored
      
      
      * Add a gpu gemm reference kernel
      
      * Switch to gpu reference in gemm examples
      
      * Remove redundant arguments
      
      * Update all related examples
      
      * Update more examples
      
      * Try less threads per block
      
      * Try even less threads per block
      
      * Add support for all matrix layouts
      
      * Increase block size
      
      * Clean up
      
      * Remove hardcoded strides
      
      * Clean up
      
      * Try a column-major case
      
      * Revert back to row-major
      
      * Run both CPU and GPU veriffication
      
      ---------
      Co-authored-by: default avatarPo Yen Chen <PoYen.Chen@amd.com>
      aa932445