1. 29 Oct, 2024 1 commit
  2. 24 Oct, 2024 2 commits
  3. 23 Oct, 2024 3 commits
  4. 22 Oct, 2024 2 commits
  5. 21 Oct, 2024 3 commits
  6. 19 Oct, 2024 1 commit
  7. 18 Oct, 2024 2 commits
  8. 16 Oct, 2024 5 commits
  9. 15 Oct, 2024 4 commits
  10. 14 Oct, 2024 2 commits
  11. 11 Oct, 2024 5 commits
  12. 10 Oct, 2024 5 commits
  13. 09 Oct, 2024 3 commits
  14. 08 Oct, 2024 2 commits
    • Rostyslav Geyyer's avatar
      Add a gpu gemm reference kernel (#1528) · aa932445
      Rostyslav Geyyer authored
      
      
      * Add a gpu gemm reference kernel
      
      * Switch to gpu reference in gemm examples
      
      * Remove redundant arguments
      
      * Update all related examples
      
      * Update more examples
      
      * Try less threads per block
      
      * Try even less threads per block
      
      * Add support for all matrix layouts
      
      * Increase block size
      
      * Clean up
      
      * Remove hardcoded strides
      
      * Clean up
      
      * Try a column-major case
      
      * Revert back to row-major
      
      * Run both CPU and GPU veriffication
      
      ---------
      Co-authored-by: default avatarPo Yen Chen <PoYen.Chen@amd.com>
      aa932445
    • Po Yen Chen's avatar
      [CK_TILE] Update example README files & fix script compatibility issue (#1548) · 0c094daa
      Po Yen Chen authored
      * Fix text alignment of ArgParser::print()
      
      * Update example README files
      
      * Clarify make-ck-dev.sh <arch> usage
      
      * Only keep some of the argument from '-?' output
      
      * Undo command line output changes in README
      
      * Only keep existing argument on doc and update description
      
      * Fix text alignment
      
      * Make cmake-ck-*.sh compatible with 'sh' command
      0c094daa