  1. 15 Oct, 2024 1 commit
  2. 12 Oct, 2024 6 commits
  3. 11 Oct, 2024 6 commits
  4. 10 Oct, 2024 6 commits
  5. 09 Oct, 2024 3 commits
  6. 08 Oct, 2024 3 commits
    • Add a gpu gemm reference kernel (#1528) · aa932445
      Rostyslav Geyyer authored

      * Add a gpu gemm reference kernel
      
      * Switch to gpu reference in gemm examples
      
      * Remove redundant arguments
      
      * Update all related examples
      
      * Update more examples
      
      * Try less threads per block
      
      * Try even less threads per block
      
      * Add support for all matrix layouts
      
      * Increase block size
      
      * Clean up
      
      * Remove hardcoded strides
      
      * Clean up
      
      * Try a column-major case
      
      * Revert back to row-major
      
      * Run both CPU and GPU verification
      
      ---------
      Co-authored-by: Po Yen Chen <PoYen.Chen@amd.com>
    • [CK_TILE] Update example README files & fix script compatibility issue (#1548) · 0c094daa
      Po Yen Chen authored
      * Fix text alignment of ArgParser::print()
      
      * Update example README files
      
      * Clarify make-ck-dev.sh <arch> usage
      
      * Only keep some of the arguments from '-?' output
      
      * Undo command line output changes in README
      
      * Only keep existing arguments in doc and update description
      
      * Fix text alignment
      
      * Make cmake-ck-*.sh compatible with 'sh' command
    • [CK_TILE] Simplify the codes in splitkv_combine pipeline (#1549) · 74d68e3b
      Qianfeng authored

      * Simplify the codes in splitkv_combine pipeline
      
      * Always set kPadSeqLenK=true for fmha splitkv kernels
      
      * Change in Oacc Alignment and TileDistribution to be more adaptable to tile sizes
      
      ---------
      Co-authored-by: Po Yen Chen <PoYen.Chen@amd.com>
  7. 07 Oct, 2024 4 commits
  8. 04 Oct, 2024 3 commits
  9. 02 Oct, 2024 2 commits
  10. 01 Oct, 2024 4 commits
  11. 27 Sep, 2024 1 commit
  12. 26 Sep, 2024 1 commit