1. 18 Feb, 2025 9 commits
  2. 17 Feb, 2025 8 commits
  3. 15 Feb, 2025 4 commits
  4. 14 Feb, 2025 10 commits
  5. 13 Feb, 2025 4 commits
  6. 12 Feb, 2025 5 commits
    • CK Tile GEMM Compute V2 (2 LDS Ping Pong mechanism) (#1853) · 16fa63ea
      Thomas Ning authored
      * comp v4 setup
      
      * add a file
      
      * Finished the coding of the feature, Compiler not in the way we supposed to have
      
      * Update some of the code to better format
      
      * get back the restrict variable name, need to switch out to solve the transpose issue
      
      * Solve the compiler issue on SHMEM conflict
      
      * roll back to compute pipeline
      
      * Add the changes from include/ck_tile
      
      * Address the comments
      
      * Merge from internal (#1857)
      
      * enable batched_gemm_softmax_gemm_perm_wmma for gfx12
      
      * disable instances with blocksize=256 in attention examples
      
      * debugging
      
      * debug
      
      * fixed lds_enabled
      
      * debugging
      
      * Fix and add limit to skiplds feature
      
      * Enable skipLds feature and fix compilation bugs
      
      * add ck_tile definitions for gfx12
      
      * fix clang format and test/wmma_op
      
      * update instances cmake for gfx12
      
      * disable the test_wmma_op on gfx12
      
      * fix the builds for gfx950
      
      * add gfx12 and gfx950 to default target list
      
      * ...
    • jefyang1's avatar
      7b826807
    • fix mtile 64,128 for gemm1 · 568ad1e1
      coderfeli authored
    • remove d2 for gemm1 · 59f3e009
      coderfeli authored
    • moe gemm1 scaleready · 418baed3
      coderfeli authored