1. 06 Aug, 2024 1 commit
  2. 02 Aug, 2024 5 commits
  3. 01 Aug, 2024 2 commits
  4. 31 Jul, 2024 20 commits
  5. 30 Jul, 2024 2 commits
  6. 26 Jul, 2024 2 commits
  7. 25 Jul, 2024 2 commits
  8. 24 Jul, 2024 3 commits
  9. 23 Jul, 2024 1 commit
  10. 22 Jul, 2024 1 commit
  11. 19 Jul, 2024 1 commit
    • [GEMM] F8 GEMM, performance optimized. (#1384) · 8c90f25b
      Haocong WANG authored
      
      
      * add ab_scale init support
      
      * enabled interwave
      
      * add scale type; update isSupport
      
      * adjust example
      
      * clean
      
      * enable f8 pure gemm rcr ckprofiler
      
      * Add gemm_multiply_multiply instances
      
      * clang format
      
      * Optimize for ScaleBlockMNK=128
      
      * enable abscale f8 gemm ck profiler
      
      * Add pure f8 gemm test suite
      
      * Reverting to the state of project at f60fd77
      
      * update copyright
      
      * clang format
      
      * update copyright
      
      ---------
      Co-authored-by: root <jizhan@amd.com>