1. 27 Nov, 2024 1 commit
  2. 18 Nov, 2024 1 commit
  3. 15 Nov, 2024 2 commits
  4. 14 Nov, 2024 1 commit
  5. 05 Nov, 2024 1 commit
  6. 01 Nov, 2024 1 commit
    • Illia Silin's avatar
      Reduce build time. (#1621) · 03c6448b
      Illia Silin authored
      * disable fp8 gemm_universal on gfx90a and gfx908 by default
      
      * fix cmake syntax
      
      * fix clang format
      
      * add ifdefs in amd_xdlops
      
      * disable fp8 gemm instances on gfx90a by default
      
      * update readme
      03c6448b
  7. 23 Oct, 2024 1 commit
  8. 22 Oct, 2024 1 commit
  9. 18 Oct, 2024 2 commits
  10. 10 Oct, 2024 1 commit
  11. 09 Oct, 2024 1 commit
  12. 07 Oct, 2024 2 commits
  13. 04 Oct, 2024 1 commit
    • arai713's avatar
      Codegen build (#1526) · b545de17
      arai713 authored
      * updating codegen build for MIOpen access: adding .cmake for codegen component
      
      (cherry picked from commit 652a7c04)
      
      * updating CMake
      
      (cherry picked from commit a685822e)
      b545de17
  14. 03 Oct, 2024 1 commit
  15. 13 Sep, 2024 1 commit
    • Jun Liu's avatar
      Customize filesystem in CK for legacy systems (#1509) · 81bc1496
      Jun Liu authored
      
      
      * Legacy support: customized filesystem
      
      * Update cmakefile for python alternative path
      
      * fix build issues
      
      * CK has no boost dependency
      
      * More fixes to issues found on legay systems
      
      * fix clang format issue
      
      * Check if blob is correctly generated in cmake
      
      * fix the python issues
      
      * add a compiler flag for codegen when using alternative python
      
      * use target_link_options instead of target_compile_options
      
      ---------
      Co-authored-by: default avatarillsilin <Illia.Silin@amd.com>
      81bc1496
  16. 04 Sep, 2024 1 commit
  17. 23 Aug, 2024 1 commit
  18. 22 Aug, 2024 1 commit
  19. 21 Aug, 2024 1 commit
  20. 16 Aug, 2024 1 commit
    • Illia Silin's avatar
      Re-enable fp8 types for all architectures. (#1470) · c8b6b642
      Illia Silin authored
      * re-enable fp8 and bf8 for all targets
      
      * restore the fp8 gemm instances
      
      * re-enable conv_3d fp8 on all architectures
      
      * diasble several fp8 gemm instances on all architectures except gfx94
      
      * clang format fix
      c8b6b642
  21. 15 Aug, 2024 1 commit
  22. 14 Aug, 2024 1 commit
    • Haocong WANG's avatar
      [GEMM] gemm_universal related optimization (#1453) · 3049b546
      Haocong WANG authored
      
      
      * replace buffer_atomic with global_atomic
      
      * fixed global_atomic_add
      
      * added bf16 atomic_add
      
      * format
      
      * clang-format-12
      
      * clean
      
      * clean
      
      * add guards
      
      * Update gtest.cmake
      
      * enabled splitk_gemm_multi_d
      
      * format
      
      * add ckProfiler
      
      * format
      
      * fixed naming
      
      * format
      
      * clean
      
      * clean
      
      * add guards
      
      * fix clang format
      
      * format
      
      * add kbatch printout
      
      * clean
      
      * Add rocm6.2 related gemm optimization
      
      * Limit bf16 atomic usage
      
      * remove redundant RCR gemm_universal instance
      
      * Add RRR fp8 gemm universal instance
      
      * Bug fix
      
      * Add GPU_TARGET guard to FP8/BF8 target
      
      * bug fix
      
      * update cmake
      
      * remove all fp8/bf8 example if arch not support
      
      * Enable fp8 RRR support in ckProfiler
      
      * limit greedy-reverse flag to gemm_universal in ckProfiler
      
      ---------
      Co-authored-by: default avatarJing Zhang <jizhan@fb.com>
      Co-authored-by: default avatarJing Zhang <jizhan@meta.com>
      Co-authored-by: default avatarzjing14 <zhangjing14@gmail.com>
      Co-authored-by: default avatarIllia Silin <98187287+illsilin@users.noreply.github.com>
      Co-authored-by: default avatarillsilin <Illia.Silin@amd.com>
      3049b546
  23. 09 Aug, 2024 1 commit
  24. 08 Aug, 2024 1 commit
  25. 06 Aug, 2024 1 commit
  26. 01 Aug, 2024 1 commit
  27. 26 Jul, 2024 1 commit
  28. 16 Jul, 2024 2 commits
  29. 10 Jul, 2024 1 commit
  30. 08 Jul, 2024 1 commit
  31. 27 Jun, 2024 1 commit
  32. 19 Jun, 2024 1 commit
  33. 22 May, 2024 1 commit
  34. 10 May, 2024 1 commit
  35. 01 May, 2024 1 commit
  36. 18 Apr, 2024 1 commit