1. 28 Nov, 2024 6 commits
  2. 27 Nov, 2024 2 commits
    • Illia Silin's avatar
    • Adam Osewski's avatar
      Polished Grouped GEMM APIs and new BF16 instances (#1600) · 061ac064
      Adam Osewski authored
      * Few small fixes.
      
      * New GroupedGemm instances (BF16)
      
      * Unify and refactor GroupedGEMM device API.
      
      * Adapt changes to new API.
      
      * Adapt grouped gemm profiler.
      
      * Accept multiple kbatches for grouped gemm profiler.
      
      - delete obsolete two stage as it is now covered by grouped gemm
      
      * Update unit test for grouped gemm.
      
      * Fix thresholds for BF16 and F8. Unblock tests.
      
      * Fix few instances.
      
      * Multiple small fixes.
      
      * Adapt to new API, check dynamic casting.
      
      * Uncomment few data types in grouped gemm profiler.
      
      * Fix call to SetDeviceArgs.
      
      * Fix profile grouped gemm multiply tile loop.
      
      * Fix grouped gemm tile loop kernel args in client examples.
      
      * Review comments.
      061ac064
  3. 26 Nov, 2024 7 commits
  4. 25 Nov, 2024 4 commits
  5. 23 Nov, 2024 2 commits
  6. 22 Nov, 2024 2 commits
  7. 21 Nov, 2024 3 commits
  8. 20 Nov, 2024 2 commits
    • Illia Silin's avatar
      Optimize docker file. (#1679) · d31e8249
      Illia Silin authored
      * reduce the docker image size and layers
      
      * clean up docker file
      
      * fix linker error for client example 24
      
      * install CK into the default /opt/rocm/ path
      
      * restore installing CK to alternative path in CI
      
      * add linking for utility lib
      d31e8249
    • Haocong WANG's avatar
      fix bug (#1680) · 81ec5eff
      Haocong WANG authored
      81ec5eff
  9. 19 Nov, 2024 2 commits
  10. 18 Nov, 2024 2 commits
  11. 15 Nov, 2024 3 commits
  12. 14 Nov, 2024 2 commits
  13. 13 Nov, 2024 3 commits