1. 18 Nov, 2024 1 commit
    • Illia Silin's avatar
      Add bf16 and int8 wmma gemms for Navi3x and Navi4x. (#1671) · 8aba2724
      Illia Silin authored
      * add bf16 gemms for gfx11/gfx12
      
      * reduce the input values in test_gemm
      
      * add int8 wmma gemm instances for gfx11/gfx12
      
      * add example gemm_wmma_int8
      
      * fix bug in gemm_wmma_int8 test
      
      * increase bf16 gemm test tolerance
      
      * update the dates and clean-up commented-out instances
      8aba2724
  2. 07 Nov, 2024 1 commit
  3. 27 Jun, 2024 1 commit
  4. 16 Apr, 2024 1 commit
  5. 04 Apr, 2024 1 commit
  6. 10 Mar, 2024 1 commit
  7. 24 Feb, 2024 1 commit
  8. 02 Feb, 2024 1 commit
  9. 13 Jun, 2023 1 commit
  10. 31 May, 2023 1 commit
  11. 10 May, 2023 1 commit
  12. 15 Mar, 2023 1 commit
  13. 10 Mar, 2023 1 commit
  14. 24 Feb, 2023 1 commit
  15. 16 Feb, 2023 2 commits
  16. 03 Feb, 2023 1 commit
  17. 17 Jan, 2023 1 commit
    • Haocong WANG's avatar
      [Navi3x-LWPCK-545] Block-wise GEMM + Real GEMM_WMMA_FP16 (#541) · 919aeb1f
      Haocong WANG authored
      * wmma_op + unit test
      
      * add arch limitation to wmma test
      
      * change arch limitation
      
      * Refactor + Add all type unit test(int4 compile failed)
      
      * Add f32_16x16x16_bf16 unit test
      
      * tempsave
      
      * tempsave
      
      * tempsave
      
      * runtime bug, cannot find symbol
      
      * workaround for incorrect HIP warpSize return value
      
      * debugging
      
      * tempsave
      
      * Correctness OK, waiting for optimization
      
      * Tidy up + format
      
      * temp save
      
      * temp save, reproduce the v_bfi_b32 issue
      
      * add inline asm for wmmaop test
      
      * tidy up
      
      * clean some debug purpose code
      
      * discard some codes
      
      * clang format
      
      * clang format
      
      * compiler issue fixed + increase tile size
      919aeb1f
  18. 15 Dec, 2022 1 commit
  19. 13 Dec, 2022 1 commit
  20. 09 Dec, 2022 1 commit
  21. 02 Dec, 2022 1 commit
  22. 24 Nov, 2022 1 commit
  23. 28 Oct, 2022 1 commit
  24. 21 Oct, 2022 2 commits