1. 05 Dec, 2024 1 commit
  2. 04 Dec, 2024 2 commits
  3. 30 Nov, 2024 1 commit
    • mtgu0705's avatar
      Merge the int4 kernel and profiling in one commit for RTP. · 40054f53
      mtgu0705 authored
      Add int4+scale based on Zhang, Jing pk_i4. Compile pass, function pass.
      Modify the kernel to 128x128x128, and use mfma_32x32x4
      Move the weight permute from host to device
      
      Modified the scale init method.
      
      Modified the init method, the function is failed, need to debug.
      
      Added init method
      
      Support group=128 for Llam2-7B-int4
      
      Move the weight permute from host to device
      
      Add ckProfiler for GEMM b scale (int4)
      
      Add reference function.
      
      Add pipeline v4 (2 LDS pingpong)
      
      Add more int4-Gemm kernel profiling instances.
      
      Modify the in4-Gemm kernel instances
      
      Move the pk_i4 permute in kernel
      40054f53
  4. 27 Oct, 2024 1 commit
  5. 24 Oct, 2024 2 commits
  6. 23 Oct, 2024 6 commits
  7. 22 Oct, 2024 3 commits
  8. 21 Oct, 2024 3 commits
  9. 20 Oct, 2024 2 commits
  10. 18 Oct, 2024 2 commits
  11. 16 Oct, 2024 1 commit
  12. 15 Oct, 2024 3 commits
  13. 14 Oct, 2024 1 commit
  14. 13 Oct, 2024 1 commit
  15. 11 Oct, 2024 1 commit
  16. 09 Oct, 2024 1 commit
  17. 07 Oct, 2024 1 commit
  18. 04 Oct, 2024 1 commit
  19. 02 Oct, 2024 1 commit
  20. 25 Sep, 2024 1 commit
  21. 20 Sep, 2024 2 commits
  22. 13 Sep, 2024 1 commit
    • Jun Liu's avatar
      Customize filesystem in CK for legacy systems (#1509) · 81bc1496
      Jun Liu authored
      
      
      * Legacy support: customized filesystem
      
      * Update cmakefile for python alternative path
      
      * fix build issues
      
      * CK has no boost dependency
      
      * More fixes to issues found on legay systems
      
      * fix clang format issue
      
      * Check if blob is correctly generated in cmake
      
      * fix the python issues
      
      * add a compiler flag for codegen when using alternative python
      
      * use target_link_options instead of target_compile_options
      
      ---------
      Co-authored-by: default avatarillsilin <Illia.Silin@amd.com>
      81bc1496
  23. 12 Sep, 2024 1 commit
  24. 11 Sep, 2024 1 commit
    • jakpiase's avatar
      Rewrite pool2d fwd (#1462) · e8d2887c
      jakpiase authored
      
      
      * added pool2d fwd
      
      * add tests
      
      * add reviewers changes
      
      * Revert "Merge remote-tracking branch 'origin/develop' into jakpiase/pool2d_fwd_new"
      
      This reverts commit 6b2ba7ff8960b0a6ddbe30d8dac53eeb55a8597e, reversing
      changes made to 22c82bea0caf3e0f29399100c1bb67b8003fc042.
      
      * Revert "add reviewers changes"
      
      This reverts commit 22c82bea0caf3e0f29399100c1bb67b8003fc042.
      
      * added reviewers comments
      
      * revert some old files
      
      * add reviewers requests
      
      ---------
      Co-authored-by: default avatarAdam Osewski <19374865+aosewski@users.noreply.github.com>
      e8d2887c