1. 04 Dec, 2024 1 commit
  2. 03 Dec, 2024 2 commits
    • Add basic documentation structure (#1715) · 5affda81
      Bartłomiej Kocot authored
      * Add basic documentation structure
      
      * Add terminology placeholder
      
      * Add codegen placeholder
      
      * Create template for each page
    • OCP FP8 support for gfx12. (#1710) · 08d5c02c
      Illia Silin authored
      * (2/5) bilinear gemm pass, perf bug: skip a lds has lower performance than skip b lds
      
      * (3/5) batched gemm pass, perf bug: skip a lds has lower performance than skip b lds
      
      * (4/5) grouped conv pass
      
      * (5/5) attention pass, todo: debug lds perf bug
      
      * AIT Attention API refactor (#8)
      
      * sanity pass
      
      * sanity pass 2
      
      * confirm significant performance regression.
      
      * turn on all instances
      
      * turn off instance format
      
      * Fix bug & tuning & format
      
      * DML meta, self_attn+cross_attn
      
      * sanity pass
      
      * remove useless flag
      
      * update tile and problem size used in AIT attention
      
      * bug fix in grouped conv supporting check
      
      * deprecate inline asm wmma
      
      * Bug fix: double lds skip
      
      * clang-format
      
      * Fix errors in
      1. example, fmha
      2. gridwise pipeline
      3. deviceop, fmha, change some containers from vector to array
      
      * part2 of previous commit
      
      * clang format
      
      * API fix of gridwisegemmpipeline
      
      * separate array base and vector base attention...
  3. 02 Dec, 2024 2 commits
  4. 30 Nov, 2024 2 commits
  5. 29 Nov, 2024 2 commits
  6. 28 Nov, 2024 3 commits
  7. 27 Nov, 2024 3 commits
  8. 26 Nov, 2024 7 commits
  9. 25 Nov, 2024 4 commits
  10. 23 Nov, 2024 2 commits
  11. 22 Nov, 2024 2 commits
  12. 21 Nov, 2024 3 commits
  13. 20 Nov, 2024 2 commits
    • Optimize docker file. (#1679) · d31e8249
      Illia Silin authored
      * reduce the docker image size and layers
      
      * clean up docker file
      
      * fix linker error for client example 24
      
      * install CK into the default /opt/rocm/ path
      
      * restore installing CK to alternative path in CI
      
      * add linking for utility lib
    • fix bug (#1680) · 81ec5eff
      Haocong WANG authored
  14. 19 Nov, 2024 2 commits
  15. 18 Nov, 2024 2 commits
  16. 15 Nov, 2024 1 commit