    [SM100] Add sm100 GEMM layouts and tcgen05 support (#887) · f58bcd43
    Zhiwen Mo authored
    * Update SM100-related utcmma, tmem, and ld/st256 code in src
    * Update SM100-related utcmma, tmem, and ld/st256 code in tilelang
    * Remove deprecated GEMM examples and related README documentation for SM100 architecture support
    * Update GEMM implementation to replace UTCMMA with TCGEN5MMA across relevant files
    * Remove gemm_umma.py example and update README to reflect TCGEN5MMA terminology changes
    * Update README.md for gemm_sm100 example by removing outdated API sections and streamlining documentation
    * Update README and source files to reflect TCGEN5.MMA terminology changes
    * Refactor CUDA GEMM header for improved readability
test_tilelang_cpu_gemm.py 4.14 KB