"...resnet50_tensorflow.git" did not exist on "d9cb85543604a498a232c0210d06ab8af1d129aa"
  • botbw's avatar
    [Language] support `T.gemm_sp_v2` on sm80 and sm89 (#1056) · 283a9a00
    botbw authored
    * [misc] add a cpp side wrapper for gemm_sp_py
    
    * [misc] typing
    
    * [IR] bind GemmSPWarpPolicy
    
    * [chore] add wrapper code
    
    * [IR] fix GemmSPWarpPolicy
    
    * [codegen] apply ptxas instructions
    
    * [intrinsic] add typical (unused) mma layout
    
    * [template] add uint16 debug func
    
    * [intrinsic] add b matrix layout
    
    * [gemm_sp] enable fp16/bf16 on sm8x
    
    * [layout] refactor fp16/bf16 layout
    
    * [gemm_sp] enable int8
    
    * [chore] update test case dtype
    
    * [gemm_sp] enable fp32
    
    * [layout] refactor layouts
    
    * [intrinsic] enable ldmatrix for mat A
    
    * [layout] enable ldsm for matrix b
    
    * [layout] add ldmatrix for fp32 and fp8
    
    * [chore] refine
    
    * [chore] refactor
    
    * [chore] add fp8 efactor
    
    * [chore] refactor
    
    * [chore] add remove negative zero util
    
    * [example] add a custom compress kernel
    
    * [chore] minor update
    
    * [test] refactor gemm_sp test
    
    * [refactor] make metadata layout func
    
    * [example] add option for using cutlass layout
    
    * [doc] add a gemm_sp doc
    
    * [doc] minor polish
    
    * [chore] remove unused
    
    * [bugfix] fix non replicate b case
    
    * [test] refactor
    
    * [chore] add a check
    
    * [bugfix] fix util bug
    
    * [wip] init a new test case for v2
    
    * [chore] minor refactor
    
    * [chore] minor update
    
    * [bugfix] enable 16bit rs
    
    * [language] enable rs
    
    * [language] enable gemm_sp_sr
    
    * [language] enable gemm_sp_rr
    
    * [test] enable more tests
    
    * [tvm] update ffi binding
    
    * [chore] remove print
    
    * [chore] fix benchmark script
    
    * [lint] precommit lint
    
    * [chore] apply feedback
    
    * [test] use arch 8.0
    
    * [chore] rollback ::ordered_metadata for backward compatibility
    
    * [bugfix] fix captialized
    
    * [example] keep gemm_sp on hopper
    
    * [test] fix no fp8 normal kernel
    
    * [test] reduce matmul size to satisfy accum error
    
    * [test] use cal_diff for assertion
    
    * [bugfix] expand float8 type
    
    * [lib] add make_int4 for short type
    
    * [language] add transpose E
    
    * [bugfix] fix wrong var
    
    * [format] format
    
    * [chore] refactor binding
    
    * [chore] fix wrong passing var
    283a9a00
test_tilelang_tilelibrary_gemm_sp.py 11.6 KB