    [Feature] Upgrade cutlass version and support fp8 T.gemm (#202) · 2cccf1f5
    zqh-wz authored
    
    
    * upgrade cutlass to upstream v3.8.0
    
    * Implement fp8 gemm and add example script
    
    * Fix dtype retrieval with map_torch_type for fp8 inputs
    
    * Disable vectorization of fp8 values
    
    * Make MMA declaration compatible with cutlass 3.4.0+
    
    * Add test for fp8 T.gemm
    
    * fix indent
    
    * fix indent
    
    * Add copyright and license header
    
    * Add copyright and license header
    
    * lint fix
    
    * Refactor matmul_nt and assert_matmul_correctness functions for improved readability by consolidating parameter definitions and adjusting formatting.
    
    * clang format lint
    
    ---------
    Co-authored-by: Lei Wang <34334180+LeiWang1999@users.noreply.github.com>
    Co-authored-by: LeiWang1999 <leiwang1999@outlook.com>
bulk_copy.cc 13.7 KB