"build_tools/git@developer.sourcefind.cn:hehl2/torchaudio.git" did not exist on "894959a7bd69ae9c4a50723ad128c734c64e6da2"
  • Add gridwise GEMM pipeline (#89) · 22d438ae
    Chao Liu authored
    * clean up
    
    * add multiple thread scratch to ThreadwiseTensorSliceTransfer_v3r1
    
    * add 2 stage prefetch
    
    * add more sanity check into transform_tensor_descriptor
    
    * tweak
    
    * enabling 2 stage prefetch to existing gridwise gemm; tweak
    
    * enabling 2 stage prefetch to existing gridwise gemm
    
    * move gridwise gemm pipeline into class; clean up
    
    * add some irregular tile size
    
    * update CalculateHasMainK0BlockLoop for multi-stage-prefetch
    
    * refactor gridwise gemm pipeline class
profile_gemm_impl.hpp 14.7 KB