    Add padding device_gemm_add_add_fastgelu_xdl_c_shuffle instances to enable... · 9a1f2475
    Rostyslav Geyyer authored
    Add padding device_gemm_add_add_fastgelu_xdl_c_shuffle instances to enable arbitrary problem size (#535)
    
    * Add padding device_gemm_add_add_fastgelu_xdl_c_shuffle instances
    
    * Add padding device_gemm_add_fastgelu_xdl_c_shuffle instances
    
    * Add gemm_add_fastgelu profiler impl
    
    * Add padding device_gemm_fastgelu_xdl_c_shuffle instances
    
    * Add gemm_fastgelu profiler impl
CMakeLists.txt 3.58 KB