"pretrain_bert_ict.py" did not exist on "1a1cfcff338acac65aa60e70348adb124f8e631f"
  • Anthony Chang's avatar
    Tune & add conflict-free LDS gemm kernels (#159) · 7db48f90
    Anthony Chang authored
    * retune & add conflict-free bf16/fp16 c-shuffle gemm instances
    
    amend wrong K1 value in some fp16/bf16 kernel instances
    
    * make gemm cshuffle's timing behavior consistent with all other functions
    
    * clang-format
    
    * retune & add conflict-free fp32 c-shuffle gemm instances
    
    * retune & add conflict-free int8 c-shuffle gemm instances
    
    * update the underlying gridwise gemm of all c-shuffle gemm kernels
    
    * typo
    7db48f90
gemm_xdl_fp16.cpp 9.22 KB