"backends/v2/Cargo.toml" did not exist on "fa8a8e05afa435bd39308974925a1098239dfcb4"
  • zjing14's avatar
    Add xdlops v4r4r4 into online compilation (#48) · fbdf4332
    zjing14 authored
    
    
    * init for v4r4 xdlops olc
    
    * refactor wrap
    
    * init impl of v4r4 nchw xdlops olc
    
    * tuning
    
    * test perf
    
    * fixed v4r4 nhwc
    
    * tuned v4r4 nhwc
    
    * use gridwise_gemm_xdlops_v2r3
    
    * swap a/b
    
    * add pointer support into offline v2r3
    
    * debugging v4r4r4 transform for olc
    
    * change timer of olc
    
    * refactor v4r4 xdlops nchw olc
    
    * remove transform fun in v4r4 xdlops nhwc olc
    Co-authored-by: default avatarChao Liu <chao.liu2@amd.com>
    fbdf4332
conv_driver_v2_olc.cpp 12.8 KB