• zjing14's avatar
    Add VectorType support into StaticBuffer (#27) · 846f462b
    zjing14 authored
    
    
    * init StaticBufferV2
    
    * clean
    
    * adopt old output stage for staticBufferV2
    
    * clean
    
    * remove hack
    
    * clean
    
    * clean
    
    * clean code
    
    * move c_buffer alloc into blockwise gemm
    
    * add adaptors for m/n_thread_data_on_grid
    
    * adjust blockwise_gemm_xdlops
    
    * reorder ops in GEMM hot loop
    Co-authored-by: default avatarChao Liu <chao.liu2@amd.com>
    846f462b
amd_xdlops.hpp 14.4 KB