"driver/include/device_tensor.hpp" did not exist on "31ded4ac4bc524acdbf897ffff094d7e7cbed991"
Split k f16 (#97)
* init for splitk f16
* a working prototype
* debug
* perf debug
* update example
* instances for mk kn
* add instances for all layers
* clean
* clean
* add tuning
* format
* add mn_padding into irregular tile
* clean
Co-authored-by: Chao Liu <chao.liu2@amd.com>