ckProfiler and device-level XDL GEMM operator (#48)
* add DeviceGemmXdl
* update script
* fix naming issue
* fix comment
* output HostTensorDescriptor
* rename
* padded GEMM for fwd v4r4r4 nhwc
* refactor
* refactor
* refactor
* adding ckProfiler
* adding ckProfiler
* refactor
* fix tuning parameter bug
* add more gemm instances
* add more fp16 GEMM instances
* fix profiler driver
* fix bug in tuning parameter
* add fp32 gemm instances
* small fix
* refactor
* rename
* refactor gemm profiler; adding DeviceConv and conv profiler
* refactor
* fix
* add conv profiler
* refactor
* adding more GEMM and Conv instance
* Create README.md: Add build instruction for ckProfiler
* Create README.md: Add Readme for gemm_xdl example
* Update README.md: Remove build instruction from top most folder
* Update README.md
* clean up
profiler/CMakeLists.txt
0 → 100644
profiler/README.md
0 → 100644
profiler/conv_profiler.cpp
0 → 100644
profiler/gemm_profiler.cpp
0 → 100644
profiler/profiler.cpp
0 → 100644
script/conv_driver.sh
0 → 100755
script/example_gemm_xdl.sh
0 → 100755
script/gemm_driver.sh
0 → 100755
script/run.sh
deleted
100755 → 0