Remove gemm copy and simplify rocblas call (#356)
* Remove extra copy in gemm
* combine rocblas gemm call
* clang format
* fix a bug in calling rocblas function
* clang format
* backup of temporary changes
* clang format
* unify the gemm call to avoid multiple gpu implementation
* clang format
* remove unnecessary code
* backup temp changes
* clang format
* fix cppcheck error
* code backup
* clang format
* remove unnecessary synchronization function
* clang format
* fix bugs
* clang format
* more optimization related to gemm
* clang format
* code cleanup
* implementation that can achieve better performance
* clang format
* temp changes to try performance
* clang format
* revert to previous commits
* fixed review comments
* clang format
* fix review comments