Add rocblas_alt_impl falg for bwd rocblas calls in MHA (#70)
* Add missing flags arg in gemm_switch_fp32accum call
* Add rocblas_alt_impl flag in MHA
<rev> Add rocblas_alt_impl flag for all bwd gemms in MHA module
* Use ifdef for rocblas_gemm_flags_fp16_alt_impl to target at various AMD hardware
Co-authored-by:
hubertlu-tw <hubertlu@amd.com>
Showing
Please register or sign in to comment