Fp16/fp8 mixed-precision Gemm with multiply+add fusion (#865)
* add compute_type
* add multiply_add ckProfiler
* add f8_fp16 support
* clean
* clean
* fixed lds size calc
* format
---------
Co-authored-by:
Jing Zhang <jizha@amd.com>
Showing
Please register or sign in to comment