"vscode:/vscode.git/clone" did not exist on "cf0f7eafe69a7bb2aebf2c6c6ac361be8d4ccfe6"
Update for fmha_fwd qs_ks_vs pipeline (#1810)
* Update for fmha_fwd qs_ks_vs pipeline
* Remove _builtin_amdgcn_sched_barrier(0)
* Move p_compute to p converting earlier for trying to increase vgprs re-using
* Enable GetQKBlockGemm to use WarpGemm-16x16x16 for QLoadOnce==false situation
* Re-add __builtin_amdgcn_sched_barrier(0)
---------
Co-authored-by:
Po Yen Chen <PoYen.Chen@amd.com>
Showing
Please register or sign in to comment