Fix flash attn mask bug (#733)
* add input parameter check
* add instance for vector load = 1
* move general instance to first position
* fix bias read code
* regularize code for bias load
---------
Co-authored-by: zjing14 <zhangjing14@gmail.com>