Add flashattention2 (#196)
* first * fix causal mask * disable flash attention2 on sm70 * fix 2 * update readme * clang-format * disable ft2 on windows * fix lint * fix build * fix build * fix long kv seq * fix lint * sync copy output --------- Co-authored-by:grimoire <yaoqian@pjlab.org.cn> Co-authored-by:
irexyc <irexyc@gmail.com>
Showing
This diff is collapsed.
Please register or sign in to comment