- 23 Jul, 2024 2 commits
-
-
Tri Dao authored
-
rocking authored
* Support ck in fmha * Add ck submodule * Do not return lse if return_softmax == false * Use receipt to speed up ck compile time * Integrate new version of ck_tile * Support dropout for mha_fwd() * Add dropout to mha_varlen_fwd() * Update ck to develop * Extract padding function for dropout randval * Extract randval transformation function * Sync the code structure and coding style with FA * Remove this line, c++ api will handle this. Sync with test_flash_attn.py * fix compile error * Add mha_bwd * Generate dropout seed and offset from user generator * update CK * Add mha_varlen_bwd * Use same python as build flash-attn to generate ck kernel * Fix bug of group mode fwd about returning softmax lse * larger the test tollerance * Add test_flash_attn_output() and test_flash_attn_varlen_output() * Always fill softmax_lse * Remove duplicate benchmark script, since we already implement mha_bwd * Refine get value from tuple * Use default parameter for stream_config * unblock all platform * Add comment * refine the test code * Refine naming * Add unpack to namespace * Do not hardcode the warp size 64 * Add more targets * Add README * Optimize mha_fwd if seqlen_q == 1 * Support get_wheel_url for rocm * Detect rocm environment by pytorch's IS_HIP_EXTENSION * update to lastest ck * Add necessary compile flag * Sync the api with upstream FA --------- Co-authored-by:
carlushuang <carlus.huang@amd.com> Co-authored-by:
Yichen Yan <wenji.yyc@alibaba-inc.com> Co-authored-by:
Po Yen Chen <PoYen.Chen@amd.com> Co-authored-by:
Yichen Yan <oraluben@outlook.com>
-
- 11 Jul, 2024 2 commits
- 26 May, 2024 1 commit
-
-
Tri Dao authored
-
- 21 Feb, 2024 1 commit
-
-
Tri Dao authored
-
- 23 Jan, 2024 2 commits
-
-
Tao He authored
Signed-off-by:Tao He <sighingnow@gmail.com>
-
Tri Dao authored
Co-authored-by:ljss <450993438@qq.com>
-
- 08 Jan, 2024 1 commit
-
-
Erich Schubert authored
-
- 24 Dec, 2023 1 commit
-
-
Tri Dao authored
-
- 22 Dec, 2023 1 commit
-
-
Tri Dao authored
-
- 09 Oct, 2023 1 commit
-
-
Tri Dao authored
-
- 08 Oct, 2023 1 commit
-
-
Tri Dao authored
-
- 05 Oct, 2023 2 commits
- 04 Sep, 2023 1 commit
-
-
Tri Dao authored
-
- 29 Aug, 2023 1 commit
-
-
Jeffrey Quesnelle authored
-
- 25 Aug, 2023 2 commits
- 01 Aug, 2023 1 commit
-
-
Tri Dao authored
-
- 23 Jul, 2023 1 commit
-
-
Ian Timmis authored
* README syntax highlighting Adds syntax highlighting to README * Update README.md
-
- 18 Jul, 2023 2 commits
- 17 Jul, 2023 1 commit
-
-
Tri Dao authored
-
- 08 Jun, 2023 1 commit
-
-
Pierce Freeman authored
-
- 25 May, 2023 1 commit
-
-
Tri Dao authored
-
- 19 May, 2023 1 commit
-
-
Tri Dao authored
-
- 17 May, 2023 1 commit
-
-
Tri Dao authored
-
- 27 Apr, 2023 1 commit
-
-
Anthony Hu authored
-
- 12 Apr, 2023 1 commit
-
-
Tri Dao authored
-
- 15 Mar, 2023 1 commit
-
-
Tri Dao authored
-
- 13 Mar, 2023 1 commit
-
-
Tri Dao authored
-
- 09 Feb, 2023 1 commit
-
-
Tri Dao authored
-
- 27 Dec, 2022 1 commit
-
-
Caleb Thomas authored
-
- 29 Nov, 2022 1 commit
-
-
Tri Dao authored
-
- 22 Nov, 2022 1 commit
-
-
Tri Dao authored
-
- 15 Nov, 2022 1 commit
-
-
Tri Dao authored
-
- 14 Nov, 2022 2 commits
- 24 Oct, 2022 1 commit
-
-
Tri Dao authored
-