- 23 Jul, 2024 2 commits
- 22 Jul, 2024 1 commit
-
-
Phil Wang authored
* check in the two ways of approaching backwards for softcapping, both functional * prepare the softcap switch for backwards * temporary * cleanup to the way Tri prefers * calculate dtanh when copying from scores -> dtanh Tensor * no ternary operators allowed for constexpr, so just use some hack found online * fix maybe_dtanh, restore some files * restore another file * move calculate_dtanh to utils and colocate with apply_softcap * cleanup * maybe last cleanup * save for another pr * remove a stray line * fix spacing * fix an issue, and make test_flash_attn.py ready to test softcapping backwards
-
- 11 Jul, 2024 1 commit
-
-
Tri Dao authored
-
- 10 Jul, 2024 3 commits
- 08 Jul, 2024 1 commit
-
-
Nicolas Patry authored
* Softcap v2 (fwd only). * Some missing interface + remove overrides in tests.
-
- 01 Jul, 2024 2 commits
-
-
66RING authored
-
Liang authored
Co-authored-by:zl <zl@deepseek.com>
-
- 27 Jun, 2024 1 commit
-
-
Grigory Sizov authored
* Support unpadded LSE layout. Co-authored-by:
Xinfeng Xie <xfxie.ceca@gmail.com> Co-authored-by:
Jianyu Huang <hjyahead@gmail.com> * Cleanup * Fix unpadded LSE on split-kv path * Fix formatting and comments * Fix inline vs forceinline --------- Co-authored-by:
Xinfeng Xie <xfxie.ceca@gmail.com> Co-authored-by:
Jianyu Huang <hjyahead@gmail.com>
-
- 26 May, 2024 1 commit
-
-
Tri Dao authored
-
- 08 Apr, 2024 1 commit
-
-
Tri Dao authored
-
- 28 Mar, 2024 2 commits
-
-
Driss Guessous authored
-
ljss authored
-
- 15 Mar, 2024 1 commit
-
-
Driss Guessous authored
-
- 21 Feb, 2024 1 commit
-
-
Tri Dao authored
-
- 20 Feb, 2024 1 commit
-
-
Tri Dao authored
-
- 30 Jan, 2024 1 commit
-
-
Jeremy Reizenstein authored
For faster and smaller builds in some simple cases, provide switches to allow disabling -backward -alibi -uneven k -dropout -local attention Co-authored-by:Jeremy Francis Reizenstein <bottler@users.noreply.github.com>
-
- 23 Jan, 2024 2 commits
-
-
Tri Dao authored
Co-authored-by:ljss <450993438@qq.com>
-
Tri Dao authored
-
- 22 Jan, 2024 1 commit
-
-
Tri Dao authored
-
- 21 Jan, 2024 6 commits
- 20 Jan, 2024 1 commit
-
-
Tri Dao authored
-
- 15 Jan, 2024 2 commits
- 14 Jan, 2024 5 commits
- 13 Jan, 2024 2 commits
- 12 Jan, 2024 1 commit
-
-
Tri Dao authored
-
- 24 Dec, 2023 1 commit
-
-
Tri Dao authored
-