- 10 Jul, 2024 2 commits
- 08 Jul, 2024 1 commit
-
-
Nicolas Patry authored
* Softcap v2 (fwd only). * Some missing interface + remove overrides in tests.
-
- 03 Jul, 2024 1 commit
-
-
muoshuosha authored
Co-authored-by:moshuosha <moshuosha@qq.com>
-
- 01 Jul, 2024 1 commit
-
-
cao lei authored
-
- 27 Jun, 2024 1 commit
-
-
Grigory Sizov authored
* Support unpadded LSE layout. Co-authored-by:
Xinfeng Xie <xfxie.ceca@gmail.com> Co-authored-by:
Jianyu Huang <hjyahead@gmail.com> * Cleanup * Fix unpadded LSE on split-kv path * Fix formatting and comments * Fix inline vs forceinline --------- Co-authored-by:
Xinfeng Xie <xfxie.ceca@gmail.com> Co-authored-by:
Jianyu Huang <hjyahead@gmail.com>
-
- 15 Mar, 2024 1 commit
-
-
Grigory Sizov authored
* Enable paged attention in varlen forward * Format + fix padding
-
- 21 Feb, 2024 1 commit
-
-
Tri Dao authored
-
- 23 Jan, 2024 1 commit
-
-
Tri Dao authored
Co-authored-by:ljss <450993438@qq.com>
-
- 14 Jan, 2024 1 commit
-
-
Tri Dao authored
-
- 13 Jan, 2024 1 commit
-
-
Tri Dao authored
-
- 24 Dec, 2023 1 commit
-
-
Tri Dao authored
-
- 22 Dec, 2023 1 commit
-
-
Tri Dao authored
-
- 03 Oct, 2023 1 commit
-
-
Tri Dao authored
-
- 26 Sep, 2023 1 commit
-
-
Tri Dao authored
Co-authored-by:Timothee Lacroix <t@mistral.ai>
-
- 24 Sep, 2023 1 commit
-
-
Tri Dao authored
-
- 21 Sep, 2023 1 commit
-
-
Tri Dao authored
-
- 18 Sep, 2023 1 commit
-
-
Tri Dao authored
-
- 16 Sep, 2023 1 commit
-
-
Tri Dao authored
-
- 13 Sep, 2023 1 commit
-
-
Tri Dao authored
-
- 04 Sep, 2023 2 commits
- 29 Aug, 2023 1 commit
-
-
Tri Dao authored
-
- 25 Aug, 2023 1 commit
-
-
Tri Dao authored
-
- 19 Aug, 2023 1 commit
-
-
Tri Dao authored
-
- 16 Aug, 2023 1 commit
-
-
Tri Dao authored
-
- 13 Aug, 2023 1 commit
-
-
Tri Dao authored
-
- 01 Aug, 2023 2 commits
- 17 Jul, 2023 1 commit
-
-
Tri Dao authored
-
- 13 Nov, 2022 2 commits
- 09 Nov, 2022 1 commit
-
-
Tri Dao authored
-
- 06 Nov, 2022 1 commit
-
-
Tri Dao authored
-
- 05 Nov, 2022 1 commit
-
-
Tri Dao authored
This is faster since we only need to do atomic adds on dq, instead of atomic adds on both dk and dv.
-
- 04 Nov, 2022 2 commits
- 02 Nov, 2022 1 commit
-
-
Tri Dao authored
-
- 01 Nov, 2022 1 commit
-
-
Tri Dao authored
-
- 31 Oct, 2022 1 commit
-
-
Tri Dao authored
-