- 03 Sep, 2024 1 commit
-
-
Jongseok Park authored
-
- 01 Aug, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 31 May, 2024 1 commit
-
-
Antoni Baum authored
-
- 19 May, 2024 2 commits
-
-
Woosuk Kwon authored
-
Woosuk Kwon authored
-
- 08 Apr, 2024 2 commits
- 28 Mar, 2024 5 commits
-
-
Woosuk Kwon authored
-
Driss Guessous authored
-
ljss authored
-
Driss Guessous authored
-
ljss authored
-
- 26 Mar, 2024 17 commits
- 15 Mar, 2024 2 commits
-
-
Driss Guessous authored
-
Grigory Sizov authored
* Enable paged attention in varlen forward * Format + fix padding
-
- 14 Mar, 2024 1 commit
-
-
Chirag Jain authored
-
- 21 Feb, 2024 2 commits
- 20 Feb, 2024 1 commit
-
-
Tri Dao authored
-
- 10 Feb, 2024 2 commits
-
-
Tri Dao authored
-
Brian Hirsh authored
-
- 08 Feb, 2024 1 commit
-
-
Grigory Sizov authored
-
- 30 Jan, 2024 1 commit
-
-
Jeremy Reizenstein authored
For faster and smaller builds in some simple cases, provide switches to allow disabling -backward -alibi -uneven k -dropout -local attention Co-authored-by:Jeremy Francis Reizenstein <bottler@users.noreply.github.com>
-
- 23 Jan, 2024 1 commit
-
-
Tri Dao authored
Co-authored-by:ljss <450993438@qq.com>
-