- 23 Oct, 2022 1 commit
-
-
Tri Dao authored
-
- 22 Oct, 2022 1 commit
-
-
Tri Dao authored
-
- 21 Oct, 2022 3 commits
- 17 Oct, 2022 2 commits
- 16 Oct, 2022 1 commit
-
-
Tri Dao authored
-
- 14 Oct, 2022 2 commits
- 10 Oct, 2022 1 commit
-
-
Tri Dao authored
build wheel workflow
-
- 06 Oct, 2022 2 commits
-
-
Tri Dao authored
remove numpy dependency
-
Antoine Adam authored
According to `setup.py`, the only dependencies are torch and einops. However, `bert_padding.py` requires `numpy` solely to multiply the elements of a `torch.Size` object. This change allows FlashAttention to be used without numpy.
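As a rough illustration of the idea (not necessarily the code this commit introduced), the product of a shape's dimensions can be computed with the standard library alone. `torch.Size` is a subclass of `tuple`, so the sketch below uses a plain tuple as a stand-in; the helper name `count_elements` is hypothetical.

```python
import math


def count_elements(shape):
    """Return the product of the dimensions in `shape`.

    `shape` can be any iterable of ints; a torch.Size works because it
    is a tuple subclass. This helper is illustrative only.
    """
    return math.prod(shape)


# A plain tuple stands in for a torch.Size such as tensor.shape[:2].
batch_and_seqlen = (2, 3, 4)
print(count_elements(batch_and_seqlen))
```

`math.prod` requires Python 3.8 or later. With PyTorch available, `torch.Size.numel()` should give the same result without any extra import.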
-
- 05 Oct, 2022 5 commits
-
-
Tri Dao authored
Make flash attention compile on Windows.
-
Eric Engelhart authored
-
Eric Engelhart authored
-
Eric Engelhart authored
-
Tri Dao authored
-
- 26 Sep, 2022 2 commits
-
-
robotcator authored
-
robotcator authored
-
- 12 Sep, 2022 1 commit
-
-
Tri Dao authored
-
- 11 Sep, 2022 1 commit
-
-
Tri Dao authored
-
- 09 Sep, 2022 1 commit
-
-
Tri Dao authored
-
- 06 Sep, 2022 2 commits
-
-
Tri Dao authored
Update flash_attention.py
-
eric-tc-wong authored
Recasting query and key after rotary_emb()
-
- 09 Aug, 2022 1 commit
-
-
Tri Dao authored
-
- 05 Aug, 2022 2 commits
- 22 Jul, 2022 1 commit
-
-
Tri Dao authored
-
- 12 Jul, 2022 1 commit
-
-
Tri Dao authored
Phil Tillet suggests calling it "experimental".
-
- 11 Jul, 2022 2 commits
- 10 Jul, 2022 6 commits
- 04 Jul, 2022 2 commits