Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
flash-attention
Commits
7c766b1bbc65d341f5f7fcead82946319c9657d0
Switch branch/tag
flash-attention
csrc
ft_attention
decoder_masked_multihead_attention_template.hpp
04 Jan, 2023
3 commits
[Gen, FT] Use fp32 accum for FMA
· be1afaa2
Tri Dao
authored
Jan 03, 2023
be1afaa2
[Gen, FT] Use tlength instead of params.timestep for rotary
· f266fc72
Tri Dao
authored
Jan 03, 2023
f266fc72
[Gen] Add kernel from FasterTransformer for benchmarking
· a01d1213
Tri Dao
authored
Jan 03, 2023
a01d1213