gaoqiong / flash-attention
Commit: f5d0fbd46805155a4406d36e8fb9f9d0324030c4
Path: flash-attention / csrc / ft_attention / decoder_masked_multihead_attention_ut...
[FT] Fix FT's single query attention for bf16 hdim128 rotary · f5d0fbd4
Tri Dao authored Mar 28, 2023
decoder_masked_multihead_attention_utils.h (51.5 KB)