jerrrrry / infinilm
csrc/models/llama/llama_attention.cpp @ a256e8d9ca49b59f8b2579a8147a41c5ef6a6e87
Commit a256e8d9 · suss · Mar 11, 2026
add mha_kvcache (#261)
* add mha_kvcache
* repair gqa-api bug
llama_attention.cpp · 20.2 KB
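The page captures only the file's metadata, not its contents. For orientation, below is a minimal C++ sketch of the technique the commit message names: one decode step of multi-head attention reading from a KV cache, including the grouped-query (GQA) head mapping that the "gqa-api" fix presumably concerns. All types and names here (KVCache, mha_kvcache_step, the cache layout) are hypothetical illustrations, not the actual API of infinilm's llama_attention.cpp.

```cpp
// Hypothetical sketch of MHA over a KV cache with GQA head grouping.
// None of these identifiers come from llama_attention.cpp.
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

struct KVCache {
    size_t n_kv_heads, head_dim;
    size_t len = 0;               // number of cached positions
    std::vector<float> k, v;      // layout: [step][kv_head][head_dim]

    // Append one token's projected K/V (each of size n_kv_heads * head_dim).
    void append(const std::vector<float>& k_step, const std::vector<float>& v_step) {
        k.insert(k.end(), k_step.begin(), k_step.end());
        v.insert(v.end(), v_step.begin(), v_step.end());
        ++len;
    }
};

// q: [n_heads][head_dim] for the current token; returns [n_heads][head_dim].
// Append the current token's K/V to the cache before calling.
std::vector<float> mha_kvcache_step(const std::vector<float>& q,
                                    const KVCache& cache, size_t n_heads) {
    const size_t d = cache.head_dim;
    const size_t group = n_heads / cache.n_kv_heads;  // query heads per KV head (GQA)
    const float scale = 1.0f / std::sqrt(static_cast<float>(d));
    std::vector<float> out(n_heads * d, 0.0f);

    for (size_t h = 0; h < n_heads; ++h) {
        const size_t kvh = h / group;  // map query head -> shared KV head
        // Scaled dot-product scores against every cached position.
        std::vector<float> scores(cache.len);
        float max_s = -1e30f;
        for (size_t t = 0; t < cache.len; ++t) {
            float s = 0.0f;
            const float* kt = &cache.k[(t * cache.n_kv_heads + kvh) * d];
            for (size_t i = 0; i < d; ++i) s += q[h * d + i] * kt[i];
            scores[t] = s * scale;
            max_s = std::max(max_s, scores[t]);
        }
        // Numerically stable softmax over the cache length.
        float sum = 0.0f;
        for (float& s : scores) { s = std::exp(s - max_s); sum += s; }
        // Weighted sum of cached values.
        for (size_t t = 0; t < cache.len; ++t) {
            const float w = scores[t] / sum;
            const float* vt = &cache.v[(t * cache.n_kv_heads + kvh) * d];
            for (size_t i = 0; i < d; ++i) out[h * d + i] += w * vt[i];
        }
    }
    return out;
}

int main() {
    const size_t n_heads = 4, n_kv_heads = 2, d = 8;
    KVCache cache{n_kv_heads, d};
    // One decode step: append the new token's K/V, then attend over the cache.
    std::vector<float> k_step(n_kv_heads * d, 0.1f), v_step(n_kv_heads * d, 0.2f);
    cache.append(k_step, v_step);
    std::vector<float> q(n_heads * d, 0.3f);
    std::vector<float> out = mha_kvcache_step(q, cache, n_heads);
    return out.empty();  // sanity check: non-empty output gives exit code 0
}
```

The design point the sketch illustrates: with GQA, `n_heads / n_kv_heads` query heads share one cached K/V head, so the cache is smaller than full MHA by that factor, and the query-to-KV head mapping (`h / group`) is exactly the kind of index arithmetic a "gqa-api bug" fix could plausibly touch.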