Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
flash-attention
Repository
"megatron/optimizer_param_scheduler.py" did not exist on "6c3f6c7bb582b4509b28b64c3772e56f11627b7f"
011ec323d6064cb18fb9a55a0904cabf9d62b8aa
Switch branch/tag
flash-attention
flash_attn
models
gpt.py
Find file
Blame
History
Permalink
Support MQA + MP for decoding (#490)
· 011ec323
dan_the_3rd
authored
Aug 30, 2023
Co-authored-by: danthe3rd <danthe3rd>
011ec323
gpt.py
45.3 KB
Edit
Web IDE
Replace gpt.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace gpt.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.