OpenDAS
text-generation-inference
Commits
db4cb5e4ed2b994131fc575eff6689da9b661679
text-generation-inference/server/text_generation_server/models/flash_llama.py
21 Apr, 2023
2 commits
fix(server): fix past key values logic (#216)
· db4cb5e4
OlivierDehaene
authored
Apr 21, 2023
@njhill fyi
feat(router): add device and dtype info (#215)
· 343437c7
OlivierDehaene
authored
Apr 21, 2023
19 Apr, 2023
1 commit
feat(server): support quantization for flash models (#200)
· e14ae3b5
OlivierDehaene
authored
Apr 19, 2023
closes #197
11 Apr, 2023
1 commit
feat(server): add flash attention llama (#144)
· 299217c9
OlivierDehaene
authored
Apr 11, 2023