OpenDAS / text-generation-inference
Commits at 343437c7b5987c64cd3353b0913ffad5fd3df4b5
File: text-generation-inference/server/text_generation_server/models/flash_llama.py
21 Apr, 2023 (1 commit)
feat(router): add device and dtype info (#215) · 343437c7
OlivierDehaene authored Apr 21, 2023
19 Apr, 2023 (1 commit)
feat(server): support quantization for flash models (#200) · e14ae3b5
OlivierDehaene authored Apr 19, 2023
closes #197
11 Apr, 2023 (1 commit)
feat(server): add flash attention llama (#144) · 299217c9
OlivierDehaene authored Apr 11, 2023