OpenDAS
text-generation-inference
aadc9cb485e3837a2603da512d2a798ebb7db5ee
text-generation-inference/server/text_generation_server/models/flash_causal_lm.py
Fix prefix caching + speculative decoding (#2711) · aadc9cb4
Travis Addair authored Nov 04, 2024
flash_causal_lm.py
93.8 KB