Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
74d5543ec589daaa4ac042d65d52dccf26ee3f2c
Switch branch/tag
vllm_cscc
vllm
attention
backends
flashinfer.py
Find file
Blame
History
Permalink
[Core][Kernels] Use FlashInfer backend for FP8 KV Cache when available. (#7798)
· b98cc28f
Pavani Majety
authored
Aug 28, 2024
Co-authored-by:
Simon Mo
<
simon.mo@hey.com
>
b98cc28f
flashinfer.py
30.8 KB
Edit
Web IDE
Replace flashinfer.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace flashinfer.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.