OpenDAS / vllm_cscc
Commit 33170081f13b4e4453f0fff8df994df8d9aa6c6c
vllm_cscc/tests/neuron/test_prefix_prefill.py
[Neuron][Kernel] Vectorize KV cache load in FlashPagedAttention to maximize DMA bandwidth (#13245)
· 33170081
Lingfan Yu authored Feb 20, 2025
Signed-off-by: Lingfan Yu <lingfany@amazon.com>
test_prefix_prefill.py
15.7 KB