"vllm/vscode:/vscode.git/clone" did not exist on "679ca5d8d346ede84c9cbba5d6a8789723c295c0"
[Neuron][Kernel] NKI-based flash-attention kernel with paged KV cache (#11277)
Signed-off-by:Liangfu Chen <liangfc@amazon.com> Co-authored-by:
Jiangfei Duan <jfduan@outlook.com>
Showing
Please register or sign in to comment