[Attention] Use sparse prefill kernel for fp8 kv-cache in DeepSeek-v3.2 (#27532)
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com>
Showing
This diff is collapsed.
vllm/v1/worker/workspace.py
0 → 100644
Please register or sign in to comment