[Nvidia] Integrate SM100 cudnn prefill API to MLA prefill (#20411)
Signed-off-by:Elfie Guo <elfieg@nvidia.com> Co-authored-by:
Elfie Guo <eflieg@nvidia.com>
Showing
vllm/envs.py
100644 → 100755
vllm/v1/attention/backends/mla/common.py
100644 → 100755
Please register or sign in to comment