vllm/attention/backends/flash_attn.py · 6832707e90d460bc1d1eec550e0035af72db7a27 · OpenDAS / vllm_cscc · GitLab

Find file Blame History Permalink

[V1][Bugfix] Standardize quantized kv cache rejection for attention backends (#14221) · 6832707e
Michael Goin authored Mar 06, 2025
```
Signed-off-by: mgoin <mgoin64@gmail.com>
```
6832707e

flash_attn.py 39.7 KB