-
vllmellm authored
[FEAT][ROCm] Enable running Flash Attention as ViT attn backend for Qwen-VL models on ROCm platform. (#22069) Signed-off-by:
tjtanaavllm <tunjian.tan@amd.com> Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
tjtanaavllm <tunjian.tan@amd.com>
d3a6f212