[FEAT][ROCm] Enable running Flash Attention as ViT attn backend for Qwen-VL models on ROCm platform. (#22069)

Signed-off-by: tjtanaavllm <tunjian.tan@amd.com>
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Co-authored-by: tjtanaavllm <tunjian.tan@amd.com>