"tests/vscode:/vscode.git/clone" did not exist on "50632adc583fe1af8e2ee112b48c61c14153574f"
Unverified Commit 3132a933 authored by Lucas Wilkinson's avatar Lucas Wilkinson Committed by GitHub
Browse files

[Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build only...


[Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build only supports pack_gqa (for build size reasons). (#12405)
Signed-off-by: default avatarLucas Wilkinson <lwilkinson@neuralmagic.com>
parent df5dafaa
......@@ -576,7 +576,7 @@ else()
FetchContent_Declare(
vllm-flash-attn
GIT_REPOSITORY https://github.com/vllm-project/flash-attention.git
GIT_TAG 0aff05f577e8a10086066a00618609199b25231d
GIT_TAG 9732b0ce005d1e6216864788502d5570004678f5
GIT_PROGRESS TRUE
# Don't share the vllm-flash-attn build between build types
BINARY_DIR ${CMAKE_BINARY_DIR}/vllm-flash-attn
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment