Unverified commit 77d2a5f1, authored by Jonas M. Kübler, committed by GitHub

pick up tuned prefill configs for FP8 FA3 (#36265)


Signed-off-by: Jonas M. Kübler <44084297+jmkuebler@users.noreply.github.com>
Signed-off-by: Jonas Kuebler <kuebj@amazon.com>
parent 59192dfd
@@ -39,7 +39,7 @@ else()
FetchContent_Declare(
vllm-flash-attn
GIT_REPOSITORY https://github.com/vllm-project/flash-attention.git
- GIT_TAG 1488682bb545f7d020e958a33116b1419d1cfc83
+ GIT_TAG 29210221863736a08f71a866459e368ad1ac4a95
GIT_PROGRESS TRUE
# Don't share the vllm-flash-attn build between build types
BINARY_DIR ${CMAKE_BINARY_DIR}/vllm-flash-attn