Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
Lmdeploy
Commits
8c672a7b
"vscode:/vscode.git/clone" did not exist on "007982e799ffee14065ba7773b3b78c448535639"
Unverified
Commit
8c672a7b
authored
Nov 29, 2023
by
q.yao
Committed by
GitHub
Nov 29, 2023
Browse files
fix turbomind build on sm<80 (#754)
* fix * fix lint
parent
4744b28c
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
6 additions
and
0 deletions
+6
-0
src/turbomind/models/llama/flash_attention2/flash_fwd_launch_template.h
...models/llama/flash_attention2/flash_fwd_launch_template.h
+6
-0
No files found.
src/turbomind/models/llama/flash_attention2/flash_fwd_launch_template.h
View file @
8c672a7b
...
...
@@ -14,7 +14,13 @@
template
<
typename
Kernel_traits
,
bool
Is_dropout
,
bool
Is_causal
,
bool
Is_even_N
,
bool
Is_even_K
,
bool
Return_softmax
>
__global__
void
flash_fwd_kernel
(
Flash_fwd_params
params
)
{
#if __CUDA_ARCH__ >= 800
flash
::
compute_attn
<
Kernel_traits
,
Is_dropout
,
Is_causal
,
Is_even_N
,
Is_even_K
,
Return_softmax
>
(
params
);
#else
// TODO: support flash attention2 on sm<80
assert
(
false
);
#endif
}
template
<
typename
Kernel_traits
,
bool
Is_dropout
,
bool
Is_causal
>
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment