Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
flash-attention
Commits
10b6f3a8
"docs/source/en/api/pipelines/overview.mdx" did not exist on "48269070d23ad8a4c6f31bc6847c358aac182ad1"
Commit
10b6f3a8
authored
Feb 26, 2024
by
skrider
Committed by
Woosuk Kwon
Mar 28, 2024
Browse files
revert hardcoded rotcossin thread layout
parent
166f33fd
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
3 deletions
+1
-3
csrc/flash_attn/src/kernel_traits.h
csrc/flash_attn/src/kernel_traits.h
+1
-3
No files found.
csrc/flash_attn/src/kernel_traits.h
View file @
10b6f3a8
...
...
@@ -158,9 +158,7 @@ struct Flash_fwd_kernel_traits : public Base {
make_tiled_copy
(
Copy_Atom
<
DefaultCopy
,
ElementAccum
>
{},
GmemLayoutAtomOaccum
{},
Layout
<
Shape
<
_1
,
_4
>>
{}));
// Val layout, 4 vals per store
// using GmemLayoutAtomRotcossin = GmemLayoutAtom;
using
GmemLayoutAtomRotcossin
=
Layout
<
Shape
<
Int
<
kNThreads
/
kGmemThreadsPerRow
>
,
Int
<
kGmemThreadsPerRow
>>
,
Stride
<
Int
<
kGmemThreadsPerRow
>
,
_1
>>
;
using
GmemLayoutAtomRotcossin
=
GmemLayoutAtom
;
using
GmemTiledCopyRotcossin
=
decltype
(
make_tiled_copy
(
Copy_Atom
<
UniversalCopy
<
uint64_t
>
,
Element
>
{},
GmemLayoutAtomRotcossin
{},
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment