Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
composable_kernel_ROCM
Commits
59e3cb05
Commit
59e3cb05
authored
Jan 24, 2025
by
Qianfeng Zhang
Browse files
Remove last block_sync_lds() in the loop
parent
3ee41b40
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
0 additions
and
3 deletions
+0
-3
include/ck_tile/ops/fmha/pipeline/block_fmha_pipeline_qr_ks_vs_async.hpp
.../ops/fmha/pipeline/block_fmha_pipeline_qr_ks_vs_async.hpp
+0
-3
No files found.
include/ck_tile/ops/fmha/pipeline/block_fmha_pipeline_qr_ks_vs_async.hpp
View file @
59e3cb05
...
...
@@ -426,8 +426,6 @@ struct BlockFmhaPipelineQRKSVSAsync
}
move_tile_window
(
v_dram_window
,
{
0
,
kK1
});
__builtin_amdgcn_sched_barrier
(
0
);
if
constexpr
(
NumVLdsBuffers
>
1
)
{
v_buf
=
load_tile
(
v_dram_window
);
// load next v_buf
...
...
@@ -623,7 +621,6 @@ struct BlockFmhaPipelineQRKSVSAsync
get_slice_tile
(
v_lds_window
,
sequence
<
((
k1_loops
-
1
)
%
NumVLdsBuffers
)
*
kN1
,
0
>
{},
sequence
<
(((
k1_loops
-
1
)
%
NumVLdsBuffers
)
+
1
)
*
kN1
,
kK1
>
{}));
block_sync_lds
();
}
}
while
(
++
i_total_loops
<
num_total_loop
);
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment