OpenDAS / apex
Commit 20eda944, authored Jun 30, 2025 by limm
fixed a bug regarding PTX commands
Parent: e70ac2fd
Showing 1 changed file with 2 additions and 1 deletion
apex/contrib/csrc/group_norm/group_norm_nhwc_bwd_one_pass_kernel.cuh
@@ -328,7 +328,8 @@ __global__ __launch_bounds__(THREADS_PER_BLOCK_)
   // Mark the completion of the threadblock.
   if (threadIdx.x == 0) {
-    asm volatile("red.release.gpu.global.add.s32 [%0], 1;" ::"l"(barrier));
+    int one = 1;
+    asm volatile("red.release.gpu.global.add.s32 [%0], 1;" ::"l"(barrier), "r"(one));
   }
   // Exit if that's not the last thread block.
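As rendered here, the changed line adds an `"r"(one)` input operand while the instruction string still contains the literal `1`, so the new operand is bound but never referenced by the PTX text. If the intent was to pass the increment through that operand, the instruction would name it as `%1`. The following is a minimal sketch under that assumption, not the commit's actual code: `red_release_add`, `mark_done`, and the host driver are hypothetical names for illustration, and the `"memory"` clobber is an added precaution that the original line does not have. It assumes a GPU and toolchain that support `red.release.gpu` (compute capability 7.0 or newer, for example `nvcc -arch=sm_70`).

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper (not part of apex): release-ordered atomic add on a
// global counter. The increment is passed as input operand %1 instead of
// being hard-coded in the PTX string.
__device__ inline void red_release_add(int* counter, int value) {
  asm volatile("red.release.gpu.global.add.s32 [%0], %1;"
               :                            // no outputs: red returns nothing
               : "l"(counter), "r"(value)   // %0 = 64-bit address, %1 = 32-bit value
               : "memory");                 // assumption: keep the compiler from reordering memory ops
}

// Each block's thread 0 marks the block as complete, mirroring the
// "Mark the completion of the threadblock" step in the diff.
__global__ void mark_done(int* barrier) {
  if (threadIdx.x == 0) {
    red_release_add(barrier, 1);
  }
}

int main() {
  int* barrier = nullptr;
  cudaMalloc(&barrier, sizeof(int));
  cudaMemset(barrier, 0, sizeof(int));

  const int num_blocks = 8;  // arbitrary grid size for the demo
  mark_done<<<num_blocks, 32>>>(barrier);

  // The blocking copy on the default stream waits for the kernel to finish.
  int completed = 0;
  cudaMemcpy(&completed, barrier, sizeof(int), cudaMemcpyDeviceToHost);
  printf("blocks completed: %d (launched %d)\n", completed, num_blocks);

  cudaFree(barrier);
  return 0;
}
```

Both forms should add 1 to the counter. Referencing `%1` only makes the operand list and the instruction text agree, which may matter for toolchains that translate or validate the inline PTX.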