Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
DeepEP
Commits
a42ecdc0
Commit
a42ecdc0
authored
Mar 31, 2026
by
lishen
Browse files
cached_notify_combine添加launch_bounds
parent
c2abba11
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
1 deletion
+2
-1
csrc/kernels/intranode.cu
csrc/kernels/intranode.cu
+2
-1
No files found.
csrc/kernels/intranode.cu
View file @
a42ecdc0
...
@@ -544,7 +544,8 @@ void dispatch(void *recv_x, float *recv_x_scales, int *recv_src_idx, int64_t *re
...
@@ -544,7 +544,8 @@ void dispatch(void *recv_x, float *recv_x_scales, int *recv_src_idx, int64_t *re
#undef DISPATCH_LAUNCH_CASE
#undef DISPATCH_LAUNCH_CASE
}
}
template
<
int
kNumRanks
>
template
<
int
kNumRanks
>
__global__
void
cached_notify_combine
(
void
**
buffer_ptrs
,
int
*
send_head
,
int
num_channels
,
__global__
void
__launch_bounds__
(
kWarpSize
*
NUM_MAX_NVL_PEERS
,
1
)
cached_notify_combine
(
void
**
buffer_ptrs
,
int
*
send_head
,
int
num_channels
,
int
num_recv_tokens
,
int
num_memset_int
,
int
num_recv_tokens
,
int
num_memset_int
,
int
**
barrier_signal_ptrs
,
int
rank
)
{
int
**
barrier_signal_ptrs
,
int
rank
)
{
const
auto
sm_id
=
static_cast
<
int
>
(
blockIdx
.
x
);
const
auto
sm_id
=
static_cast
<
int
>
(
blockIdx
.
x
);
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment