OpenDAS / DeepEP
Commit 6c4ba139
Authored Jun 04, 2026 by lijian

fix hang when sm greater than 32.

Signed-off-by: lijian <34831075+lijian0711@users.noreply.github.com>
Parent: d8aab4e1

Showing 1 changed file with 1 addition and 1 deletion

csrc/kernels/internode.cu
@@ -1285,7 +1285,7 @@ void cached_notify(int hidden_int4, int num_scales, int num_topk_idx, int num_to
                    int num_max_nvl_chunked_recv_tokens, int** barrier_signal_ptrs, int rank,
                    hipStream_t stream, int64_t num_rdma_bytes, int64_t num_nvl_bytes,
                    bool is_cached_dispatch, bool low_latency_mode) {
-    const int num_threads = ::max(128, kWarpSize * num_channels);
+    const int num_threads = ::min(1024, ::max(128, kWarpSize * num_channels));
     const auto num_rdma_ranks = num_ranks / NUM_MAX_NVL_PEERS;
 
     // Get clean meta
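
The change clamps the launch width of cached_notify so that it never exceeds 1024 threads. A minimal host-side sketch of the arithmetic follows. It assumes kWarpSize is 64 (the usual AMD wavefront width; the diff does not show its value) and that 1024 is the per-block thread limit of the target device. The helper names num_threads_before and num_threads_after and the constant kMaxThreadsPerBlock are hypothetical and do not appear in the commit. If each channel is served by two SMs, as in upstream DeepEP (also an assumption here), more than 32 SMs means more than 16 channels, which is the point where the unclamped product passes 1024.

```cpp
#include <algorithm>

// Assumed values, not taken from the commit: a 64-lane wavefront and a
// 1024-thread-per-block launch limit.
constexpr int kWarpSize = 64;
constexpr int kMaxThreadsPerBlock = 1024;

// Thread count as computed before the fix: at least 128 threads,
// one warp per channel, and no upper bound.
inline int num_threads_before(int num_channels) {
    return std::max(128, kWarpSize * num_channels);
}

// Thread count as computed after the fix: same lower bound,
// clamped to the assumed per-block limit.
inline int num_threads_after(int num_channels) {
    return std::min(kMaxThreadsPerBlock, std::max(128, kWarpSize * num_channels));
}
```

Under these assumptions, 17 channels give 1088 threads before the fix and 1024 after it. The diff does not show how the kernel body assigns channels to warps once the count is clamped, so this sketch covers only the launch-size computation.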