OpenDAS / DeepEP
Commit 23ded3bd (Unverified)
Authored Apr 29, 2025 by fzyzcjy
Committed by GitHub, Apr 29, 2025

Update deep_ep.cpp

Parent: 65e2a700
Showing 1 changed file with 2 additions and 0 deletions

csrc/deep_ep.cpp
...
@@ -614,6 +614,8 @@ Buffer::internode_dispatch(const torch::Tensor& x, const std::optional<torch::Te
                            const std::optional<torch::Tensor>& cached_rdma_channel_prefix_matrix, const std::optional<torch::Tensor>& cached_recv_rdma_rank_prefix_sum,
                            const std::optional<torch::Tensor>& cached_gbl_channel_prefix_matrix, const std::optional<torch::Tensor>& cached_recv_gbl_rank_prefix_sum,
                            int expert_alignment, const Config& config, std::optional<EventHandle>& previous_event, bool async, bool allocate_on_comm_stream) {
+    pybind11::gil_scoped_release release;
+
     const int num_channels = config.num_sms / 2;
     EP_HOST_ASSERT(config.num_sms % 2 == 0);
     EP_HOST_ASSERT(0 < get_num_rdma_ranks() and get_num_rdma_ranks() <= NUM_MAX_RDMA_PEERS);
...
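
Note on the added line: `pybind11::gil_scoped_release` is an RAII guard that releases the Python GIL when it is constructed and re-acquires it when it goes out of scope, so other Python threads can run while the C++ function is busy. The sketch below only imitates that pattern with a plain `std::mutex` standing in for the GIL, so that it compiles without Python or pybind11. `fake_gil`, `ScopedRelease`, `blocking_dispatch`, and `other_thread_makes_progress` are hypothetical names for illustration and are not part of DeepEP or pybind11.

```cpp
#include <atomic>
#include <mutex>
#include <thread>

// Stand-in for the Python GIL (assumption: a plain mutex is enough to
// illustrate the locking pattern; the real GIL is managed by the interpreter).
std::mutex fake_gil;

// Hypothetical guard with the same shape as pybind11::gil_scoped_release:
// unlock on construction, re-lock on destruction.
class ScopedRelease {
public:
    explicit ScopedRelease(std::mutex& m) : m_(m) { m_.unlock(); }
    ~ScopedRelease() { m_.lock(); }
    ScopedRelease(const ScopedRelease&) = delete;
    ScopedRelease& operator=(const ScopedRelease&) = delete;

private:
    std::mutex& m_;
};

// Hypothetical long-running call. The caller must hold fake_gil on entry.
// It waits until `ready` is set by another thread.
void blocking_dispatch(const std::atomic<bool>& ready) {
    ScopedRelease release(fake_gil);  // other threads may take the lock now
    while (!ready.load()) {
        std::this_thread::yield();
    }
}  // the lock is re-acquired here, before control returns to the caller

// Runs blocking_dispatch while a second thread needs the lock to proceed.
// Returns true if the second thread ran while the first was still waiting.
bool other_thread_makes_progress() {
    std::atomic<bool> ready{false};
    std::atomic<bool> progressed{false};

    fake_gil.lock();  // the caller enters the extension holding the "GIL"
    std::thread other([&] {
        std::lock_guard<std::mutex> hold(fake_gil);  // needs the "GIL"
        progressed = true;
        ready = true;  // lets blocking_dispatch finish
    });

    blocking_dispatch(ready);
    const bool result = progressed.load();

    fake_gil.unlock();
    other.join();
    return result;
}
```

In this sketch, removing the `ScopedRelease` line from `blocking_dispatch` would leave the second thread blocked on the lock while the first waits for it, which is the kind of stall the guard is meant to avoid.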