Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
FastMoE
Commits
c7e6a3db
Commit
c7e6a3db
authored
Dec 28, 2022
by
Rick Ho
Browse files
update header file processgroupnccl
parent
a762d33c
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
12 additions
and
0 deletions
+12
-0
cuda/fmoe_cuda.cpp
cuda/fmoe_cuda.cpp
+7
-0
cuda/global_exchange.cpp
cuda/global_exchange.cpp
+5
-0
No files found.
cuda/fmoe_cuda.cpp
View file @
c7e6a3db
...
...
@@ -5,7 +5,14 @@
// global_exchange
#ifdef FMOE_USE_NCCL
#if defined(TORCH_VERSION_MAJOR) && (TORCH_VERSION_MAJOR > 1 || \
(TORCH_VERSION_MAJOR == 1 && TORCH_VERSION_MINOR >= 13))
#include <torch/csrc/distributed/c10d/ProcessGroupNCCL.hpp>
#else
#include <c10d/ProcessGroupNCCL.hpp>
#endif
torch
::
Tensor
_expert_exchange
(
torch
::
Tensor
local_expert_count
,
long
n_expert
,
long
n_workers
);
...
...
cuda/global_exchange.cpp
View file @
c7e6a3db
...
...
@@ -98,7 +98,12 @@ torch::Tensor _global_gather(
return
local_output_buf
;
}
#if defined(TORCH_VERSION_MAJOR) && (TORCH_VERSION_MAJOR > 1 || \
(TORCH_VERSION_MAJOR == 1 && TORCH_VERSION_MINOR >= 13))
#include <torch/csrc/distributed/c10d/ProcessGroupNCCL.hpp>
#else
#include <c10d/ProcessGroupNCCL.hpp>
#endif
class
HackNCCLGroup
:
public
c10d
::
ProcessGroupNCCL
{
public:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment