Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
TransformerEngine
Commits
782f6092
Commit
782f6092
authored
Jul 22, 2025
by
yuguo
Browse files
[DCU] fix compile
parent
eac75188
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
5 additions
and
1 deletion
+5
-1
transformer_engine/common/fused_router/fused_moe_aux_loss.cu
transformer_engine/common/fused_router/fused_moe_aux_loss.cu
+1
-1
transformer_engine/common/fused_router/utils.h
transformer_engine/common/fused_router/utils.h
+4
-0
No files found.
transformer_engine/common/fused_router/fused_moe_aux_loss.cu
View file @
782f6092
...
...
@@ -5,8 +5,8 @@
************************************************************************/
#include <assert.h>
#include <cooperative_groups.h>
#include <cuda_runtime.h>
#include <cooperative_groups.h>
#include <transformer_engine/fused_router.h>
#include "../common.h"
...
...
transformer_engine/common/fused_router/utils.h
View file @
782f6092
...
...
@@ -9,6 +9,10 @@
#include "transformer_engine/transformer_engine.h"
#ifdef __HIP_PLATFORM_AMD__
#define __syncwarp __syncthreads
#endif
namespace
transformer_engine
{
constexpr
size_t
kThreadsPerWarp
=
32
;
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment