OpenDAS / TransformerEngine
Commit 6a34b657 (Unverified)
Authored Jan 15, 2026 by Jacket
Committed by GitHub, Jan 15, 2026
fix: enable opt for cutlass sources to avoid infinite compile time (#2595)
Signed-off-by: Kaining Zhong <kainingz@nvidia.com>
Parent: 6cbdb042
Changes: 1 changed file with 14 additions and 2 deletions

transformer_engine/common/CMakeLists.txt (+14, -2)
@@ -230,12 +230,24 @@ add_library(transformer_engine SHARED ${transformer_engine_SOURCES})
 target_include_directories(transformer_engine PUBLIC
                            "${CMAKE_CURRENT_SOURCE_DIR}/include")
 
-# CUTLASS kernels require SM90a and cause hang in debug build
+# Grouped GEMM kernels require SM90a
 set_property(
     SOURCE gemm/cutlass_grouped_gemm.cu
     APPEND
     PROPERTY
-    COMPILE_OPTIONS "--generate-code=arch=compute_90a,code=sm_90a;-g0")
+    COMPILE_OPTIONS "--generate-code=arch=compute_90a,code=sm_90a")
+
+# CUTLASS kernels could cause hang in debug build
+set(CUTLASS_KERNEL_SOURCES
+    gemm/cutlass_grouped_gemm.cu
+    hadamard_transform/group_hadamard_transform_cast_fusion.cu
+    hadamard_transform/group_row_cast_col_hadamard_transform_cast_fusion.cu
+    hadamard_transform/hadamard_transform_cast_fusion.cu)
+set_property(
+    SOURCE ${CUTLASS_KERNEL_SOURCES}
+    APPEND
+    PROPERTY
+    COMPILE_OPTIONS "-g0;-dopt=on")
 
 # Configure dependencies
 target_link_libraries(transformer_engine PUBLIC
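
To check at configure time which options each CUTLASS source actually receives, a small loop along these lines could be placed after the second `set_property` call in the same `CMakeLists.txt`. This is a hypothetical debugging aid, not part of the commit. The variable names `src` and `src_opts` are made up for illustration; `get_source_file_property` and `message` are standard CMake commands.

```cmake
# Hypothetical debugging aid (not part of this commit): print the per-source
# COMPILE_OPTIONS so the appended flags can be inspected at configure time.
# Assumes CUTLASS_KERNEL_SOURCES is defined as in the diff above and that this
# runs in the same directory scope as the set_property(SOURCE ...) calls.
foreach(src IN LISTS CUTLASS_KERNEL_SOURCES)
  # Yields NOTFOUND if no COMPILE_OPTIONS were set on the source file.
  get_source_file_property(src_opts ${src} COMPILE_OPTIONS)
  message(STATUS "${src}: ${src_opts}")
endforeach()
```

Since `gemm/cutlass_grouped_gemm.cu` is named in both `set_property` calls and both use `APPEND`, its reported list should contain the SM90a code-generation flag as well as `-g0` and `-dopt=on`, while the three Hadamard transform sources should report only the latter two.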