Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
45a1a69b
"vscode:/vscode.git/clone" did not exist on "564985729abc267af281de11f737cfb29b5c0abb"
Unverified
Commit
45a1a69b
authored
May 30, 2024
by
Simon Mo
Committed by
GitHub
May 30, 2024
Browse files
[Build] Disable sm_90a in cu11 (#5141)
parent
87a658c8
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
8 additions
and
6 deletions
+8
-6
CMakeLists.txt
CMakeLists.txt
+8
-6
No files found.
CMakeLists.txt
View file @
45a1a69b
...
@@ -200,11 +200,13 @@ if(VLLM_GPU_LANG STREQUAL "CUDA")
...
@@ -200,11 +200,13 @@ if(VLLM_GPU_LANG STREQUAL "CUDA")
# The CUTLASS kernels for Hopper require sm90a to be enabled.
# The CUTLASS kernels for Hopper require sm90a to be enabled.
# This is done via the below gencode option, BUT that creates kernels for both sm90 and sm90a.
# This is done via the below gencode option, BUT that creates kernels for both sm90 and sm90a.
# That adds an extra 17MB to compiled binary, so instead we selectively enable it.
# That adds an extra 17MB to compiled binary, so instead we selectively enable it.
if
(
${
CMAKE_CUDA_COMPILER_VERSION
}
VERSION_GREATER 11
)
set_source_files_properties
(
set_source_files_properties
(
"csrc/quantization/cutlass_w8a8/scaled_mm_dq_c3x.cu"
"csrc/quantization/cutlass_w8a8/scaled_mm_dq_c3x.cu"
PROPERTIES
PROPERTIES
COMPILE_FLAGS
COMPILE_FLAGS
"-gencode arch=compute_90a,code=sm_90a"
)
"-gencode arch=compute_90a,code=sm_90a"
)
endif
()
endif
()
endif
()
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment