Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
e123aafd
Unverified
Commit
e123aafd
authored
Mar 04, 2025
by
Michael Goin
Committed by
GitHub
Mar 05, 2025
Browse files
Disable GPTQ AllSpark kernels for CUDA Compiler < 12.0 (#14157)
Signed-off-by:
mgoin
<
mgoin64@gmail.com
>
parent
5b143d33
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
2 deletions
+2
-2
CMakeLists.txt
CMakeLists.txt
+2
-2
No files found.
CMakeLists.txt
View file @
e123aafd
...
...
@@ -319,7 +319,7 @@ if(VLLM_GPU_LANG STREQUAL "CUDA")
# Only build AllSpark kernels if we are building for at least some compatible archs.
cuda_archs_loose_intersection
(
ALLSPARK_ARCHS
"8.0;8.6;8.7;8.9"
"
${
CUDA_ARCHS
}
"
)
if
(
ALLSPARK_ARCHS
)
if
(
${
CMAKE_CUDA_COMPILER_VERSION
}
VERSION_GREATER 12.0 AND
ALLSPARK_ARCHS
)
set
(
ALLSPARK_SRCS
"csrc/quantization/gptq_allspark/allspark_repack.cu"
"csrc/quantization/gptq_allspark/allspark_qgemm_w8a16.cu"
)
...
...
@@ -330,7 +330,7 @@ if(VLLM_GPU_LANG STREQUAL "CUDA")
message
(
STATUS
"Building AllSpark kernels for archs:
${
ALLSPARK_ARCHS
}
"
)
else
()
message
(
STATUS
"Not building AllSpark kernels as no compatible archs found"
" in CUDA target architectures"
)
" in CUDA target architectures
, or CUDA not >= 12.0
"
)
endif
()
# The cutlass_scaled_mm kernels for Hopper (c3x, i.e. CUTLASS 3.x) require
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment