Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
59abbd84
Unverified
Commit
59abbd84
authored
Jun 09, 2025
by
Conroy Cheers
Committed by
GitHub
Jun 09, 2025
Browse files
[Fix] Allow kernel compilation for CUDA capability 8.7 (#19328)
Signed-off-by:
Conroy Cheers
<
conroy@corncheese.org
>
parent
95a6568b
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
3 deletions
+3
-3
CMakeLists.txt
CMakeLists.txt
+3
-3
No files found.
CMakeLists.txt
View file @
59abbd84
...
@@ -308,7 +308,7 @@ if(VLLM_GPU_LANG STREQUAL "CUDA")
...
@@ -308,7 +308,7 @@ if(VLLM_GPU_LANG STREQUAL "CUDA")
# Keep building Marlin for 9.0 as there are some group sizes and shapes that
# Keep building Marlin for 9.0 as there are some group sizes and shapes that
# are not supported by Machete yet.
# are not supported by Machete yet.
# 9.0 for latest bf16 atomicAdd PTX
# 9.0 for latest bf16 atomicAdd PTX
cuda_archs_loose_intersection
(
MARLIN_ARCHS
"8.0;9.0+PTX"
"
${
CUDA_ARCHS
}
"
)
cuda_archs_loose_intersection
(
MARLIN_ARCHS
"8.0;
8.7;
9.0+PTX"
"
${
CUDA_ARCHS
}
"
)
if
(
MARLIN_ARCHS
)
if
(
MARLIN_ARCHS
)
#
#
...
@@ -454,7 +454,7 @@ if(VLLM_GPU_LANG STREQUAL "CUDA")
...
@@ -454,7 +454,7 @@ if(VLLM_GPU_LANG STREQUAL "CUDA")
# kernels for the remaining archs that are not already built for 3x.
# kernels for the remaining archs that are not already built for 3x.
# (Build 8.9 for FP8)
# (Build 8.9 for FP8)
cuda_archs_loose_intersection
(
SCALED_MM_2X_ARCHS
cuda_archs_loose_intersection
(
SCALED_MM_2X_ARCHS
"7.5;8.0;8.9+PTX"
"
${
CUDA_ARCHS
}
"
)
"7.5;8.0;8.
7;8.
9+PTX"
"
${
CUDA_ARCHS
}
"
)
# subtract out the archs that are already built for 3x
# subtract out the archs that are already built for 3x
list
(
REMOVE_ITEM SCALED_MM_2X_ARCHS
${
SCALED_MM_3X_ARCHS
}
)
list
(
REMOVE_ITEM SCALED_MM_2X_ARCHS
${
SCALED_MM_3X_ARCHS
}
)
if
(
SCALED_MM_2X_ARCHS
)
if
(
SCALED_MM_2X_ARCHS
)
...
@@ -684,7 +684,7 @@ if(VLLM_GPU_LANG STREQUAL "CUDA")
...
@@ -684,7 +684,7 @@ if(VLLM_GPU_LANG STREQUAL "CUDA")
list
(
APPEND VLLM_MOE_EXT_SRC
"
${
VLLM_MOE_WNA16_SRC
}
"
)
list
(
APPEND VLLM_MOE_EXT_SRC
"
${
VLLM_MOE_WNA16_SRC
}
"
)
# 9.0 for latest bf16 atomicAdd PTX
# 9.0 for latest bf16 atomicAdd PTX
cuda_archs_loose_intersection
(
MARLIN_MOE_ARCHS
"8.0;9.0+PTX"
"
${
CUDA_ARCHS
}
"
)
cuda_archs_loose_intersection
(
MARLIN_MOE_ARCHS
"8.0;
8.7;
9.0+PTX"
"
${
CUDA_ARCHS
}
"
)
if
(
MARLIN_MOE_ARCHS
)
if
(
MARLIN_MOE_ARCHS
)
#
#
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment