Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
b639327a
Unverified
Commit
b639327a
authored
Jul 12, 2025
by
Michael Goin
Committed by
GitHub
Jul 11, 2025
Browse files
Revert "Use NVCC --compress-mode to reduce binary size by 30% #20694" (#20853)
Signed-off-by:
mgoin
<
mgoin64@gmail.com
>
parent
4afe687a
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
0 additions
and
10 deletions
+0
-10
CMakeLists.txt
CMakeLists.txt
+0
-10
No files found.
CMakeLists.txt
View file @
b639327a
...
@@ -171,16 +171,6 @@ if(NVCC_THREADS AND VLLM_GPU_LANG STREQUAL "CUDA")
...
@@ -171,16 +171,6 @@ if(NVCC_THREADS AND VLLM_GPU_LANG STREQUAL "CUDA")
list
(
APPEND VLLM_GPU_FLAGS
"--threads=
${
NVCC_THREADS
}
"
)
list
(
APPEND VLLM_GPU_FLAGS
"--threads=
${
NVCC_THREADS
}
"
)
endif
()
endif
()
#
# Set nvcc fatbin compression.
#
if
(
VLLM_GPU_LANG STREQUAL
"CUDA"
)
if
(
${
CMAKE_CUDA_COMPILER_VERSION
}
VERSION_GREATER_EQUAL 12.8
)
list
(
APPEND VLLM_GPU_FLAGS
"-Xfatbin"
"-compress-all"
"-compress-mode=size"
)
endif
()
endif
()
#
#
# Use FetchContent for C++ dependencies that are compiled as part of vLLM's build process.
# Use FetchContent for C++ dependencies that are compiled as part of vLLM's build process.
# setup.py will override FETCHCONTENT_BASE_DIR to play nicely with sccache.
# setup.py will override FETCHCONTENT_BASE_DIR to play nicely with sccache.
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment