Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
change
sglang
Commits
b49d6d0f
"...text-generation-inference.git" did not exist on "895c5f15628df870f7a2ced7151dedb84231a996"
Unverified
Commit
b49d6d0f
authored
Jan 31, 2025
by
Yineng Zhang
Committed by
GitHub
Jan 31, 2025
Browse files
support 12.5 CUDA runtime (#3231)
parent
c02e3139
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
10 additions
and
2 deletions
+10
-2
.github/workflows/release-docker.yml
.github/workflows/release-docker.yml
+4
-2
docker/Dockerfile
docker/Dockerfile
+6
-0
No files found.
.github/workflows/release-docker.yml
View file @
b49d6d0f
...
@@ -14,7 +14,7 @@ jobs:
...
@@ -14,7 +14,7 @@ jobs:
environment
:
'
prod'
environment
:
'
prod'
strategy
:
strategy
:
matrix
:
matrix
:
cuda_version
:
[
'
11.8.0'
,
'
12.1.1'
,
'
12.4.1'
]
cuda_version
:
[
'
11.8.0'
,
'
12.1.1'
,
'
12.4.1'
,
'
12.5.1'
]
build_type
:
[
'
all'
,
'
srt'
]
build_type
:
[
'
all'
,
'
srt'
]
steps
:
steps
:
-
name
:
Delete huge unnecessary tools folder
-
name
:
Delete huge unnecessary tools folder
...
@@ -39,6 +39,8 @@ jobs:
...
@@ -39,6 +39,8 @@ jobs:
cuda_tag="cu121"
cuda_tag="cu121"
elif [ "${{ matrix.cuda_version }}" = "12.4.1" ]; then
elif [ "${{ matrix.cuda_version }}" = "12.4.1" ]; then
cuda_tag="cu124"
cuda_tag="cu124"
elif [ "${{ matrix.cuda_version }}" = "12.5.1" ]; then
cuda_tag="cu125"
else
else
echo "Unsupported CUDA version"
echo "Unsupported CUDA version"
exit 1
exit 1
...
@@ -58,7 +60,7 @@ jobs:
...
@@ -58,7 +60,7 @@ jobs:
docker build . -f docker/Dockerfile --build-arg CUDA_VERSION=${{ matrix.cuda_version }} --build-arg BUILD_TYPE=${{ matrix.build_type }} -t lmsysorg/sglang:${tag}${tag_suffix} --no-cache
docker build . -f docker/Dockerfile --build-arg CUDA_VERSION=${{ matrix.cuda_version }} --build-arg BUILD_TYPE=${{ matrix.build_type }} -t lmsysorg/sglang:${tag}${tag_suffix} --no-cache
docker push lmsysorg/sglang:${tag}${tag_suffix}
docker push lmsysorg/sglang:${tag}${tag_suffix}
if [ "${{ matrix.cuda_version }}" = "12.
4
.1" ]; then
if [ "${{ matrix.cuda_version }}" = "12.
5
.1" ]; then
docker tag lmsysorg/sglang:${tag}${tag_suffix} lmsysorg/sglang:latest${tag_suffix}
docker tag lmsysorg/sglang:${tag}${tag_suffix} lmsysorg/sglang:latest${tag_suffix}
docker push lmsysorg/sglang:latest${tag_suffix}
docker push lmsysorg/sglang:latest${tag_suffix}
fi
fi
docker/Dockerfile
View file @
b49d6d0f
...
@@ -30,6 +30,8 @@ RUN python3 -m pip install --upgrade pip setuptools wheel html5lib six \
...
@@ -30,6 +30,8 @@ RUN python3 -m pip install --upgrade pip setuptools wheel html5lib six \
python3
-m
pip
install
torch
--index-url
https://download.pytorch.org/whl/cu121
;
\
python3
-m
pip
install
torch
--index-url
https://download.pytorch.org/whl/cu121
;
\
elif
[
"
$CUDA_VERSION
"
=
"12.4.1"
]
;
then
\
elif
[
"
$CUDA_VERSION
"
=
"12.4.1"
]
;
then
\
python3
-m
pip
install
torch
--index-url
https://download.pytorch.org/whl/cu124
;
\
python3
-m
pip
install
torch
--index-url
https://download.pytorch.org/whl/cu124
;
\
elif
[
"
$CUDA_VERSION
"
=
"12.5.1"
]
;
then
\
python3
-m
pip
install
torch
--index-url
https://download.pytorch.org/whl/cu124
;
\
elif
[
"
$CUDA_VERSION
"
=
"11.8.0"
]
;
then
\
elif
[
"
$CUDA_VERSION
"
=
"11.8.0"
]
;
then
\
python3
-m
pip
install
torch
--index-url
https://download.pytorch.org/whl/cu118
;
\
python3
-m
pip
install
torch
--index-url
https://download.pytorch.org/whl/cu118
;
\
python3
-m
pip
install
sgl-kernel
-i
https://docs.sglang.ai/whl/cu118
;
\
python3
-m
pip
install
sgl-kernel
-i
https://docs.sglang.ai/whl/cu118
;
\
...
@@ -42,6 +44,8 @@ RUN python3 -m pip install --upgrade pip setuptools wheel html5lib six \
...
@@ -42,6 +44,8 @@ RUN python3 -m pip install --upgrade pip setuptools wheel html5lib six \
python3
-m
pip
--no-cache-dir
install
-e
"python[srt]"
--find-links
https://flashinfer.ai/whl/cu121/torch2.4/flashinfer/
;
\
python3
-m
pip
--no-cache-dir
install
-e
"python[srt]"
--find-links
https://flashinfer.ai/whl/cu121/torch2.4/flashinfer/
;
\
elif
[
"
$CUDA_VERSION
"
=
"12.4.1"
]
;
then
\
elif
[
"
$CUDA_VERSION
"
=
"12.4.1"
]
;
then
\
python3
-m
pip
--no-cache-dir
install
-e
"python[srt]"
--find-links
https://flashinfer.ai/whl/cu124/torch2.4/flashinfer/
;
\
python3
-m
pip
--no-cache-dir
install
-e
"python[srt]"
--find-links
https://flashinfer.ai/whl/cu124/torch2.4/flashinfer/
;
\
elif
[
"
$CUDA_VERSION
"
=
"12.5.1"
]
;
then
\
python3
-m
pip
--no-cache-dir
install
-e
"python[srt]"
--find-links
https://flashinfer.ai/whl/cu124/torch2.4/flashinfer/
;
\
elif
[
"
$CUDA_VERSION
"
=
"11.8.0"
]
;
then
\
elif
[
"
$CUDA_VERSION
"
=
"11.8.0"
]
;
then
\
python3
-m
pip
--no-cache-dir
install
-e
"python[srt]"
--find-links
https://flashinfer.ai/whl/cu118/torch2.4/flashinfer/
;
\
python3
-m
pip
--no-cache-dir
install
-e
"python[srt]"
--find-links
https://flashinfer.ai/whl/cu118/torch2.4/flashinfer/
;
\
python3
-m
pip
install
sgl-kernel
-i
https://docs.sglang.ai/whl/cu118
;
\
python3
-m
pip
install
sgl-kernel
-i
https://docs.sglang.ai/whl/cu118
;
\
...
@@ -53,6 +57,8 @@ RUN python3 -m pip install --upgrade pip setuptools wheel html5lib six \
...
@@ -53,6 +57,8 @@ RUN python3 -m pip install --upgrade pip setuptools wheel html5lib six \
python3
-m
pip
--no-cache-dir
install
-e
"python[all]"
--find-links
https://flashinfer.ai/whl/cu121/torch2.4/flashinfer/
;
\
python3
-m
pip
--no-cache-dir
install
-e
"python[all]"
--find-links
https://flashinfer.ai/whl/cu121/torch2.4/flashinfer/
;
\
elif
[
"
$CUDA_VERSION
"
=
"12.4.1"
]
;
then
\
elif
[
"
$CUDA_VERSION
"
=
"12.4.1"
]
;
then
\
python3
-m
pip
--no-cache-dir
install
-e
"python[all]"
--find-links
https://flashinfer.ai/whl/cu124/torch2.4/flashinfer/
;
\
python3
-m
pip
--no-cache-dir
install
-e
"python[all]"
--find-links
https://flashinfer.ai/whl/cu124/torch2.4/flashinfer/
;
\
elif
[
"
$CUDA_VERSION
"
=
"12.5.1"
]
;
then
\
python3
-m
pip
--no-cache-dir
install
-e
"python[all]"
--find-links
https://flashinfer.ai/whl/cu124/torch2.4/flashinfer/
;
\
elif
[
"
$CUDA_VERSION
"
=
"11.8.0"
]
;
then
\
elif
[
"
$CUDA_VERSION
"
=
"11.8.0"
]
;
then
\
python3
-m
pip
--no-cache-dir
install
-e
"python[all]"
--find-links
https://flashinfer.ai/whl/cu118/torch2.4/flashinfer/
;
\
python3
-m
pip
--no-cache-dir
install
-e
"python[all]"
--find-links
https://flashinfer.ai/whl/cu118/torch2.4/flashinfer/
;
\
python3
-m
pip
install
sgl-kernel
-i
https://docs.sglang.ai/whl/cu118
;
\
python3
-m
pip
install
sgl-kernel
-i
https://docs.sglang.ai/whl/cu118
;
\
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment