Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
15859f23
Unverified
Commit
15859f23
authored
Dec 15, 2024
by
Jee Jee Li
Committed by
GitHub
Dec 15, 2024
Browse files
[[Misc]Upgrade bitsandbytes to the latest version 0.45.0 (#11201)
parent
88693683
Changes
5
Show whitespace changes
Inline
Side-by-side
Showing
5 changed files
with
8 additions
and
8 deletions
+8
-8
Dockerfile
Dockerfile
+1
-1
docs/source/quantization/bnb.rst
docs/source/quantization/bnb.rst
+1
-1
requirements-test.in
requirements-test.in
+1
-1
requirements-test.txt
requirements-test.txt
+1
-1
vllm/model_executor/layers/quantization/bitsandbytes.py
vllm/model_executor/layers/quantization/bitsandbytes.py
+4
-4
No files found.
Dockerfile
View file @
15859f23
...
@@ -218,7 +218,7 @@ FROM vllm-base AS vllm-openai
...
@@ -218,7 +218,7 @@ FROM vllm-base AS vllm-openai
# install additional dependencies for openai api server
# install additional dependencies for openai api server
RUN
--mount
=
type
=
cache,target
=
/root/.cache/pip
\
RUN
--mount
=
type
=
cache,target
=
/root/.cache/pip
\
pip
install
accelerate hf_transfer
'modelscope!=1.15.0'
'bitsandbytes>=0.4
4
.0'
timm
==
0.9.10
pip
install
accelerate hf_transfer
'modelscope!=1.15.0'
'bitsandbytes>=0.4
5
.0'
timm
==
0.9.10
ENV
VLLM_USAGE_SOURCE production-docker-image
ENV
VLLM_USAGE_SOURCE production-docker-image
...
...
docs/source/quantization/bnb.rst
View file @
15859f23
...
@@ -11,7 +11,7 @@ Below are the steps to utilize BitsAndBytes with vLLM.
...
@@ -11,7 +11,7 @@ Below are the steps to utilize BitsAndBytes with vLLM.
.. code-block:: console
.. code-block:: console
$ pip install bitsandbytes>=0.4
4
.0
$ pip install bitsandbytes>=0.4
5
.0
vLLM reads the model's config file and supports both in-flight quantization and pre-quantized checkpoint.
vLLM reads the model's config file and supports both in-flight quantization and pre-quantized checkpoint.
...
...
requirements-test.in
View file @
15859f23
...
@@ -25,7 +25,7 @@ datamodel_code_generator # required for minicpm3 test
...
@@ -25,7 +25,7 @@ datamodel_code_generator # required for minicpm3 test
lm-eval[api]==0.4.4 # required for model evaluation test
lm-eval[api]==0.4.4 # required for model evaluation test
# quantization
# quantization
bitsandbytes>=0.4
4
.0
bitsandbytes>=0.4
5
.0
buildkite-test-collector==0.1.9
buildkite-test-collector==0.1.9
numpy < 2.0.0
numpy < 2.0.0
requirements-test.txt
View file @
15859f23
...
@@ -37,7 +37,7 @@ audioread==3.0.1
...
@@ -37,7 +37,7 @@ audioread==3.0.1
# via librosa
# via librosa
awscli==1.35.23
awscli==1.35.23
# via -r requirements-test.in
# via -r requirements-test.in
bitsandbytes
=
=0.4
4.1
bitsandbytes
>
=0.4
5.0
# via -r requirements-test.in
# via -r requirements-test.in
black==24.10.0
black==24.10.0
# via datamodel-code-generator
# via datamodel-code-generator
...
...
vllm/model_executor/layers/quantization/bitsandbytes.py
View file @
15859f23
...
@@ -145,12 +145,12 @@ class BitsAndBytesLinearMethod(LinearMethodBase):
...
@@ -145,12 +145,12 @@ class BitsAndBytesLinearMethod(LinearMethodBase):
def
__init__
(
self
,
quant_config
:
BitsAndBytesConfig
):
def
__init__
(
self
,
quant_config
:
BitsAndBytesConfig
):
try
:
try
:
import
bitsandbytes
import
bitsandbytes
if
bitsandbytes
.
__version__
<
"0.4
4
.0"
:
if
bitsandbytes
.
__version__
<
"0.4
5
.0"
:
raise
ImportError
(
"bitsandbytes version is wrong. Please "
raise
ImportError
(
"bitsandbytes version is wrong. Please "
"install bitsandbytes>=0.4
4
.0."
)
"install bitsandbytes>=0.4
5
.0."
)
except
ImportError
as
err
:
except
ImportError
as
err
:
raise
ImportError
(
"Please install bitsandbytes>=0.4
4
.0 via "
raise
ImportError
(
"Please install bitsandbytes>=0.4
5
.0 via "
"`pip install bitsandbytes>=0.4
4
.0` to use "
"`pip install bitsandbytes>=0.4
5
.0` to use "
"bitsandbytes quantizer."
)
from
err
"bitsandbytes quantizer."
)
from
err
self
.
quant_config
=
quant_config
self
.
quant_config
=
quant_config
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment