Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
628ec6c1
Unverified
Commit
628ec6c1
authored
Dec 29, 2024
by
Liangfu Chen
Committed by
GitHub
Dec 30, 2024
Browse files
[Docker] bump up neuron sdk v2.21 (#11593)
Signed-off-by:
Liangfu Chen
<
liangfc@amazon.com
>
parent
3682e33f
Changes
4
Show whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
6 additions
and
8 deletions
+6
-8
Dockerfile.neuron
Dockerfile.neuron
+3
-3
requirements-neuron.txt
requirements-neuron.txt
+2
-2
vllm/_custom_ops.py
vllm/_custom_ops.py
+1
-2
vllm/triton_utils/importing.py
vllm/triton_utils/importing.py
+0
-1
No files found.
Dockerfile.neuron
View file @
628ec6c1
# default base image
# https://gallery.ecr.aws/neuron/pytorch-inference-neuronx
ARG BASE_IMAGE="public.ecr.aws/neuron/pytorch-inference-neuronx:2.
1.2
-neuronx-py310-sdk2.2
0.2
-ubuntu2
0
.04"
ARG BASE_IMAGE="public.ecr.aws/neuron/pytorch-inference-neuronx:2.
5.1
-neuronx-py310-sdk2.2
1.0
-ubuntu2
2
.04"
FROM $BASE_IMAGE
...
...
@@ -22,9 +22,9 @@ WORKDIR ${APP_MOUNT}/vllm
RUN python3 -m pip install --upgrade pip
RUN python3 -m pip install --no-cache-dir fastapi ninja tokenizers pandas
RUN python3 -m pip install sentencepiece transformers==4.
36
.2 -U
RUN python3 -m pip install sentencepiece transformers==4.
45
.2 -U
RUN python3 -m pip install transformers-neuronx --extra-index-url=https://pip.repos.neuron.amazonaws.com -U
RUN python3 -m pip install
--pre
neuronx-cc==2.15.
*
--extra-index-url=https://pip.repos.neuron.amazonaws.com -U
RUN python3 -m pip install neuronx-cc==2.1
6.34
5.
0
--extra-index-url=https://pip.repos.neuron.amazonaws.com -U
COPY . .
ARG GIT_REPO_CHECK=0
...
...
requirements-neuron.txt
View file @
628ec6c1
...
...
@@ -2,6 +2,6 @@
-r requirements-common.txt
# Dependencies for Neuron devices
transformers-neuronx >= 0.1
2
.0
torch-neuronx >= 2.
1.2
transformers-neuronx >= 0.1
3
.0
torch-neuronx >= 2.
5.0
neuronx-cc
vllm/_custom_ops.py
View file @
628ec6c1
...
...
@@ -23,8 +23,7 @@ with contextlib.suppress(ImportError):
import
vllm._moe_C
# noqa: F401
supports_moe_ops
=
True
# neuron has torch version that doesn't even have impl_abstract
if
TYPE_CHECKING
or
current_platform
.
is_neuron
():
if
TYPE_CHECKING
:
def
register_fake
(
fn
):
return
lambda
name
:
fn
...
...
vllm/triton_utils/importing.py
View file @
628ec6c1
...
...
@@ -8,7 +8,6 @@ logger = init_logger(__name__)
HAS_TRITON
=
(
find_spec
(
"triton"
)
is
not
None
and
not
current_platform
.
is_xpu
()
# Not compatible
and
not
current_platform
.
is_neuron
()
# neuron has too old torch
)
if
not
HAS_TRITON
:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment