Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
345be0e2
Unverified
Commit
345be0e2
authored
Aug 28, 2024
by
Philipp Schmid
Committed by
GitHub
Aug 27, 2024
Browse files
[benchmark] Update TGI version (#7917)
parent
fc911880
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
1 deletion
+1
-1
benchmarks/launch_tgi_server.sh
benchmarks/launch_tgi_server.sh
+1
-1
No files found.
benchmarks/launch_tgi_server.sh
View file @
345be0e2
...
@@ -6,7 +6,7 @@ TOKENS=$2
...
@@ -6,7 +6,7 @@ TOKENS=$2
docker run
-e
HF_TOKEN
=
$HF_TOKEN
--gpus
all
--shm-size
1g
-p
$PORT
:80
\
docker run
-e
HF_TOKEN
=
$HF_TOKEN
--gpus
all
--shm-size
1g
-p
$PORT
:80
\
-v
$PWD
/data:/data
\
-v
$PWD
/data:/data
\
ghcr.io/huggingface/text-generation-inference:
1.4
.0
\
ghcr.io/huggingface/text-generation-inference:
2.2
.0
\
--model-id
$MODEL
\
--model-id
$MODEL
\
--sharded
false
\
--sharded
false
\
--max-input-length
1024
\
--max-input-length
1024
\
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment