Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
8e836d98
Unverified
Commit
8e836d98
authored
Oct 15, 2024
by
Michael Goin
Committed by
GitHub
Oct 14, 2024
Browse files
[Doc] Fix code formatting in spec_decode.rst (#9348)
parent
44eaa5a5
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
3 deletions
+3
-3
docs/source/models/spec_decode.rst
docs/source/models/spec_decode.rst
+3
-3
No files found.
docs/source/models/spec_decode.rst
View file @
8e836d98
...
...
@@ -44,10 +44,10 @@ To perform the same with an online mode launch the server:
..
code
-
block
::
bash
python
-
m
vllm
.
entrypoints
.
openai
.
api_server
--
host
0.0.0.0
--
port
8000
--
model
facebook
/
opt
-
6.7
b
\
--
seed
42
-
tp
1
--
speculative_model
facebook
/
opt
-
125
m
--
use
-
v2
-
block
-
manager
\
--
num_speculative_tokens
5
--
gpu_memory_utilization
0.8
--
seed
42
-
tp
1
--
speculative_model
facebook
/
opt
-
125
m
--
use
-
v2
-
block
-
manager
\
--
num_speculative_tokens
5
--
gpu_memory_utilization
0.8
Then
use
a
client
:
Then
use
a
client
:
..
code
-
block
::
python
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment