Commit 3f92038b
Authored Jun 18, 2023 by Woosuk Kwon
Committed by GitHub on Jun 18, 2023
Add comments on swap space (#154)
parent dcda03b4
Showing 2 changed files with 8 additions and 2 deletions
benchmarks/benchmark_serving.py +2 -1
vllm/core/scheduler.py +6 -1
benchmarks/benchmark_serving.py @ 3f92038b
@@ -3,7 +3,8 @@
 On the server side, run one of the following commands:
     (vLLM backend)
     python -m vllm.entrypoints.api_server \
-        --disable-log-requests --model <your_model>
+        --model <your_model> --swap-space 16 \
+        --disable-log-requests

     (TGI backend)
     ./launch_hf_server.sh <your_model>
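
To get a feel for what `--swap-space 16` buys, the sketch below estimates how many KV-cache blocks fit in a CPU swap budget. It is a back-of-the-envelope calculation, not code from this commit: the helper names (`kv_block_bytes`, `num_cpu_blocks`), the block-size formula, the assumption that the flag is given in GiB, and the model dimensions are all illustrative assumptions.

```python
GIB = 1 << 30  # bytes per GiB; assumes the swap-space flag is expressed in GiB


def kv_block_bytes(block_size: int, num_layers: int, num_heads: int,
                   head_size: int, dtype_bytes: int) -> int:
    """Bytes needed to cache keys and values for one block of tokens.

    Hypothetical helper: assumes one key and one value vector per token,
    per head, per layer.
    """
    per_token = num_layers * num_heads * head_size * dtype_bytes
    return 2 * block_size * per_token  # factor 2: keys and values


def num_cpu_blocks(swap_space_gib: int, block_bytes: int) -> int:
    """Number of whole blocks that fit in the swap budget."""
    return (swap_space_gib * GIB) // block_bytes


# Illustrative dimensions only (40 layers, 40 heads, head size 128, fp16).
block_bytes = kv_block_bytes(block_size=16, num_layers=40, num_heads=40,
                             head_size=128, dtype_bytes=2)
blocks = num_cpu_blocks(16, block_bytes)
print(f"{block_bytes} bytes per block, {blocks} blocks in 16 GiB")
```

Under these assumed dimensions a single 16-token block costs 12.5 MiB, so a 16 GiB budget holds roughly 1,300 blocks. A heavy benchmark load can preempt enough sequences to exhaust a small budget, which is presumably why the benchmark instructions now set the flag explicitly.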
vllm/core/scheduler.py @ 3f92038b
@@ -409,7 +409,12 @@ class Scheduler:
         seq_group: SequenceGroup,
         blocks_to_swap_out: Dict[int, int],
     ) -> None:
-        assert self.block_manager.can_swap_out(seq_group)
+        if not self.block_manager.can_swap_out(seq_group):
+            # FIXME(woosuk): Abort the sequence group instead of aborting the
+            # entire engine.
+            raise RuntimeError(
+                "Aborted due to the lack of CPU swap space. Please increase "
+                "the swap space to avoid this error.")
         mapping = self.block_manager.swap_out(seq_group)
         blocks_to_swap_out.update(mapping)
         for seq in seq_group.get_seqs(status=SequenceStatus.RUNNING):
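
The change above swaps a bare `assert` for an explicit check that raises `RuntimeError` with an actionable message. A minimal, self-contained sketch of that pattern follows. `ToyBlockManager` and the free function `swap_out` are hypothetical stand-ins written for illustration; they are not vLLM's block manager or scheduler, and they track plain block IDs rather than sequence groups.

```python
from typing import Dict, List


class ToyBlockManager:
    """Hypothetical stand-in for a block manager (not vLLM's class)."""

    def __init__(self, num_free_cpu_blocks: int) -> None:
        self.num_free_cpu_blocks = num_free_cpu_blocks
        self._next_cpu_block = 0

    def can_swap_out(self, gpu_blocks: List[int]) -> bool:
        # Swapping out needs one free CPU block per GPU block.
        return len(gpu_blocks) <= self.num_free_cpu_blocks

    def swap_out(self, gpu_blocks: List[int]) -> Dict[int, int]:
        # Map each GPU block ID to a freshly allocated CPU block ID.
        mapping: Dict[int, int] = {}
        for gpu_block in gpu_blocks:
            mapping[gpu_block] = self._next_cpu_block
            self._next_cpu_block += 1
        self.num_free_cpu_blocks -= len(gpu_blocks)
        return mapping


def swap_out(manager: ToyBlockManager, gpu_blocks: List[int],
             blocks_to_swap_out: Dict[int, int]) -> None:
    # Explicit check instead of `assert`: it still runs under `python -O`
    # and tells the user what to change.
    if not manager.can_swap_out(gpu_blocks):
        raise RuntimeError(
            "Aborted due to the lack of CPU swap space. Please increase "
            "the swap space to avoid this error.")
    blocks_to_swap_out.update(manager.swap_out(gpu_blocks))


manager = ToyBlockManager(num_free_cpu_blocks=2)
pending: Dict[int, int] = {}
swap_out(manager, [7, 9], pending)  # fits: uses both free CPU blocks

error_message = ""
try:
    swap_out(manager, [11], pending)  # no CPU blocks left
except RuntimeError as exc:
    error_message = str(exc)
print(pending, error_message)
```

Two properties make the explicit check preferable here: `assert` statements are removed when Python runs with `-O`, and an `AssertionError` with no message gives the operator no hint that the fix is a larger swap space. As the FIXME in the diff notes, the raise still aborts the whole engine rather than only the affected sequence group.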