OpenDAS / vllm_cscc
Commit 21d22cbd
Authored Nov 20, 2025 by zhuwenwen; committed by jujl1 on Nov 21, 2025
Update the choice between default scheduling and split scheduling for different scenarios
Parent: 155fe0d2
Showing 2 changed files with 10 additions and 4 deletions

vllm/model_executor/layers/rotary_embedding.py (+3 -2)
vllm/v1/core/sched/scheduler.py (+7 -2)

vllm/model_executor/layers/rotary_embedding.py
@@ -937,8 +937,9 @@ class DeepseekScalingRotaryEmbedding(RotaryEmbedding):
         if self.cos_sin_cache.device != positions.device:
             self.cos_sin_cache: torch.Tensor = self.cos_sin_cache.to(
                 positions.device)
-        cos_sin = self.cos_sin_cache[torch.add(positions, offsets)
-                                     if offsets is not None else positions]
+        if not envs.VLLM_USE_LIGHTOP:
+            cos_sin = self.cos_sin_cache[torch.add(positions, offsets)
+                                         if offsets is not None else positions]
         if query.device.type == 'cuda' and not self.is_neox_style \
             and not self.reference:
             assert len(query.shape) == 3
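
As a rough illustration of what this hunk changes, the sketch below mimics the gated cache lookup with plain Python lists instead of tensors. The function name `gather_cos_sin` and the `use_lightop` parameter are hypothetical stand-ins for the code path guarded by `envs.VLLM_USE_LIGHTOP`. Why the gather can be skipped is not stated in the commit, so the comment on that point is an assumption.

```python
from typing import List, Optional, Sequence, Tuple

Row = Tuple[float, float]


def gather_cos_sin(
    cache: Sequence[Row],
    positions: Sequence[int],
    offsets: Optional[Sequence[int]] = None,
    use_lightop: bool = False,
) -> Optional[List[Row]]:
    """Look up one cache row per position, adding an offset when given.

    Returns None when the lookup is skipped (hypothetical LightOp path).
    """
    if use_lightop:
        # Assumption: the alternative kernel reads the cache itself,
        # so no gathered cos/sin rows are produced here.
        return None
    if offsets is not None:
        indices = [p + o for p, o in zip(positions, offsets)]
    else:
        indices = list(positions)
    return [cache[i] for i in indices]


# Toy stand-in for cos_sin_cache: row i holds (i, -i).
cache = [(float(i), float(-i)) for i in range(8)]
with_offsets = gather_cos_sin(cache, [1, 2], offsets=[1, 1])
without_offsets = gather_cos_sin(cache, [0, 3])
skipped = gather_cos_sin(cache, [0, 3], use_lightop=True)
```

In this sketch, any caller that later reads the gathered rows has to handle the skipped case. Whether the real code has such a caller cannot be seen from the hunk.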

vllm/v1/core/sched/scheduler.py
@@ -1028,10 +1028,15 @@ class Scheduler(SchedulerInterface):
         return scheduler_output
 
     def schedule(self) -> SchedulerOutput:
-        if envs.VLLM_USE_PD_SPLIT or (self.full_cuda_graph and self.use_mla and self.num_spec_tokens > 0):
+        if envs.VLLM_USE_PD_SPLIT:
             return self.schedule_split_pd()
         else:
-            return self.schedule_default()
+            if self.connector is not None:
+                return self.schedule_default()
+            if self.full_cuda_graph and self.use_mla and self.num_spec_tokens > 0:
+                return self.schedule_split_pd()
+            else:
+                return self.schedule_default()
 
     def _update_after_schedule(
         self,
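
The decision logic in this hunk can be restated as a pure function, which makes the behavioral change easier to see. This is a minimal sketch: the function names, the string return values, and the `has_connector` parameter (standing in for `self.connector is not None`) are hypothetical and not part of the commit.

```python
def choose_scheduler_old(use_pd_split: bool, has_connector: bool,
                         full_cuda_graph: bool, use_mla: bool,
                         num_spec_tokens: int) -> str:
    """Selection before the commit: the connector is not consulted."""
    if use_pd_split or (full_cuda_graph and use_mla and num_spec_tokens > 0):
        return "split_pd"
    return "default"


def choose_scheduler_new(use_pd_split: bool, has_connector: bool,
                         full_cuda_graph: bool, use_mla: bool,
                         num_spec_tokens: int) -> str:
    """Selection after the commit: an explicit flag wins, then the connector."""
    if use_pd_split:
        return "split_pd"
    if has_connector:
        # New in this commit: a configured connector forces default scheduling.
        return "default"
    if full_cuda_graph and use_mla and num_spec_tokens > 0:
        return "split_pd"
    return "default"


# The only input combination whose result differs between the two versions:
# flag off, connector present, full CUDA graph + MLA + speculative tokens.
case = dict(use_pd_split=False, has_connector=True,
            full_cuda_graph=True, use_mla=True, num_spec_tokens=2)
old_choice = choose_scheduler_old(**case)
new_choice = choose_scheduler_new(**case)
```

Read this way, the commit leaves every case unchanged except the one where a connector is present and `VLLM_USE_PD_SPLIT` is off: that case now falls back to default scheduling even when the full-CUDA-graph, MLA, and speculative-token conditions all hold.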