Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
f9c069c8
Unverified
Commit
f9c069c8
authored
May 14, 2025
by
bnellnm
Committed by
GitHub
May 14, 2025
Browse files
Modularize fused experts and integrate PPLX kernels (#15956)
parent
418d2f8b
Changes
42
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
4 additions
and
2 deletions
+4
-2
vllm/worker/worker.py
vllm/worker/worker.py
+2
-1
vllm/worker/xpu_worker.py
vllm/worker/xpu_worker.py
+2
-1
No files found.
vllm/worker/worker.py
View file @
f9c069c8
...
...
@@ -530,7 +530,8 @@ def init_worker_distributed_environment(
init_distributed_environment
(
parallel_config
.
world_size
,
rank
,
distributed_init_method
,
local_rank
)
ensure_model_parallel_initialized
(
parallel_config
.
tensor_parallel_size
,
parallel_config
.
pipeline_parallel_size
)
parallel_config
.
pipeline_parallel_size
,
parallel_config
.
enable_expert_parallel
)
ensure_kv_transfer_initialized
(
vllm_config
)
...
...
vllm/worker/xpu_worker.py
View file @
f9c069c8
...
...
@@ -176,7 +176,8 @@ class XPUWorker(LoRANotSupportedWorkerBase, Worker):
ensure_model_parallel_initialized
(
parallel_config
.
tensor_parallel_size
,
parallel_config
.
pipeline_parallel_size
)
parallel_config
.
pipeline_parallel_size
,
parallel_config
.
enable_expert_parallel
)
# global all_reduce needed for overall oneccl warm up
torch
.
distributed
.
all_reduce
(
torch
.
zeros
(
1
).
xpu
())
...
...
Prev
1
2
3
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment