OpenDAS / vllm_cscc
Commit 6e97eccf (Unverified)
Authored Nov 05, 2025 by Frost Mitchell
Committed by GitHub, Nov 05, 2025
[XPU] Enable custom routing functions in IPEX for Llama4 (#28004)
Signed-off-by: frost-intel <frost.mitchell@intel.com>
parent 6ab18381
Showing 1 changed file with 1 addition and 1 deletion
vllm/model_executor/layers/fused_moe/layer.py
...
@@ -915,7 +915,6 @@ class UnquantizedFusedMoEMethod(FusedMoEMethodBase, CustomOp):
             or logical_replica_count is not None
         ):
             raise NotImplementedError("Expert load balancing is not supported for XPU.")
-        assert custom_routing_function is None
         return layer.ipex_fusion(
             x,
             use_grouped_topk,
...
@@ -924,6 +923,7 @@ class UnquantizedFusedMoEMethod(FusedMoEMethodBase, CustomOp):
             renormalize,
             topk_group,
             num_expert_group,
+            custom_routing_function=custom_routing_function,
         )
 
     def forward_tpu(
...
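The change replaces a hard `assert custom_routing_function is None` with forwarding the callable to the fused kernel. As a rough, dependency-free sketch of that pattern (not the vLLM or IPEX implementation), the snippet below shows a selector that falls back to softmax top-k routing unless a custom routing callable is supplied. The names `softmax_topk`, `sigmoid_topk`, and `select_experts` are hypothetical, the routing rule in `sigmoid_topk` is only an illustrative example, and the simplified `(logits, topk, renormalize)` signature is an assumption made for this example.

```python
import math
from typing import Callable, List, Optional, Tuple

Routing = Tuple[List[float], List[int]]  # (expert weights, expert ids)


def softmax_topk(logits: List[float], topk: int, renormalize: bool) -> Routing:
    """Default routing: softmax over all experts, then keep the top-k."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    ids = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:topk]
    weights = [probs[i] for i in ids]
    if renormalize:
        norm = sum(weights)
        weights = [w / norm for w in weights]
    return weights, ids


def sigmoid_topk(logits: List[float], topk: int, renormalize: bool) -> Routing:
    """Hypothetical custom routing: take the top-k logits, then apply a sigmoid.

    `renormalize` is accepted for signature compatibility and ignored here.
    """
    ids = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:topk]
    weights = [1.0 / (1.0 + math.exp(-logits[i])) for i in ids]
    return weights, ids


def select_experts(
    logits: List[float],
    topk: int,
    renormalize: bool,
    custom_routing_function: Optional[Callable[..., Routing]] = None,
) -> Routing:
    """Forward to the custom routing callable when given, else use the default."""
    if custom_routing_function is not None:
        return custom_routing_function(logits, topk, renormalize)
    return softmax_topk(logits, topk, renormalize)
```

Calling `select_experts(logits, 2, True)` uses the default path, while passing `custom_routing_function=sigmoid_topk` switches the weighting rule without touching the caller, which is the behavior the forwarded keyword argument is meant to allow.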