OpenDAS
vllm_cscc
Commit d72b0be3 (Unverified)
Authored Feb 25, 2026 by Chendi.Xue; committed by GitHub, Feb 25, 2026

[XPU]Fix for Qwen-OMNI crash (#35249)

Signed-off-by: Chendi Xue <chendi.xue@intel.com>

Parent: 42489e43

Showing 1 changed file with 2 additions and 1 deletion

vllm/_xpu_ops.py (+2 -1)
@@ -105,9 +105,10 @@ class xpu_ops:
         assert len(window_size) == 2
         real_window_size = (window_size[0], window_size[1])  # noqa: F841
-        # In encode attention, v maybe not contiguous and current
+        # In encode attention, k and v maybe not contiguous and current
         # kernel can't handle it
         if block_table is None:
+            k = k.contiguous()
             v = v.contiguous()
         return flash_attn_varlen_func(
             out=out,
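
For readers unfamiliar with why `.contiguous()` matters here, the following is a minimal, library-free sketch of what "contiguous" means for a strided tensor. It is not code from this commit. The helper names `row_major_strides` and `is_contiguous` are hypothetical, and the fused-QKV slicing scenario is only an assumed illustration of how a key or value view could end up non-contiguous; the commit itself does not state where the non-contiguous `k` and `v` come from.

```python
from __future__ import annotations

from typing import Sequence


def row_major_strides(shape: Sequence[int]) -> tuple[int, ...]:
    """Strides (in elements) of a densely packed row-major tensor."""
    strides = []
    step = 1
    for dim in reversed(shape):
        strides.append(step)
        step *= dim
    return tuple(reversed(strides))


def is_contiguous(shape: Sequence[int], strides: Sequence[int]) -> bool:
    """True when the strides match the dense row-major layout.

    Dimensions of size 1 are skipped: their stride never affects addressing.
    """
    expected = row_major_strides(shape)
    return all(s == e for d, s, e in zip(shape, strides, expected) if d != 1)


# Assumed scenario: k is a column slice of a fused (num_tokens, 3 * hidden)
# buffer. The slice keeps the parent's row stride, so rows are not adjacent.
num_tokens, hidden = 4, 8
k_view_shape = (num_tokens, hidden)
k_view_strides = (3 * hidden, 1)

print(is_contiguous(k_view_shape, k_view_strides))                   # view of the fused buffer
print(is_contiguous(k_view_shape, row_major_strides(k_view_shape)))  # layout after a dense copy
```

A kernel that assumes dense rows would read the wrong elements from the first layout, which is the kind of failure a copy into dense storage avoids. The copy costs extra memory traffic, which may be why the diff applies it only when `block_table is None`.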