Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
e08a3a3f
Unverified
Commit
e08a3a3f
authored
Sep 20, 2025
by
Michael Goin
Committed by
GitHub
Sep 20, 2025
Browse files
[CI Failure] Disable FlashInfer RoPE to unblock CI (#25299)
Signed-off-by:
mgoin
<
mgoin64@gmail.com
>
parent
3d9a1d2d
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
7 additions
and
7 deletions
+7
-7
vllm/model_executor/layers/rotary_embedding/base.py
vllm/model_executor/layers/rotary_embedding/base.py
+7
-7
No files found.
vllm/model_executor/layers/rotary_embedding/base.py
View file @
e08a3a3f
...
...
@@ -6,8 +6,6 @@ from typing import Optional
import
torch
from
vllm.model_executor.custom_op
import
CustomOp
from
vllm.platforms
import
current_platform
from
vllm.utils.flashinfer
import
has_flashinfer
from
.common
import
apply_rotary_emb_torch
...
...
@@ -32,13 +30,15 @@ class RotaryEmbedding(CustomOp):
self
.
base
=
base
self
.
is_neox_style
=
is_neox_style
self
.
dtype
=
dtype
# TODO(mgoin): disabled for now due to failures
# Flashinfer only supports head_size=64, 128, 256, 512.
# https://github.com/flashinfer-ai/flashinfer/blob/ebfd655efe830048dba5d582aaa61d61d1cf9a87/include/flashinfer/utils.cuh#L174-L202
self
.
use_flashinfer
=
(
self
.
enabled
()
and
dtype
in
(
torch
.
float16
,
torch
.
bfloat16
)
and
current_platform
.
is_cuda
()
and
has_flashinfer
()
and
self
.
head_size
in
[
64
,
128
,
256
,
512
])
# self.use_flashinfer = (self.enabled()
# and dtype in (torch.float16, torch.bfloat16)
# and current_platform.is_cuda()
# and has_flashinfer()
# and self.head_size in [64, 128, 256, 512])
self
.
use_flashinfer
=
False
cache
=
self
.
_compute_cos_sin_cache
()
if
not
self
.
use_flashinfer
:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment