Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
33650733
Commit
33650733
authored
Sep 07, 2025
by
zhuwenwen
Browse files
skip silu_and_mul_quant
parent
d2b52805
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
8 additions
and
8 deletions
+8
-8
vllm/compilation/activation_quant_fusion.py
vllm/compilation/activation_quant_fusion.py
+8
-8
No files found.
vllm/compilation/activation_quant_fusion.py
View file @
33650733
...
@@ -26,14 +26,14 @@ FP4_DTYPE = torch.uint8
...
@@ -26,14 +26,14 @@ FP4_DTYPE = torch.uint8
SILU_MUL_OP
=
torch
.
ops
.
_C
.
silu_and_mul
.
default
SILU_MUL_OP
=
torch
.
ops
.
_C
.
silu_and_mul
.
default
FUSED_OPS
:
dict
[
QuantKey
,
OpOverload
]
=
{
#
FUSED_OPS: dict[QuantKey, OpOverload] = {
kFp8StaticTensorSym
:
torch
.
ops
.
_C
.
silu_and_mul_quant
.
default
,
# noqa: E501
#
kFp8StaticTensorSym: torch.ops._C.silu_and_mul_quant.default, # noqa: E501
}
#
}
silu_and_mul_nvfp4_quant_supported
=
(
current_platform
.
is_cuda
()
and
hasattr
(
#
silu_and_mul_nvfp4_quant_supported = (current_platform.is_cuda() and hasattr(
torch
.
ops
.
_C
,
"silu_and_mul_nvfp4_quant"
))
#
torch.ops._C, "silu_and_mul_nvfp4_quant"))
if
silu_and_mul_nvfp4_quant_supported
:
#
if silu_and_mul_nvfp4_quant_supported:
FUSED_OPS
[
#
FUSED_OPS[
kNvfp4Quant
]
=
torch
.
ops
.
_C
.
silu_and_mul_nvfp4_quant
.
default
# noqa: E501
#
kNvfp4Quant] = torch.ops._C.silu_and_mul_nvfp4_quant.default # noqa: E501
class
ActivationQuantPattern
(
ABC
):
class
ActivationQuantPattern
(
ABC
):
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment