Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
4599e05f
Commit
4599e05f
authored
Feb 04, 2026
by
zhuwenwen
Browse files
skip AiterInt8ScaledMMLinearKernel
parent
f509adcb
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
4 deletions
+4
-4
vllm/model_executor/layers/quantization/kernels/scaled_mm/__init__.py
...xecutor/layers/quantization/kernels/scaled_mm/__init__.py
+4
-4
No files found.
vllm/model_executor/layers/quantization/kernels/scaled_mm/__init__.py
View file @
4599e05f
...
...
@@ -7,9 +7,9 @@ from typing import TypeVar
import
torch
from
vllm.logger
import
init_logger
from
vllm.model_executor.layers.quantization.kernels.scaled_mm.aiter
import
(
AiterInt8ScaledMMLinearKernel
,
)
#
from vllm.model_executor.layers.quantization.kernels.scaled_mm.aiter import (
#
AiterInt8ScaledMMLinearKernel,
#
)
from
vllm.model_executor.layers.quantization.kernels.scaled_mm.cpu
import
(
CPUInt8ScaledMMLinearKernel
,
)
...
...
@@ -51,7 +51,7 @@ _POSSIBLE_INT8_KERNELS: dict[PlatformEnum, list[type[Int8ScaledMMLinearKernel]]]
CutlassInt8ScaledMMLinearKernel
,
TritonInt8ScaledMMLinearKernel
,
],
PlatformEnum
.
ROCM
:
[
AiterInt8ScaledMMLinearKernel
,
TritonInt8ScaledMMLinearKernel
],
PlatformEnum
.
ROCM
:
[
TritonInt8ScaledMMLinearKernel
],
# AiterInt8ScaledMMLinearKernel
}
# in priority/performance order (when available)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment