Unverified Commit ca179d0f authored by Tmn07's avatar Tmn07 Committed by GitHub
Browse files

[Bugfix] Fix activation quantization for compressed-tensors W4A16 (#31572)


Signed-off-by: default avatarTmn07 <tmn0796@gmail.com>
parent 013b5408
......@@ -114,7 +114,7 @@ class CompressedTensorsWNA16(CompressedTensorsScheme):
logger.info("Using %s for CompressedTensorsWNA16", kernel_type.__name__)
self._kernel_backends_being_used.add(kernel_type.__name__)
if isinstance(kernel_type, MarlinLinearKernel):
if kernel_type is MarlinLinearKernel:
input_dtype = get_marlin_input_dtype(self.layer_name)
if input_dtype is not None:
mp_linear_kernel_config.act_type = input_dtype
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment