Commit e1464c3a (unverified), authored by Isotr0py, committed by GitHub

[Quantization] Enable compressed-tensors AWQ for Turing GPU (#29732)


Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
parent a491b091
@@ -79,8 +79,8 @@ class CompressedTensorsWNA16(CompressedTensorsScheme):
     @classmethod
     def get_min_capability(cls) -> int:
-        # ampere and up
-        return 80
+        # Turing and up
+        return 75
     def create_weights(
         self,
......
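
The change above lowers the minimum capability from 80 to 75. As a minimal sketch of what that gate means, assuming the integer encodes the CUDA compute capability as `major * 10 + minor` (consistent with the diff's comments, where Ampere is 80 and Turing is 75): the helper names `capability_to_int` and `is_supported` are hypothetical and are not taken from the repository.

```python
# Sketch only: helper names are hypothetical; the 80 -> 75 values come from the diff.

def capability_to_int(major: int, minor: int) -> int:
    """Encode a compute capability (major, minor) as major * 10 + minor (assumed encoding)."""
    return major * 10 + minor


MIN_CAPABILITY_BEFORE = 80  # "ampere and up" (value removed by this commit)
MIN_CAPABILITY_AFTER = 75   # "Turing and up" (value added by this commit)


def is_supported(major: int, minor: int, min_capability: int) -> bool:
    """Return True if the device meets the scheme's minimum capability."""
    return capability_to_int(major, minor) >= min_capability


TURING = (7, 5)  # compute capability 7.5
AMPERE = (8, 0)  # compute capability 8.0
VOLTA = (7, 0)   # compute capability 7.0

# Turing was rejected under the old threshold and is accepted under the new one.
turing_before = is_supported(*TURING, MIN_CAPABILITY_BEFORE)
turing_after = is_supported(*TURING, MIN_CAPABILITY_AFTER)
```

Under this assumption, Ampere passes both thresholds, Turing passes only the new one, and Volta (7.0) remains below the minimum.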