Unverified Commit cece8635 authored by Nicolas Patry's avatar Nicolas Patry Committed by GitHub
Browse files

Fixing rocm gptq by using triton code too (renamed cuda into triton). (#2691)

parent 43df056e
...@@ -10,8 +10,8 @@ from text_generation_server.utils.weights import Weight, Weights, WeightsLoader ...@@ -10,8 +10,8 @@ from text_generation_server.utils.weights import Weight, Weights, WeightsLoader
if SYSTEM == "ipex": if SYSTEM == "ipex":
from .ipex import QuantLinear from .ipex import QuantLinear
elif SYSTEM == "cuda": elif SYSTEM in {"cuda", "rocm"}:
from .cuda import QuantLinear from .triton import QuantLinear
@dataclass @dataclass
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment