Unverified Commit cece8635 authored by Nicolas Patry's avatar Nicolas Patry Committed by GitHub
Browse files

Fixing rocm gptq by using triton code too (renamed cuda into triton). (#2691)

parent 43df056e
......@@ -10,8 +10,8 @@ from text_generation_server.utils.weights import Weight, Weights, WeightsLoader
if SYSTEM == "ipex":
from .ipex import QuantLinear
elif SYSTEM == "cuda":
from .cuda import QuantLinear
elif SYSTEM in {"cuda", "rocm"}:
from .triton import QuantLinear
@dataclass
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment