Unverified Commit 30299a41 authored by Cody Yu's avatar Cody Yu Committed by GitHub
Browse files

[MISC] Remove FP8 warning (#5472)


Co-authored-by: default avatarPhilipp Moritz <pcmoritz@gmail.com>
parent 85657b56
...@@ -212,7 +212,7 @@ class ModelConfig: ...@@ -212,7 +212,7 @@ class ModelConfig:
f"{self.quantization} quantization is currently not " f"{self.quantization} quantization is currently not "
f"supported in ROCm.") f"supported in ROCm.")
if (self.quantization if (self.quantization
not in ["marlin", "gptq_marlin_24", "gptq_marlin"]): not in ("fp8", "marlin", "gptq_marlin_24", "gptq_marlin")):
logger.warning( logger.warning(
"%s quantization is not fully " "%s quantization is not fully "
"optimized yet. The speed can be slower than " "optimized yet. The speed can be slower than "
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment