Unverified Commit b89275d0 authored by brian033's avatar brian033 Committed by GitHub
Browse files

[ROCm] Improve error handling while loading quantized model on gfx120… (#31715)


Signed-off-by: default avatarbrian033 <85883730+brian033@users.noreply.github.com>
Co-authored-by: default avatarTJian <tunjian.tan@embeddedllm.com>
parent 28459785
...@@ -153,7 +153,11 @@ try: ...@@ -153,7 +153,11 @@ try:
fake_impl=gemm_with_dynamic_quant_fake, fake_impl=gemm_with_dynamic_quant_fake,
dispatch_key=current_platform.dispatch_key, dispatch_key=current_platform.dispatch_key,
) )
except (ImportError, AttributeError): except (ImportError, AttributeError, RuntimeError):
logger.warning(
"AITER is not found or QuarkOCP_MX is not supported on the current "
"platform. QuarkOCP_MX quantization will not be available."
)
dynamic_mxfp4_quant = gemm_afp4wfp4 = None dynamic_mxfp4_quant = gemm_afp4wfp4 = None
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment