[ROCm] Improve error handling while loading quantized model on gfx120… (#31715)
Signed-off-by:brian033 <85883730+brian033@users.noreply.github.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
Showing
Please register or sign in to comment