[quantization] AWQ Marlin doesn't work when dtype is bfloat16 (#11494)
Signed-off-by: Kai-Hsun Chen <khchen@x.ai>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>