Enable bitsandbytes quantization on AMD GPUs that use warp size 32 (#27307)
Signed-off-by:
sstamenk <strahinja.stamenkovic@amd.com>
Showing
Please register or sign in to comment
Signed-off-by:
sstamenk <strahinja.stamenkovic@amd.com>