[Bugfix] Reject channelwise quantization (group_size <= 0) in ExllamaLinearKernel (#37331)
Signed-off-by:
Matthias Gehre <matthias.gehre@amd.com>
Showing
Please register or sign in to comment
Signed-off-by:
Matthias Gehre <matthias.gehre@amd.com>