* update supported matrix * change the default shard size when saving quantized weights * baichuan2 kv8
Attach a file by drag & drop or click to upload