skip quantizing per_layer_token_embd (#11207)
this tensor isn't compatible with cuda when quantized to q4_K so skip it
Showing
Please register or sign in to comment
this tensor isn't compatible with cuda when quantized to q4_K so skip it