set VLLM_USE_FUSED_QA_KVA_GEMM=1 feat: w4a8Linear calls apply_int8_linear to support blaslt