[Bugfix] Revert QKVCrossParallelLinear usage in Mllama to keep BNB quantization working (#14498)
Signed-off-by: Isotr0py <2037008807@qq.com>