fix of use of unquantized weights in cohere GQA loading, also enable … (#2291)
fix of use of unquantized weights in cohere GQA loading, also enable the model in intel platform
Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com>
Showing
Please register or sign in to comment