hotfix: fix regression from attention API change on Intel platform (#2439)
Fix a regression caused by the attention API change. ipex.varlen_attention does not
currently support KV input in paged-cache format.
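
To illustrate the layout mismatch, here is a minimal sketch in plain Python. It is not the change made in this commit, and every name in it (`gather_paged_kv`, `cache_blocks`, `block_tables`) is hypothetical. It assumes a paged cache that stores tokens in fixed-size blocks addressed through a per-sequence block table, and shows how such a cache could be flattened into the packed, contiguous layout with cumulative sequence lengths that variable-length attention kernels typically expect.

```python
from typing import Any, List, Sequence, Tuple


def gather_paged_kv(
    cache_blocks: Sequence[Sequence[Any]],
    block_tables: Sequence[Sequence[int]],
    seq_lens: Sequence[int],
    block_size: int,
) -> Tuple[List[Any], List[int]]:
    """Flatten a paged cache into a packed token list plus cumulative lengths.

    Hypothetical helper for illustration only; real caches hold tensors,
    here each token entry is just an opaque Python object.
    """
    packed: List[Any] = []
    cu_seqlens: List[int] = [0]
    for table, length in zip(block_tables, seq_lens):
        for pos in range(length):
            # Logical position -> (physical block, offset inside the block).
            block_id = table[pos // block_size]
            packed.append(cache_blocks[block_id][pos % block_size])
        # Record where this sequence ends in the packed list.
        cu_seqlens.append(cu_seqlens[-1] + length)
    return packed, cu_seqlens


# Two sequences (lengths 3 and 2) stored in blocks of size 2.
cache_blocks = [["a0", "a1"], ["b0", "b1"], ["a2", None]]
block_tables = [[0, 2], [1]]
packed, cu_seqlens = gather_paged_kv(cache_blocks, block_tables, [3, 2], 2)
```

In this sketch the first sequence spans physical blocks 0 and 2, so a kernel that only understands the packed layout cannot read it without the block table.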
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>