[Frontend] Add chunked processing to handle long inputs in embedding models (#22280)
Signed-off-by:x22x22 <wadeking@qq.com> Signed-off-by:
Kdump <rootshellexp@gmail.com> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Maximilien de Bayser <maxdebayser@gmail.com> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
Showing
This diff is collapsed.
Please register or sign in to comment