"git@developer.sourcefind.cn:OpenDAS/torch-scatter.git" did not exist on "d1dd94664b854bd37ad52cb47fd44c1ea5d99b3c"
Fix embeddings memory corruption (#6467)
* Fix embeddings memory corruption

  The patch was causing memory corruption through a buffer overrun. Once it was removed, though, parallelism in server.cpp led to hitting an assert because slot/seq IDs were >= the token count. To work around this, only use slot 0 for embeddings.

* Fix embed integration test assumption

  The token eval count has changed with recent llama.cpp bumps (0.3.5+).