Do not shift context for sliding window models (#5368)
* Do not shift context for sliding window models
* truncate prompt > 2/3 tokens
* only target gemma2