"pretrain_gpt.py" did not exist on "4b50683264413e4c8c60d469f3d8e9d20b3eb028"
runner.go: Add unit tests for context shifting
This also makes it easier to truncate long inputs the same as shifting but does not actually implement it. This type of truncation has a trade off between quality and time to first token.
Showing
Please register or sign in to comment