[WIP] GPT Neo cleanup (#10985)
* better names
* add attention mixin
* all slow tests in one class
* make helper methods static so we can test
* add local attention tests
* better names
* doc
* apply review suggestions