• Suraj Patil's avatar
    [WIP] GPT Neo cleanup (#10985) · 2a8115f0
    Suraj Patil authored
    * better names
    
    * add attention mixin
    
    * all slow tests in one class
    
    * make helper methods static so we can test
    
    * add local attention tests
    
    * better names
    
    * doc
    
    * apply review suggestions
    2a8115f0
test_modeling_gpt_neo.py 27.1 KB