[GPT-Neo] Simplify local attention (#13491)
* simplify local attention * update tests * add a comment and use torch.bitwise_xor
Showing
Please register or sign in to comment
* simplify local attention * update tests * add a comment and use torch.bitwise_xor