"examples/vscode:/vscode.git/clone" did not exist on "6e4bc670993e498e0709dfdce7fa54e5ec94fdba"
Improve BERT-like models performance with better self attention (#9124)
* Improve BERT-like models attention layers * Apply style * Put back error raising instead of assert * Update template * Fix copies * Apply raising valueerror in MPNet * Restore the copy check for the Intermediate layer in Longformer * Update longformer
Showing
Please register or sign in to comment