[`GPTNeoX`] Faster rotary embedding for GPTNeoX (based on llama changes) (#25830)
* Faster rotary embedding for GPTNeoX
* there might be unnecessary moves from device
* fixup
* fix dtype issue
* add copied from statements
* fix copies
* oupsy
* add copied from Llama for scaled ones as well
* fixup
* fix
* fix copies
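The commit message does not spell out what makes the rotary embedding faster. As a rough illustration only, the sketch below shows one common way to speed it up: precompute the cos/sin tables once and look them up by position instead of recomputing them on every call. It is written in plain Python so it runs without dependencies, whereas the real model code works on PyTorch tensors. The names `build_rotary_cache`, `rotate_half`, and `apply_rotary`, and the default `base=10000.0`, are assumptions made for this example and are not taken from the PR.

```python
import math


def build_rotary_cache(head_dim, max_positions, base=10000.0):
    """Precompute cos/sin tables once (hypothetical helper, not the PR's code)."""
    # One inverse frequency per pair of dimensions.
    inv_freq = [1.0 / (base ** (2 * i / head_dim)) for i in range(head_dim // 2)]
    cos_cache, sin_cache = [], []
    for pos in range(max_positions):
        angles = [pos * f for f in inv_freq]
        # Duplicate the angles so each table row spans the full head dimension.
        angles = angles + angles
        cos_cache.append([math.cos(a) for a in angles])
        sin_cache.append([math.sin(a) for a in angles])
    return cos_cache, sin_cache


def rotate_half(x):
    """Map (x1, x2) to (-x2, x1), where x1 and x2 are the two halves of x."""
    half = len(x) // 2
    return [-v for v in x[half:]] + x[:half]


def apply_rotary(x, position, cos_cache, sin_cache):
    """Rotate one head vector using the cached row for its position."""
    cos, sin = cos_cache[position], sin_cache[position]
    rotated = rotate_half(x)
    return [xi * c + ri * s for xi, ri, c, s in zip(x, rotated, cos, sin)]


if __name__ == "__main__":
    cos_cache, sin_cache = build_rotary_cache(head_dim=4, max_positions=8)
    print(apply_rotary([1.0, 2.0, 3.0, 4.0], 3, cos_cache, sin_cache))
```

Because each pair of dimensions is rotated by a plain 2D rotation, the vector's length is unchanged and position 0 leaves the input as it is, which makes the sketch easy to sanity-check.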