"example/vscode:/vscode.git/clone" did not exist on "956465c66d4487e3d65d1da3e0e2d146a3a63df9"
Update modeling_gpt_neox.py (#17575)
I'm guessing that the intention was to have the `_no_split_modules` class attribute for `GPTNeoXPreTrainedModel` to be set to `["GPTNeoXLayer"]`, akin to how its set as `["GPTJBlock"]` for `GPTJPreTrainedModel`. If this is incorrect, please feel free to just close the PR. Thanks!
Showing
Please register or sign in to comment