"vscode:/vscode.git/clone" did not exist on "7405c1c77e4637768ea0ad5d27d8a4d8d67bfb19"
Update modeling_gpt_neox.py (#17575)
I'm guessing that the intention was to have the `_no_split_modules` class attribute for `GPTNeoXPreTrainedModel` to be set to `["GPTNeoXLayer"]`, akin to how its set as `["GPTJBlock"]` for `GPTJPreTrainedModel`. If this is incorrect, please feel free to just close the PR. Thanks!
Showing
Please register or sign in to comment