Update modeling_gpt_neox.py (#17575)
I'm guessing that the intention was for the `_no_split_modules` class attribute of `GPTNeoXPreTrainedModel` to be set to `["GPTNeoXLayer"]`, akin to how it's set to `["GPTJBlock"]` for `GPTJPreTrainedModel`. If this is incorrect, please feel free to just close the PR. Thanks!
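For context, here is a minimal sketch of the pattern being described. It uses stand-in classes rather than the real `transformers` ones, so every name ending in `Stub` and the helper `no_split_classes` are hypothetical and exist only for illustration. As far as I understand, the real attribute lists the layer classes that should stay on a single device when a model is loaded with an automatic device map.

```python
# Stand-in classes for illustration only; the real ones live in transformers.
class PreTrainedModelStub:
    # Names of module classes that should not be split across devices.
    # None means the model declares no such constraint.
    _no_split_modules = None


class GPTJPreTrainedModelStub(PreTrainedModelStub):
    # Existing convention referenced in the description above.
    _no_split_modules = ["GPTJBlock"]


class GPTNeoXPreTrainedModelStub(PreTrainedModelStub):
    # The value this change proposes: the GPT-NeoX layer class name.
    _no_split_modules = ["GPTNeoXLayer"]


def no_split_classes(model_cls):
    """Return the declared no-split class names, or an empty list if unset."""
    return list(model_cls._no_split_modules or [])


if __name__ == "__main__":
    for cls in (GPTJPreTrainedModelStub, GPTNeoXPreTrainedModelStub):
        print(cls.__name__, no_split_classes(cls))
```

The point of the sketch is only that the attribute names the model's own layer class, so a value copied from another architecture would not match any module in the GPT-NeoX model.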