"...git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "132852203a02e320049457316a63cffb64968aa1"
GPTNeo: handle padded wte (#11079)
* GPTNeo: handle padded wte
* Switch to config.vocab_size
* apply review suggestion
Co-authored-by:
Suraj Patil <surajp815@gmail.com>
Showing
Please register or sign in to comment