gradient checkpointing for GPT-NeoX (#19946)
* gradient checkpointing for GPT-NeoX
* initialize gradient checkpointing flag
* must set flag before init
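
One plausible reading of the last bullet is an ordering issue: if a final setup hook can turn checkpointing on, the default value of the flag has to be assigned before that hook runs, or the default overwrites the hook's choice. The sketch below illustrates only that ordering. The classes `Base`, `FlagBeforeInit`, `FlagAfterInit`, the `post_init` hook, and the `enable_checkpointing` argument are hypothetical stand-ins, not the code from this commit or the transformers API.

```python
class Base:
    """Hypothetical stand-in for a model base class (assumption, not a real API)."""

    def __init__(self, enable_checkpointing):
        self.enable_checkpointing = enable_checkpointing

    def post_init(self):
        # Final setup hook that may switch checkpointing on.
        if self.enable_checkpointing:
            self.gradient_checkpointing = True


class FlagBeforeInit(Base):
    def __init__(self, enable_checkpointing):
        super().__init__(enable_checkpointing)
        self.gradient_checkpointing = False  # default assigned first
        self.post_init()                     # hook may then override it


class FlagAfterInit(Base):
    def __init__(self, enable_checkpointing):
        super().__init__(enable_checkpointing)
        self.post_init()
        self.gradient_checkpointing = False  # overwrites the hook's choice


good = FlagBeforeInit(enable_checkpointing=True)
bad = FlagAfterInit(enable_checkpointing=True)
print(good.gradient_checkpointing, bad.gradient_checkpointing)
```

In this sketch only `FlagBeforeInit` ends up with checkpointing enabled; `FlagAfterInit` silently resets the flag to `False` after the hook has run.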