"examples/seq2seq/vscode:/vscode.git/clone" did not exist on "a573777901e662ec2e565be312ffaeedef6effec"
Fix gradient checkpointing imagegpt (#21816)
* Fix gradient checkpointing bug in gptneox
* Fix gradient checkpointing bug in modeling_imagegpt.py
* Revert gpt neox changes
---------
Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Showing
Please register or sign in to comment