"...git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "b0513b013b10939a2b47ab94933c2cca909716a2"
[Flax] Add remat (gradient checkpointing) (#17843)
* [Flax] Add remat (gradient checkpointing) * fix variable naming in test * flip: checkpoint using a method * fix naming * fix class naming * apply PVP's suggestions from code review * make fix-copies * fix big-bird, electra, roberta * cookie-cutter * fix flax big-bird * move test to common
Showing
Please register or sign in to comment