FlaxGPTNeo (#12493)
* flax gpt neo * fix query scaling * update generation test * use flax model for test
Showing
This diff is collapsed.
Please register or sign in to comment
* flax gpt neo * fix query scaling * update generation test * use flax model for test