"src/vscode:/vscode.git/clone" did not exist on "b96c6ce1939a2c50ddc5c8a83e92f35635377785"
Add TF implementation of GPT-J (#15623)
* Initial commit * Add TFGPTJModel * Fix a forward pass * Add TFGPTJCausalLM * Add TFGPTJForSequenceClassification * Add TFGPTJForQuestionAnswering * Fix docs * Deal with TF dynamic shapes * Add Loss parents to models * Adjust split and merge heads to handle 4 and 5-dim tensors * Update outputs for @tooslow tests
Showing
This diff is collapsed.
This diff is collapsed.
Please register or sign in to comment