Commit 6d6b916f, authored by Thomas Wolf, committed by GitHub

update to BERT-large results

parent c4bfc646
@@ -206,7 +206,7 @@ Training with the previous hyper-parameters gave us the following results:
The options we list above allow to fine-tune BERT-large rather easily on GPU(s) instead of the TPU used by the original implementation.
-For example, fine-tuning BERT-large on SQuAD can be done on a server with 4 k-80 (these are pretty old now) in 18 hours. Our results are similar to the TensorFlow implementation results:
+For example, fine-tuning BERT-large on SQuAD can be done on a server with 4 k-80 (these are pretty old now) in 18 hours. Our results are similar to the TensorFlow implementation results (actually slightly higher):
```bash
{"exact_match": 84.56953642384106, "f1": 91.04028647786927}
```