A Greek version of the BERT pre-trained language model.
...
...
Future releases will also include:
* The entire corpus of Greek legislation, as published by the [National Publication Office](http://www.et.gr),
* The entire corpus of EU legislation (Greek translation), as published in [Eur-Lex](https://eur-lex.europa.eu/homepage.html?locale=en).
## Pre-training details
* We trained BERT using the official code provided in [Google BERT's GitHub repository](https://github.com/google-research/bert). We then used [Hugging Face](https://huggingface.co)'s [Transformers](https://github.com/huggingface/transformers) conversion script to convert the TF checkpoint and vocabulary into the desired format, so that the model can be loaded in two lines of code by both PyTorch and TF2 users (see the conversion sketch after this list).
* We released a model similar to the English `bert-base-uncased` model (12-layer, 768-hidden, 12-heads, 110M parameters).
* We chose to follow the same training set-up: 1 million training steps with batches of 256 sequences of length 512 and an initial learning rate of 1e-4.
* We were able to use a single Google Cloud TPU v3-8 provided for free by [TensorFlow Research Cloud (TFRC)](https://www.tensorflow.org/tfrc), while also utilizing [GCP research credits](https://edu.google.com/programs/credits/research). Huge thanks to both Google programs for supporting us!
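
For reference, below is a minimal sketch of the checkpoint-conversion step. It assumes the conversion helper bundled with the transformers library under its pre-4.x module path (later releases ship it under `transformers.models.bert`), and the file paths are placeholders for wherever the original TF checkpoint and `bert_config.json` were saved.

```python
# Minimal sketch: convert the original TF checkpoint to PyTorch weights.
# The import path below is the pre-4.x location of the conversion helper
# bundled with transformers; newer releases move it under
# transformers.models.bert. Adjust to the installed version.
from transformers.convert_bert_original_tf_checkpoint_to_pytorch import (
    convert_tf_checkpoint_to_pytorch,
)

convert_tf_checkpoint_to_pytorch(
    tf_checkpoint_path="greek_bert/model.ckpt",      # placeholder checkpoint prefix
    bert_config_file="greek_bert/bert_config.json",  # placeholder BERT config
    pytorch_dump_path="greek_bert/pytorch_model.bin",
)
```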
## Requirements
We published `bert-base-greek-uncased-v1` as part of [Hugging Face](https://huggingface.co)'s [Transformers](https://github.com/huggingface/transformers) repository, so you need to install the transformers library through pip, along with either PyTorch or TensorFlow 2.
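
Assuming a working installation, loading the model should then take only a couple of lines. The sketch below uses the model name as given in this README; on the Hugging Face model hub the identifier may carry an organization prefix, so adjust it to the published identifier.

```python
# Install first, e.g.: pip install transformers torch   (or tensorflow for TF2)
from transformers import AutoTokenizer, AutoModel

# The identifier below follows the name used in this README; on the model hub
# it may need an organization prefix, so adjust it to the published identifier.
tokenizer = AutoTokenizer.from_pretrained("bert-base-greek-uncased-v1")
model = AutoModel.from_pretrained("bert-base-greek-uncased-v1")

# TF2 users can load the same weights with TFAutoModel instead:
# from transformers import TFAutoModel
# model = TFAutoModel.from_pretrained("bert-base-greek-uncased-v1")
```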