This is a GPT-2 small checkpoint for PyTorch. It is the official `gpt2-small` fine-tuned on arXiv papers from the computational linguistics field.
## Training data
This model was trained on a subset of arXiv papers that were parsed from PDF to plain text. The resulting dataset consists of 80MB of text from the computational linguistics (cs.CL) field.
This is a GPT-2 small checkpoint for PyTorch. It is the official `gpt2-small` fine-tuned on arXiv papers from physics fields.
## Training data
This model was trained on a subset of arXiv papers that were parsed from PDF to plain text. The resulting dataset consists of 130MB of text, mostly from quantum physics (quant-ph) and other physics sub-fields.
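## Usage

As a minimal usage sketch, a checkpoint like this can be loaded with the `transformers` library and used for text generation. The repository path below is a placeholder, not the actual name of this model; substitute the correct checkpoint path or Hub identifier.

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Placeholder path: replace with the actual checkpoint directory or Hub repo id.
checkpoint = "path/to/arxiv-gpt2-checkpoint"

tokenizer = GPT2Tokenizer.from_pretrained(checkpoint)
model = GPT2LMHeadModel.from_pretrained(checkpoint)

# Prompt in the style of the fine-tuning corpus (arXiv paper text).
prompt = "In this paper, we propose"
inputs = tokenizer(prompt, return_tensors="pt")

# Sample a short continuation.
outputs = model.generate(
    **inputs,
    max_length=50,
    do_sample=True,
    top_k=50,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```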