Create camembert-base-README.md

Commit cc4c3795, authored Mar 13, 2020 by Benjamin Muller; committed by Julien Chaumond, Mar 13, 2020

Showing with 9 additions and 0 deletions

model_cards/camembert-base-README.md +9 -0

--- a/model_cards/camembert-base-README.md
+++ b/model_cards/camembert-base-README.md
+# CamemBERT 
+CamemBERT is a state-of-the-art language model for French, based on the RoBERTa architecture and pretrained on the French subcorpus of the newly available multilingual corpus OSCAR.  
+CamemBERT was originally evaluated on four downstream tasks for French: part-of-speech (POS) tagging, dependency parsing, named entity recognition (NER), and natural language inference (NLI). It improved the state of the art on most of these tasks over previous monolingual and multilingual approaches, which confirms the effectiveness of large pretrained language models for French.  
+CamemBERT was trained and evaluated by Louis Martin, Benjamin Muller, Pedro Javier Ortiz Suárez, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah and Benoît Sagot.  
+The preprint is available on arXiv: [CamemBERT: a Tasty French Language Model](https://arxiv.org/abs/1911.03894)