---
language:
- bg
- cs
- pl
- ru
---

# bert-base-bg-cs-pl-ru-cased

SlavicBERT [1] (Slavic (bg, cs, pl, ru), cased, 12-layer, 768-hidden, 12-heads, 180M parameters) was trained on Russian News and four Wikipedias: Bulgarian, Czech, Polish, and Russian. The subtoken vocabulary was built from this data. Multilingual BERT was used as the initialization for SlavicBERT.
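
A minimal usage sketch with the Hugging Face `transformers` library. The `DeepPavlov/` hub prefix is an assumption based on the model name above; adjust the model id if the checkpoint is hosted under a different organization.

```python
from transformers import AutoTokenizer, AutoModel

# Hub id assumed from the model card title; the DeepPavlov/ prefix is an assumption.
model_name = "DeepPavlov/bert-base-bg-cs-pl-ru-cased"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

# Encode a short Bulgarian sentence and extract contextual embeddings.
inputs = tokenizer("София е столицата на България.", return_tensors="pt")
outputs = model(**inputs)

# last_hidden_state has shape (batch, seq_len, 768), matching the
# 768-hidden, 12-layer architecture described above.
print(outputs.last_hidden_state.shape)
```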


[1]: Arkhipov M., Trofimova M., Kuratov Y., Sorokin A. (2019). [Tuning Multilingual Transformers for Language-Specific Named Entity Recognition](https://www.aclweb.org/anthology/W19-3712/). ACL Anthology W19-3712.