Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenpangpang
transformers
Commits
4e817ff4
Unverified
Commit
4e817ff4
authored
Apr 25, 2020
by
Txus
Committed by
GitHub
Apr 25, 2020
Browse files
Create README.md (#3966)
parent
73d6a2f9
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
25 additions
and
0 deletions
+25
-0
model_cards/codegram/calbert-base-uncased/README.md
model_cards/codegram/calbert-base-uncased/README.md
+25
-0
No files found.
model_cards/codegram/calbert-base-uncased/README.md
0 → 100644
View file @
4e817ff4
---
language
:
catalan
---
# CALBERT: a Catalan Language Model
## Introduction
CALBERT is an open-source language model for Catalan based on the ALBERT architecture.
It is now available on Hugging Face in its
`base-uncased`
version, and was pretrained on the
[
OSCAR dataset
](
https://traces1.inria.fr/oscar/
)
.
For further information or requests, please go to the
[
GitHub repository
](
https://github.com/codegram/calbert
)
## Pre-trained models
| Model | Arch. | Training data |
|-------------------------------------|------------------|-----------------------------------|
|
`codegram`
/
`calbert-base-uncased`
| Base (uncased) | OSCAR (4.3 GB of text) |
## Authors
CALBERT was trained and evaluated by
[
Txus Bach
](
https://twitter.com/txustice
)
, as part of
[
Codegram
](
https://www.codegram.com
)
's applied research.
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment