1. 05 Oct, 2020 2 commits
    • [Model card] Java Code Summarizer model (#7568) · 071970fe
      Nathan Cooper authored
      
      
      * Create README.md
      
      * Update model_cards/ncoop57/bart-base-code-summarizer-java-v0/README.md
      Co-authored-by: Julien Chaumond <chaumond@gmail.com>
    • SqueezeBERT architecture (#7083) · 02ef825b
      Forrest Iandola authored
      * configuration_squeezebert.py
      
      thin wrapper around bert tokenizer
      
      fix typos
      
      wip sb model code
      
      wip modeling_squeezebert.py. Next step is to get the multi-layer-output interface working
      
      set up squeezebert to use BertModelOutput when returning results.
      
      squeezebert documentation
      
      formatting
      
      allow head mask that is an array of [None, ..., None]
      
      docs
      
      docs cont'd
      
      path to vocab
      
      docs and pointers to cloud files (WIP)
      
      line length and indentation
      
      squeezebert model cards
      
      formatting of model cards
      
      untrack modeling_squeezebert_scratchpad.py
      
      update aws paths to vocab and config files
      
      get rid of stub of NSP code, and advise users to pretrain with mlm only
      
      fix rebase issues
      
      redo rebase of modeling_auto.py
      
      fix issues with code formatting
      
      more code format auto-fixes
      
      move squeezebert before bert in tokenization_auto.py and modeling_auto.py because squeezebert inherits from bert
      
      tests for squeezebert modeling and tokenization
      
      fix typo
      
      move squeezebert before bert in modeling_auto.py to fix inheritance problem
      
      disable test_head_masking, since squeezebert doesn't yet implement head masking
      
      fix issues exposed by the test_modeling_squeezebert.py
      
      fix an issue exposed by test_tokenization_squeezebert.py
      
      fix issue exposed by test_modeling_squeezebert.py
      
      auto generated code style improvement
      
      issue that we inherited from modeling_xxx.py: SqueezeBertForMaskedLM.forward() calls self.cls(), but there is no self.cls, and I think the goal was actually to call self.lm_head()
      
      update copyright
      
      resolve failing 'test_hidden_states_output' and remove unused encoder_hidden_states and encoder_attention_mask
      
      docs
      
      add integration test. rename squeezebert-mnli --> squeezebert/squeezebert-mnli
      
      autogenerated formatting tweaks
      
      integrate feedback from patrickvonplaten and sgugger to programming style and documentation strings
      
      * tiny change to order of imports
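The ordering fix mentioned in the SqueezeBERT commit (moving it before BERT in tokenization_auto.py and modeling_auto.py) reflects how ordered auto-mappings resolve a config class: the first matching entry wins, so a subclass listed after its parent is shadowed. A minimal, self-contained sketch of that pitfall, using stand-in classes rather than the library's actual mapping code:

```python
# Sketch of why a subclass must precede its parent in an ordered
# auto-mapping: lookup by issubclass() returns the FIRST match.

class BertConfig:
    pass

class SqueezeBertConfig(BertConfig):  # SqueezeBERT inherits from BERT
    pass

def resolve(config_cls, mapping):
    """Return the model name for the first mapping entry that matches."""
    for cfg, name in mapping:
        if issubclass(config_cls, cfg):
            return name
    raise ValueError("no match")

# Wrong order: the BertConfig entry shadows its own subclass.
wrong = [(BertConfig, "bert"), (SqueezeBertConfig, "squeezebert")]
# Right order: most-derived classes first.
right = [(SqueezeBertConfig, "squeezebert"), (BertConfig, "bert")]

print(resolve(SqueezeBertConfig, wrong))  # bert  (wrong model picked)
print(resolve(SqueezeBertConfig, right))  # squeezebert
```

Listing most-derived classes first is the same design constraint the commit describes; the BERTweet/PhoBERT commit below reorders its tokenizers for the same reason.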
  2. 01 Oct, 2020 9 commits
  3. 30 Sep, 2020 1 commit
  4. 29 Sep, 2020 1 commit
  5. 28 Sep, 2020 3 commits
  6. 25 Sep, 2020 5 commits
  7. 22 Sep, 2020 2 commits
  8. 21 Sep, 2020 5 commits
  9. 19 Sep, 2020 4 commits
  10. 18 Sep, 2020 8 commits
    • Add new pre-trained models BERTweet and PhoBERT (#6129) · af2322c7
      Dat Quoc Nguyen authored
      * Add BERTweet and PhoBERT models
      
      * Update modeling_auto.py
      
      Re-add `bart` to LM_MAPPING
      
      * Update tokenization_auto.py
      
      Re-add `from .configuration_mobilebert import MobileBertConfig`
      not sure why it's replaced by `from transformers.configuration_mobilebert import MobileBertConfig`
      
      * Add BERTweet and PhoBERT to pretrained_models.rst
      
      * Update tokenization_auto.py
      
      Remove BertweetTokenizer and PhobertTokenizer from tokenization_auto.py (they are currently not supported by AutoTokenizer).
      
      * Update BertweetTokenizer - without nltk
      
      * Update model card for BERTweet
      
      * PhoBERT - with Auto mode - without import fastBPE
      
      * PhoBERT - with Auto mode - without import fastBPE
      
      * BERTweet - with Auto mode - without import fastBPE
      
      * Add PhoBERT and BERTweet to TF modeling auto
      
      * Improve Docstrings for PhobertTokenizer and BertweetTokenizer
      
      * Update PhoBERT and BERTweet model cards
      
      * Fixed a merge conflict in tokenization_auto
      
      * Used black to reformat BERTweet- and PhoBERT-related files
      
      * Used isort to reformat BERTweet- and PhoBERT-related files
      
      * Reformatted BERTweet- and PhoBERT-related files based on flake8
      
      * Updated test files
      
      * Updated test files
      
      * Updated tf test files
      
      * Updated tf test files
      
      * Updated tf test files
      
      * Updated tf test files
      
      * Update commits from huggingface
      
      * Delete unnecessary files
      
      * Add tokenizers to auto and init files
      
      * Add test files for tokenizers
      
      * Revised model cards
      
      * Update save_vocabulary function in BertweetTokenizer and PhobertTokenizer and test files
      
      * Revised test files
      
      * Update orders of Phobert and Bertweet tokenizers in auto tokenization file
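BertweetTokenizer normalizes tweets before BPE, mapping user mentions to @USER and links to HTTPURL. A rough, self-contained sketch of that normalization step, using simple regex rules as an illustration rather than the tokenizer's actual implementation:

```python
import re

def normalize_tweet(text):
    """Rough sketch of BERTweet-style tweet normalization:
    user mentions become @USER, URLs become HTTPURL."""
    text = re.sub(r"@\w+", "@USER", text)          # @someone -> @USER
    text = re.sub(r"https?://\S+", "HTTPURL", text)  # links -> HTTPURL
    return text

print(normalize_tweet("@bob check https://example.com now"))
# @USER check HTTPURL now
```

Collapsing mentions and URLs to fixed placeholders keeps the vocabulary small and lets the model treat all handles and links uniformly, which matters for noisy Twitter text.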
    • Create README.md · 9397436e
      Patrick von Platen authored
    • Create README.md · 7eeca4d3
      Patrick von Platen authored
    • Update README.md · 31516c77
      Patrick von Platen authored
    • Update README.md · 4c14669a
      Patrick von Platen authored
    • [model_cards] · eef8d94d
      Julien Chaumond authored
      We use ISO 639-1 cc @gentaiscool
    • Create README.md · afd6a9f8
      Patrick von Platen authored
    • Create README.md · 9f1544b9
      Patrick von Platen authored