- 17 Sep, 2020 3 commits
Stas Bekman authored
Stas Bekman authored
* [model cards] ported allenai Deep Encoder, Shallow Decoder models
* typo
* fix references
* add allenai/wmt19-de-en-6-6 model cards
* fill in the missing info for the build script as provided by the searcher
Stas Bekman authored
* ready for PR
* cleanup
* correct FSMT_PRETRAINED_MODEL_ARCHIVE_LIST
* fix
* perfectionism
* revert change from another PR
* odd, already committed this one
* non-interactive upload workaround
* backup the failed experiment
* store langs in config
* workaround for localizing model path
* doc clean up as in https://github.com/huggingface/transformers/pull/6956
* style
* back out debug mode
* document: run_eval.py --num_beams 10
* remove unneeded constant
* typo
* re-use bart's Attention
* re-use EncoderLayer, DecoderLayer from bart
* refactor
* send to cuda and fp16
* cleanup
* revert (moved to another PR)
* better error message
* document run_eval --num_beams
* solve the problem of the tokenizer finding the right files when the model is local
* polish, remove hardcoded config
* add a note that the file is autogenerated to avoid losing changes
* prep for org change, remove unneeded code
* switch to model4.pt, update scores
* s/python/bash/
* missing init (but doesn't impact the finetuned model)
* cleanup
* major refactor (reuse-bart)
* new model, new expected weights
* cleanup
* cleanup
* full link
* fix model type
* merge porting notes
* style
* cleanup
* have to create a DecoderConfig object to handle vocab_size properly
* doc fix
* add note (not a public class)
* parametrize
* add bleu scores integration tests
* skip test if sacrebleu is not installed
* cache heavy models/tokenizers
* some tweaks
* remove tokens that aren't used
* more purging
* simplify code
* switch to using decoder_start_token_id
* add doc
* Revert "major refactor (reuse-bart)": reverts commit 226dad15ca6a9ef4e26178526e878e8fc5c85874
* decouple from bart
* remove unused code #1
* remove unused code #2
* remove unused code #3
* update instructions
* clean up
* move bleu eval to examples
* check import only once
* move data+gen script into files
* reuse via import
* take less space
* add prepare_seq2seq_batch (auto-tested)
* cleanup
* recode test to use json instead of yaml
* ignore keys not needed
* use the new -y in transformers-cli upload
* [xlm tok] config dict: fix str into int to match definition (#7034)
* [s2s] --eval_max_generate_length (#7018)
* Fix CI with change of name of nlp (#7054)
* nlp -> datasets
* More nlp -> datasets
* Woopsie
* More nlp -> datasets
* One last
* extending to support allen_nlp wmt models: allow a specific checkpoint file to be passed, more arg settings, scripts for allen_nlp models
* sync with changes
* s/fsmt-wmt/wmt/ in model names
* s/fsmt-wmt/wmt/ in model names (p2)
* s/fsmt-wmt/wmt/ in model names (p3)
* switch to a better checkpoint
* typo
* make non-optional args such: adjust tests where possible or skip when there is no other choice
* consistency
* style
* adjust header
* cards moved (model rename)
* use best custom hparams
* update info
* remove old cards
* cleanup
* s/stas/facebook/
* update scores
* s/allen_nlp/allenai/
* url maps aren't needed
* typo
* move all the doc / build / eval generators to their own scripts
* cleanup
* Apply suggestions from code review
* Apply suggestions from code review
* fix indent
* duplicated line
* style
* use the correct add_start_docstrings
* oops
* resizing can't be done with the core approach, due to 2 dicts
* check that the arg is a list
* style
* style

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- 16 Sep, 2020 1 commit

Antoine Louis authored
* Create README.md
* Update README.md
- 15 Sep, 2020 5 commits
Patrick von Platen authored

Sylvain Gugger authored

Pedro Lima authored
From **Language-Agnostic BERT Sentence Embedding** https://ai.googleblog.com/2020/08/language-agnostic-bert-sentence.html
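Sentence-embedding models of this family reduce token-level encoder states to one L2-normalized vector per sentence. A torch-only sketch of a common pooling recipe (masked mean pooling; LaBSE itself pools via the CLS token, and the tensors below are random stand-ins for real encoder outputs):

```python
import torch

def mean_pool(hidden_states: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    """Masked mean over the token dimension, then L2-normalize,
    as is typical for BERT-based sentence embedding models."""
    mask = attention_mask.unsqueeze(-1).float()     # (batch, seq, 1)
    summed = (hidden_states * mask).sum(dim=1)      # padding tokens contribute nothing
    counts = mask.sum(dim=1).clamp(min=1e-9)        # number of real tokens per sentence
    embeddings = summed / counts
    return torch.nn.functional.normalize(embeddings, p=2, dim=1)

# Stand-in for a BERT encoder's last_hidden_state: batch=2, seq=4, dim=8
hidden = torch.randn(2, 4, 8)
mask = torch.tensor([[1, 1, 1, 0], [1, 1, 0, 0]])
emb = mean_pool(hidden, mask)
print(emb.shape)  # torch.Size([2, 8])
```

Because the result is unit-normalized, cross-lingual similarity between two sentences is just a dot product of their vectors.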
tuner007 authored
Model card for PEGASUS finetuned for paraphrasing task
Philip May authored
* changed eval table model order
* Update install
* update mc
- 11 Sep, 2020 3 commits
李明浩 authored

李明浩 authored

Sagor Sarker authored
* added bangla-bert-base
* Apply suggestions from code review

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
- 10 Sep, 2020 8 commits
Patrick von Platen authored
- 09 Sep, 2020 1 commit
Patrick von Platen authored
- 07 Sep, 2020 5 commits
Mehrdad Farahani authored
ParsBERT v2.0 is a fine-tuned version of ParsBERT with a reconstructed vocabulary, and it can be used in additional domains. It includes these features:
- We added some unused vocabulary tokens for use in summarization and other tasks.
- We fine-tuned the model on a wide range of writing styles in the Persian language.

Abed khooli authored

Richard Bownes authored
* Create README.md
* Add some custom prompts

Co-authored-by: Julien Chaumond <chaumond@gmail.com>

Julien Chaumond authored

Julien Chaumond authored
cc @jplu
- 06 Sep, 2020 1 commit
Patrick von Platen authored
- 05 Sep, 2020 1 commit
Steven Liu authored
* create model card for astroGPT
* Hotlink to actual image file

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
- 04 Sep, 2020 1 commit
Naveenkhasyap authored
* Create Readme.MD for KanBERTo: KanBERTo language model readme for the Kannada language.
* Update model_cards/Naveen-k/KanBERTo/README.md

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
- 03 Sep, 2020 5 commits
Stefan Engl authored

David Mark Nemeskey authored

abdullaholuk-loodos authored
Fixed errors in the "Usage" section of the Loodos model cards. Also removed the "electra-base-turkish-uncased" model from S3 and re-uploaded it as "electra-base-turkish-uncased-discriminator"; its README was added. (#6921)

Co-authored-by: Abdullah Oluk <abdullaholuk123@gmail.com>

Julien Chaumond authored
cc @psorianom @rachelker
Antonio V Mendoza authored
Adding the LXMERT pretraining model (MultiModal languageXvision) to HuggingFace's suite of models (#5793)
* added template files for LXMERT and completed configuration_lxmert.py
* added modeling, tokenization, testing, and finishing touches for lxmert [yet to be tested]
* added model card for lxmert
* cleaning up lxmert code
* Update src/transformers/modeling_lxmert.py
* Update src/transformers/modeling_tf_lxmert.py
* Update src/transformers/modeling_tf_lxmert.py
* Update src/transformers/modeling_lxmert.py
* tested torch lxmert, changed documentation, updated outputs, and other small fixes
* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py
* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py
* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py
* renaming, other small issues, did not change TF code in this commit
* added lxmert question answering model in pytorch
* added capability to edit number of qa labels for lxmert
* made answer optional for lxmert question answering
* add option to return hidden_states for lxmert
* changed default qa labels for lxmert
* changed config archive path
* squashing 3 commits: merged UI + testing improvements + more UI and testing
* changed some variable names for lxmert
* TF LXMERT
* Various fixes to LXMERT
* Final touches to LXMERT
* AutoTokenizer order
* Add LXMERT to index.rst and README.md
* Merge commit test fixes + Style update
* TensorFlow 2.3.0 sequential model changes variable names; remove inherited test
* Update src/transformers/modeling_tf_pytorch_utils.py
* Update docs/source/model_doc/lxmert.rst
* Update docs/source/model_doc/lxmert.rst
* Update src/transformers/modeling_tf_lxmert.py
* added suggestions
* Fixes
* Final fixes for TF model
* Fix docs

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- 02 Sep, 2020 1 commit

David Mark Nemeskey authored
* Create README.md: Model card for huBERT.
* Update README.md: lowercase h
* Update model_cards/SZTAKI-HLT/hubert-base-cc/README.md

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
- 01 Sep, 2020 5 commits
Julien Chaumond authored

Rohan Rajpal authored

Rohan Rajpal authored

Igli Manaj authored
Fix range of possible score, add inference.

Tom Grek authored