Commits · 7fc6f41d9116b712dbd45d9550de86245a5a59c8 · chenpangpang / transformers

31 Jan, 2022 9 commits
- Add doc for add-new-model-like command (#15433) · 7fc6f41d
  Sylvain Gugger authored Jan 31, 2022
  
  7fc6f41d
- add t5 ner finetuning (#15432) · 282ae123
  Ogundepo Odunayo authored Jan 31, 2022
  
  282ae123
- [Hotfix] Fix Swin model outputs (#15414) · d4b3e56d
  NielsRogge authored Jan 31, 2022
```
* Fix Swin model outputs

* Rename pooler
```
  d4b3e56d
- import torch.utils.checkpoint (#15427) · 38dfb40a
  Suraj Patil authored Jan 31, 2022
  
  38dfb40a
- [Robust Speech Challenge] Add missing LR parameter (#15428) · f624249d
  Jonatas Grosman authored Jan 31, 2022
  
  f624249d
- Update README.md (#15430) · 3254080d
  Kamal Raj authored Jan 31, 2022
```
fix typo
```
  3254080d
- Add (M)Luke model training for Token Classification in the examples (#14880) · aa19f478
  Julien Plu authored Jan 31, 2022
```
* Add Luke training

* Fix true label tags

* Fix true label tags

* Fix true label tags

* Update the data collator for Luke

* Some training refactor for Luke

* Improve data collator for Luke

* Fix import

* Fix datasets concatenation

* Add the --max_entity_length argument for Luke models

* Remove unused code

* Fix style issues

* Fix style issues

* Move the Luke training into a separate folder

* Fix style

* Fix naming

* Fix filtering

* Fix filtering

* Fix filter

* Update some preprocessing

* Move luke to research_projects

* Checkstyle

* Address comments

* Fix style
```
  aa19f478
- Fix additional DataTrainingArguments documentation (#15408) · 0094eba3
  François REMY authored Jan 31, 2022
```
(This is an editorial change only)
```
  0094eba3
- Add SegformerFeatureExtractor to Auto API (#15410) · ee5de663
  NielsRogge authored Jan 31, 2022
  
  ee5de663
30 Jan, 2022 1 commit
- [XGLMTokenizer] fix init and add in AutoTokenizer (#15406) · 0f69b924
  Suraj Patil authored Jan 30, 2022
  
  0f69b924
29 Jan, 2022 4 commits

Fix the inconsistency of loss calculation between PT/TF XLNetLMHeadModel (#15298) · f380bf2b

Yih-Dar authored Jan 29, 2022



* Fix the inconsistency of loss calculation between PT/TF XLNetLMHeadModel

* overwrite test_loss_computation
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f380bf2b

Add support for XLM-R XL and XXL models by modeling_xlm_roberta_xl.py (#13727) · e09473a8

Soonhwan-Kwon authored Jan 29, 2022



* add xlm roberta xl

* add convert xlm xl fairseq checkpoint to pytorch

* fix init and documents for xlm-roberta-xl

* fix indentation

* add test for XLM-R xl,xxl

* fix model hub name

* fix some stuff

* up

* correct init

* fix more

* fix as suggestions

* add torch_device

* fix default values of doc strings

* fix leftovers

* merge to master

* up

* correct hub names

* fix docs

* fix model

* up

* finalize

* last fix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add copied from

* make style
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e09473a8

Get started docs (#15098) · 16d4acbf

Steven Liu authored Jan 28, 2022

* clean commit of changes

* apply review feedback, make edits

* fix backticks, minor formatting

* 🖍 make fixup and minor edits

* 🖍 fix # in header

* 📝 update code sample without from_pt

* 📝 final review

16d4acbf

Update model share tutorial (#15288) · cabd6d26

Steven Liu authored Jan 28, 2022

* add model sharing tutorial

* 🖍 apply feedback from review

* 📝 make edits

* 🖍 fix formatting

* 📝 convert from pt checkpoint to flax

* 📝 final review

cabd6d26

28 Jan, 2022 11 commits

Use argument for preprocessing workers in run_summarization (#15394) · c98a6ac2
Sylvain Gugger authored Jan 28, 2022

c98a6ac2

Fix missing eps arg for LayerNorm in ElectraGeneratorPredictions (#15332) · db079567

Yih-Dar authored Jan 29, 2022



* fix missing eps

* Same fix for ConvBertGeneratorPredictions

* Same fix for AlbertMLMHead
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

db079567

[deepspeed] saving checkpoint fallback when fp16 weights aren't saved (#14948) · 297602c7

Stas Bekman authored Jan 28, 2022



* [deepspeed] saving checkpoint fallback when fp16 weights aren't saved

* Bump required deepspeed version to match usage when saving checkpoints

* update version
Co-authored-by: Mihai Balint <balint.mihai@gmail.com>

297602c7

Add XGLM models (#14876) · d25e25ee

Suraj Patil authored Jan 28, 2022



* add xglm

* update vocab size

* fix model name

* style and tokenizer

* typo

* no mask token

* fix pos embed compute

* fix args

* fix tokenizer

* fix positions

* fix tokenization

* style and dic fixes

* fix imports

* add fast tokenizer

* update names

* add pt tests

* fix tokenizer

* fix typo

* fix tokenizer import

* fix fast tokenizer

* fix tokenizer

* fix converter

* add tokenizer test

* update checkpoint names

* fix tokenizer tests

* fix slow tests

* add copied from comments

* rst -> mdx

* flax model

* update flax tests

* quality

* style

* doc

* update index and readme

* fix copies

* fix doc

* update toctree

* fix indent

* minor fixes

* fix config doc

* don't save embed_pos weights

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* address Sylvain's comments, few doc fixes

* fix check_repo

* align order of arguments

* fix copies

* fix labels

* remove unnecessary mapping

* fix saving tokenizer
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d25e25ee

Make links explicit (#15395) · b6b79faa

Matt authored Jan 28, 2022

* Make links explicit

* Removing reference to compute_metrics() since it's kind of PyTorch-specific

b6b79faa

fix wrong tokenizer checkpoint name in flax marian (#15391) · 6df29ba5
Yih-Dar authored Jan 28, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
6df29ba5
Prepare deprecated ONNX exporter for torch v1.11 (#15388) · 507601a5
lewtun authored Jan 28, 2022
```
* Prepare deprecated ONNX exporter for PyTorch v1.11

* Add deprecation warning
```
507601a5
[docs] fix wrong file name in `pr_check` (#15380) · 4996922b
Ngo Quang Huy authored Jan 28, 2022

4996922b

Fix `bad_words_ids` not working with sentencepiece-based tokenizers (#15343) · 8f5d62fd

Ngo Quang Huy authored Jan 28, 2022



* Fix `bad_words_ids` not working with sentencepiece-based tokenizers

* make style
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

8f5d62fd

Fixing support for `batch_size` and `num_return_sequences` in `text-generation` pipeline (#15318) · 06107541

Nicolas Patry authored Jan 28, 2022

* Fixing support for `batch_size` and `num_return_sequences` in
`text-generation` pipeline

And `text2text-generation` too.

The bug was caused by the batch_size containing both the incoming batch
**and** the generated `num_return_sequences`.

The fix simply consists of splitting these two back into
different dimensions.

* TF support.

* Odd backward compatibility script in the way.
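
The splitting described in this commit message can be sketched in plain Python. This is only an illustration, not the pipeline's actual code: the helper name `split_generated_batch` is hypothetical, and it assumes the generated sequences arrive as one flat list ordered input by input (all outputs for the first input, then all outputs for the second, and so on).

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def split_generated_batch(flat_outputs: Sequence[T], input_batch_size: int) -> List[List[T]]:
    """Group a flat list of generated sequences by the input that produced them.

    Hypothetical helper for illustration only. It assumes outputs are ordered
    input by input, so the flat length is input_batch_size * sequences_per_input.
    """
    if input_batch_size <= 0:
        raise ValueError("input_batch_size must be positive")
    if len(flat_outputs) % input_batch_size != 0:
        raise ValueError("number of outputs is not a multiple of the input batch size")

    # Recover the second dimension: how many sequences were generated per input.
    per_input = len(flat_outputs) // input_batch_size

    # Slice the flat list into one group per input.
    return [
        list(flat_outputs[i * per_input:(i + 1) * per_input])
        for i in range(input_batch_size)
    ]


# Two inputs ("a" and "b"), three generated sequences each.
flat = ["a0", "a1", "a2", "b0", "b1", "b2"]
grouped = split_generated_batch(flat, input_batch_size=2)
print(grouped)
```

With tensors, the same idea would be a reshape from `(inputs * sequences_per_input, ...)` to `(inputs, sequences_per_input, ...)`.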

06107541

Set syncfree AdamW as the default optimizer for xla:gpu device in amp mode (#15361) · c4d1fd77
Yanming Wang authored Jan 27, 2022
```
* Use syncfree AdamW for xla:gpu device by default

* Make syncfree AdamW optional
```
c4d1fd77

27 Jan, 2022 15 commits

Add init to BORT (#15378) · 2e4559fa
Lysandre Debut authored Jan 27, 2022
```
* Add init to BORT

* BORT should be in init
```
2e4559fa

Fix code format for Accelerate doc (#15335) · f5db6ce7

Steven Liu authored Jan 27, 2022

* 🖍 fix code syntax to external libraries and replace image

* 🔄 revert code formatting, replace image with code block

* 🖍 apply feedback

f5db6ce7

Allow relative imports in dynamic code (#15352) · 0b072304
Sylvain Gugger authored Jan 27, 2022
```
* Allow dynamic modules to use relative imports

* Add tests

* Add one last test

* Changes
```
0b072304

Bump numpy from 1.19.2 to 1.21.0 in /examples/research_projects/lxmert (#15369) · 628b59e5

dependabot[bot] authored Jan 27, 2022

Bumps [numpy](https://github.com/numpy/numpy) from 1.19.2 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt)
- [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0)

---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

628b59e5

Bump notebook in /examples/research_projects/visual_bert (#15368) · ca0848b2

dependabot[bot] authored Jan 27, 2022

Bumps [notebook](http://jupyter.org) from 6.1.5 to 6.4.1.

---
updated-dependencies:
- dependency-name: notebook
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

ca0848b2

Bump numpy in /examples/research_projects/visual_bert (#15367) · 7d45a2e8

dependabot[bot] authored Jan 27, 2022

Bumps [numpy](https://github.com/numpy/numpy) from 1.19.2 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt)
- [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0)

---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

7d45a2e8

Fix tests_fetcher (#15376) · a81fd355
Sylvain Gugger authored Jan 27, 2022

a81fd355
Docs for version v4.16.0 · eab33810
Lysandre authored Jan 27, 2022

eab33810
Release: v4.16.0 · f87db5e4
Lysandre authored Jan 27, 2022

f87db5e4
Example script for PushToHubCallback (#15375) · c4374928
Matt authored Jan 27, 2022
```
* Example script for PushToHubCallback

* Expanding description slightly
```
c4374928
Add proper documentation for Keras callbacks (#15374) · 8f6454bf
Sylvain Gugger authored Jan 27, 2022
```
* Add proper documentation for Keras callbacks

* Add dummies
```
8f6454bf
Super-small fix stops us confusing Keras console logging by modifying its logs (#15373) · 2de90bee
Matt authored Jan 27, 2022

2de90bee

Implement fixes for TrainingArguments doc (#15370) · fa6dce25

Sylvain Gugger authored Jan 27, 2022


Co-authored-by: osanseviero <osanseviero@gmail.com>

fa6dce25

improve saving strategy of sentencepiece tokenizer (#15328) · ade7371a

SaulLu authored Jan 27, 2022



* add new test

* add a feature to save the sentencepiece tokenizer model when the init file was deleted

* update marian

* update m2m_100

* fix marian

* update speech to text

* override test for layoutxlm

* fix saving bartpho

* remove hardcoded values bartpho

* special token string version

* finish bartpho

* override layoutxlm test

* add mbart

* move special tokens list

* format

* Revert "format"

This reverts commit 37a40df37903a932c2f951cbd33acb684246bae7.

* simplify list of string of special tokens

* Re-write `self.fairseq_tokens_to_ids` initialization logic with special tokens
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

ade7371a

Add a device argument to the eval script (#15371) · 196cce6e
Anton Lozhkov authored Jan 27, 2022
```
* Device argument for the eval script

* Default to none

* isort
```
196cce6e