Commits · e1c02e018c09a6ddb6d5465d6dff0f3816aa8501 · chenpangpang / transformers

05 Apr, 2021 9 commits

Add example for registering callbacks with trainers (#10928) · e1c02e01

Amala Deshmukh authored Apr 05, 2021

* Add example for callback registry

Resolves: #9036

* Update callback registry documentation

* Added comments for other ways to register callback

e1c02e01

Documentation about loading a fast tokenizer within Transformers (#11029) · 9f4e0c23

Lysandre Debut authored Apr 05, 2021



* Documentation about loading a fast tokenizer within Transformers

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

9f4e0c23

Refactor AutoModel classes and add Flax Auto classes (#11027) · 6c25f522

Sylvain Gugger authored Apr 05, 2021



* Refactor AutoModel classes and add Flax Auto classes

* Add new objects to the init

* Fix hubconf and sort models

* Fix TF tests

* Missing coma

* Update src/transformers/models/auto/auto_factory.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Fix init

* Fix dummies

* Other init to fix
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

6c25f522

Some models have no tokenizers (#11064) · eb3479e7
Lysandre Debut authored Apr 05, 2021

eb3479e7
Remove unnecessary space (#11060) · 773e4c72
Lysandre Debut authored Apr 05, 2021

773e4c72
Pin docutils (#11062) · ef62f038
Lysandre Debut authored Apr 05, 2021
```
* Pin docutils

* Versions table
```
ef62f038
[doc] update code-block rendering (#11053) · 6e310141
Eren Şahin authored Apr 05, 2021
```
double : prevents code-block section to be rendered, so made it single :
```
6e310141
s|Pretrained|PreTrained| (#11048) · 3d39226a
Stas Bekman authored Apr 04, 2021

3d39226a
Add a script to check inits are consistent (#11024) · b0d49fd5
Sylvain Gugger authored Apr 04, 2021

b0d49fd5

02 Apr, 2021 1 commit
- fixed typo: logging instead of logger (#11025) · 335c0ca3
  versis authored Apr 02, 2021
  
  335c0ca3
01 Apr, 2021 7 commits

added new notebook and merge of trainer (#11015) · 34e1bec6

Philipp Schmid authored Apr 01, 2021



* added new notebook and merge of trainer

* Update docs/source/sagemaker.md
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

34e1bec6

[doc] no more bucket · e8da77d1
Julien Chaumond authored Apr 01, 2021

e8da77d1
minor typo fix · f4ad3d8c
Joe Davison authored Apr 01, 2021
```
*negative* log-likelihood
```
f4ad3d8c

DebertaTokenizer Rework closes #10258 (#10703) · 57c1749e

cronoik authored Apr 01, 2021



* closes #10258

* typo

* reworked deberta test

* implemented the comments from BigBird01 regarding sequence pair encoding of deberta

* Update style

* VOCAB_FILES_NAMES is now a oneliner as suggested by @sgugger
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* added #fmt: on as requested by @sgugger

* Style
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

57c1749e

Add Vision Transformer and ViTFeatureExtractor (#10950) · 30677dc7

NielsRogge authored Apr 01, 2021



* Squash all commits into one

* Update ViTFeatureExtractor to use image_utils instead of torchvision

* Remove torchvision and add Pillow

* Small docs improvement

* Address most comments by @sgugger

* Fix tests

* Clean up conversion script

* Pooler first draft

* Fix quality

* Improve conversion script

* Make style and quality

* Make fix-copies

* Minor docs improvements

* Should use fix-copies instead of manual handling

* Revert "Should use fix-copies instead of manual handling"

This reverts commit fd4e591bce4496d41406425c82606a8fdaf8a50b.

* Place ViT in alphabetical order
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

30677dc7

Improve the speed of adding tokens from added_tokens.json (#10780) · af673222
cchen-dialpad authored Apr 01, 2021
```
* use bisect to add one token to unique_no_split_tokens

* fix style
```
af673222

Fix Adafactor documentation (recommend correct settings) (#10526) · c301c263

Josh authored Mar 31, 2021



* Update optimization.py

Fix documentation to reflect optimal settings for Adafactor

* update and expand on the recommendations

* style

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* flip scale_parameter to True for the 2nd recommendatoin
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c301c263

31 Mar, 2021 13 commits

Add `examples/language_modeling/run_mlm_no_trainer.py` (#11001) · 838f83d8

Hemil Desai authored Apr 01, 2021



* Add initial script for finetuning MLM models with accelerate

* Add evaluation metric calculation

* Fix bugs

* Use no_grad on evaluation

* update script docstring

* Update examples/language-modeling/run_mlm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* PR feedback

* Fix CI failure

* Update examples/language-modeling/run_mlm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

838f83d8

Update training_args.py (#11000) · 455f8171
JohnnyC08 authored Mar 31, 2021
```
In the group by length documentation length is misspelled as legnth
```
455f8171
add blog to docs (#10997) · 01068abd
Patrick von Platen authored Mar 31, 2021

01068abd

Merge trainers (#10975) · cd56f3fe

Sylvain Gugger authored Mar 31, 2021



* Replace is_sagemaker_distributed_available

* Merge SageMakerTrainer into Trainer

* Test with shorter condition

* Put back deleted line

* Deprecate SageMakerTrainer and SageMakerTrainingArguments

* Apply suggestions from code review
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>

cd56f3fe

add notebook (#10995) · b6dddda4
Patrick von Platen authored Mar 31, 2021

b6dddda4

Enforce string-formatting with f-strings (#10980) · acc3bd9d

Sylvain Gugger authored Mar 31, 2021



* First third

* Styling and fix mistake

* Quality

* All the rest

* Treat %s and %d

* typo

* Missing )

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

acc3bd9d

Add more metadata to the user agent (#10972) · d0b3797a

Sylvain Gugger authored Mar 31, 2021

* Add more metadata to the user agent

* Fix typo

* Use DISABLE_TELEMETRY

* Address review comments

* Use global env

* Add clean envs on circle CI

d0b3797a

fix example in config (#10993) · a8549bdd
Suraj Patil authored Mar 31, 2021

a8549bdd
GPT Neo configuration needs to be set to use GPT2 tokenizer (#10992) · a96edb85
Lysandre Debut authored Mar 31, 2021

a96edb85
Fix the checkpoint for I-BERT (#10994) · bf0840ac
Lysandre Debut authored Mar 31, 2021

bf0840ac
Sagemaker test fix (#10987) · ced7284a
Philipp Schmid authored Mar 31, 2021
```
* wrong makefile command

* ddp test fix
```
ced7284a

Fixed some typos and removed legacy url (#10989) · 645f45c4

WybeKoper authored Mar 31, 2021



* Fixed typos

* Removed legacy colab notebook from readme
Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>

645f45c4

[Flax] Add other BERT classes (#10977) · e87505f3

Patrick von Platen authored Mar 31, 2021

* add first code structures

* add all bert models

* add to init and docs

* correct docs

* make style

e87505f3

30 Mar, 2021 10 commits

fix md file to avoid evaluation crash (#10962) · e031162a
Yih-Dar authored Mar 30, 2021

e031162a
[examples/s2s] added py7zr dep (#10971) · 3e09d813
Philipp Schmid authored Mar 30, 2021
```
* added py7zr

* comment out check_min for sagemaker test

* added min version again
```
3e09d813

Fixed a bug where the `pipeline.framework` would actually contain (#10970) · c32b432a

Nicolas Patry authored Mar 30, 2021

a fully qualified model.

We simply forgot to change the call for this one when this landed:
https://github.com/huggingface/transformers/pull/10888

It's odd that tests didn't catch that. Should we add some ?
(It's a pretty edgy test case, but it does run within the API).

c32b432a

improved sagemaker documentation for git_config and examples (#10966) · e3c8443f
Philipp Schmid authored Mar 30, 2021
```
* improved branch usage

* fixed grammar and comma
```
e3c8443f
GPT Neo few fixes (#10968) · 83d38c9f
Suraj Patil authored Mar 30, 2021
```
* fix checkpoint names

* auto model

* fix doc
```
83d38c9f
fix big bird gpu test (#10967) · 7772ddb4
Patrick von Platen authored Mar 30, 2021

7772ddb4

GPT Neo (#10848) · 86026437

Suraj Patil authored Mar 30, 2021



* lets begin

* boom boom

* fix out proj in attn

* fix attention

* fix local attention

* add tokenizer

* fix imports

* autotokenizer

* fix checkpoint name

* cleanup

* more clean-up

* more cleanup

* output attentions

* fix attn mask creation

* fix imports

* config doc

* add tests

* add slow tests

* quality

* add conversion script

* copyright

* typo

* another bites the dust

* fix attention tests

* doc

* add embed init in convert function

* fix copies

* remove tokenizer

* enable caching

* address review comments

* improve config and create attn layer list internally

* more consistent naming

* init hf config from mesh-tf config json file

* remove neo tokenizer from doc

* handle attention_mask in local attn layer

* attn_layers => attention_layers

* add tokenizer_class in config

* fix docstring

* raise if len of attention_layers is not same as num_layers

* remove tokenizer_class from config

* more consistent naming

* fix doc

* fix checkpoint names

* fp16 compat

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

86026437

Fix summarization notebook link (#10959) · a04eb8d3
Philipp Schmid authored Mar 30, 2021

a04eb8d3

[WIP][Flax] Add general conversion script (#10809) · 8780caa3

Patrick von Platen authored Mar 30, 2021



* save intermediate

* finish first version

* delete some more

* improve import

* fix roberta

* Update src/transformers/modeling_flax_pytorch_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_flax_pytorch_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* small corrections

* apply all comments

* fix deterministic

* make fix-copies
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8780caa3

Sagemaker test (#10925) · 604c0850

Philipp Schmid authored Mar 30, 2021

* init

* first working test

* added todo for setup.py

* working test for single node multi node ddp and smd

* added tensorflow single node test

* added directory for pytorch and tensorflow due to different requirements.txt

* added directory for pytorch and tensorflow

* added comment for run_glue until it is available

* added output_dir to it

* smaller dataset to make test running faster

* adjust HP and script

* adjusted parameter for tensorflow

* refactored test scripts

* adjusted make file

* init

* first working test

* added todo for setup.py

* working test for single node multi node ddp and smd

* added tensorflow single node test

* added directory for pytorch and tensorflow due to different requirements.txt

* added directory for pytorch and tensorflow

* added comment for run_glue until it is available

* added output_dir to it

* smaller dataset to make test running faster

* adjust HP and script

* adjusted parameter for tensorflow

* refactored test scripts

* adjusted make file

* updated dlc container

* commented in all tests

* added both ecr images

* added new master branches

* debug

* added new datasets version

* init

* strange rebase bug

* removed changes

* changed min version for tests to work

* updated DLC

* added model parallel test

* removed test files

* removed test files

* tested with ned dlc

* added correct sagemaker sdk version

* adjust DLCs for official one

* reworked tests

* quality

* removed default profile added documentation to it

* added step in release for sagemaker tests

* reverted version for example script removed duplicated script and added install from master to requirements.txt

* removed mistaken .DS_Stores from mac

* fixed tests

* added Sylvains feedback

* make style

* added lysandre's feedback

604c0850