Commits · 403d530eec105c0e229fc2b754afdf77a4439def · chenpangpang / transformers

06 Apr, 2021 5 commits
- Auto feature extractor (#11097) · 403d530e
  Sylvain Gugger authored Apr 06, 2021
```
* AutoFeatureExtractor

* Init and first tests

* Tests

* Damn you gitignore

* Quality

* Defensive test for when not all backends are here

* Use pattern for Speech2Text models
```
  403d530e
- [doc] gpt-neo (#11098) · 520198f5
  Stas Bekman authored Apr 06, 2021
```
make the example work
```
  520198f5
- Development on v4.6.0dev0 · 9853c5dd
  Lysandre authored Apr 06, 2021
  
  9853c5dd
- added social thumbnail for docs (#11083) · b219d6b5
  Philipp Schmid authored Apr 06, 2021
  
  b219d6b5
- Link to new blog · 6c1bee7d
  Sylvain Gugger authored Apr 06, 2021
  
  6c1bee7d
05 Apr, 2021 5 commits

Add example for registering callbacks with trainers (#10928) · e1c02e01

Amala Deshmukh authored Apr 05, 2021

* Add example for callback registry

Resolves: #9036

* Update callback registry documentation

* Added comments for other ways to register callback

e1c02e01

Documentation about loading a fast tokenizer within Transformers (#11029) · 9f4e0c23

Lysandre Debut authored Apr 05, 2021



* Documentation about loading a fast tokenizer within Transformers

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

9f4e0c23

Refactor AutoModel classes and add Flax Auto classes (#11027) · 6c25f522

Sylvain Gugger authored Apr 05, 2021



* Refactor AutoModel classes and add Flax Auto classes

* Add new objects to the init

* Fix hubconf and sort models

* Fix TF tests

* Missing coma

* Update src/transformers/models/auto/auto_factory.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Fix init

* Fix dummies

* Other init to fix
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

6c25f522

Remove unnecessary space (#11060) · 773e4c72
Lysandre Debut authored Apr 05, 2021

773e4c72
[doc] update code-block rendering (#11053) · 6e310141
Eren Şahin authored Apr 05, 2021
```
double : prevents code-block section to be rendered, so made it single :
```
6e310141

01 Apr, 2021 4 commits

added new notebook and merge of trainer (#11015) · 34e1bec6

Philipp Schmid authored Apr 01, 2021



* added new notebook and merge of trainer

* Update docs/source/sagemaker.md
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

34e1bec6

[doc] no more bucket · e8da77d1
Julien Chaumond authored Apr 01, 2021

e8da77d1
minor typo fix · f4ad3d8c
Joe Davison authored Apr 01, 2021
```
*negative* log-likelihood
```
f4ad3d8c

Add Vision Transformer and ViTFeatureExtractor (#10950) · 30677dc7

NielsRogge authored Apr 01, 2021



* Squash all commits into one

* Update ViTFeatureExtractor to use image_utils instead of torchvision

* Remove torchvision and add Pillow

* Small docs improvement

* Address most comments by @sgugger

* Fix tests

* Clean up conversion script

* Pooler first draft

* Fix quality

* Improve conversion script

* Make style and quality

* Make fix-copies

* Minor docs improvements

* Should use fix-copies instead of manual handling

* Revert "Should use fix-copies instead of manual handling"

This reverts commit fd4e591bce4496d41406425c82606a8fdaf8a50b.

* Place ViT in alphabetical order
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

30677dc7

31 Mar, 2021 3 commits
- add blog to docs (#10997) · 01068abd
  Patrick von Platen authored Mar 31, 2021
  
  01068abd
- add notebook (#10995) · b6dddda4
  Patrick von Platen authored Mar 31, 2021
  
  b6dddda4
- [Flax] Add other BERT classes (#10977) · e87505f3
  Patrick von Platen authored Mar 31, 2021
```
* add first code structures

* add all bert models

* add to init and docs

* correct docs

* make style
```
  e87505f3
30 Mar, 2021 4 commits

improved sagemaker documentation for git_config and examples (#10966) · e3c8443f
Philipp Schmid authored Mar 30, 2021
```
* improved branch usage

* fixed grammar and comma
```
e3c8443f
GPT Neo few fixes (#10968) · 83d38c9f
Suraj Patil authored Mar 30, 2021
```
* fix checkpoint names

* auto model

* fix doc
```
83d38c9f

GPT Neo (#10848) · 86026437

Suraj Patil authored Mar 30, 2021



* lets begin

* boom boom

* fix out proj in attn

* fix attention

* fix local attention

* add tokenizer

* fix imports

* autotokenizer

* fix checkpoint name

* cleanup

* more clean-up

* more cleanup

* output attentions

* fix attn mask creation

* fix imports

* config doc

* add tests

* add slow tests

* quality

* add conversion script

* copyright

* typo

* another bites the dust

* fix attention tests

* doc

* add embed init in convert function

* fix copies

* remove tokenizer

* enable caching

* address review comments

* improve config and create attn layer list internally

* more consistent naming

* init hf config from mesh-tf config json file

* remove neo tokenizer from doc

* handle attention_mask in local attn layer

* attn_layers => attention_layers

* add tokenizer_class in config

* fix docstring

* raise if len of attention_layers is not same as num_layers

* remove tokenizer_class from config

* more consistent naming

* fix doc

* fix checkpoint names

* fp16 compat

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

86026437

BigBird (#10183) · 6dfd0272

Vasudev Gupta authored Mar 30, 2021



* init bigbird

* model.__init__ working, conversion script ready, config updated

* add conversion script

* BigBirdEmbeddings working :)

* slightly update conversion script

* BigBirdAttention working :) ; some bug in layer.output.dense

* add debugger-notebook

* forward() working for BigBirdModel :) ; replaced gelu with gelu_fast

* tf code adapted to torch till rand_attn in bigbird_block_sparse_attention ; till now everything working :)

* BigBirdModel working in block-sparse attention mode :)

* add BigBirdForPreTraining

* small fix

* add tokenizer for BigBirdModel

* fix config & hence modeling

* fix base prefix

* init testing

* init tokenizer test

* pos_embed must be absolute, attn_type=original_full when add_cross_attn=True , nsp loss is optional in BigBirdForPreTraining, add assert statements

* remove position_embedding_type arg

* complete normal tests

* add comments to block sparse attention

* add attn_probs for sliding & global tokens

* create fn for block sparse attn mask creation

* add special tests

* restore pos embed arg

* minor fix

* attn probs update

* make big bird fully gpu friendly

* fix tests

* remove pruning

* correct tokenzier & minor fixes

* update conversion script , remove norm_type

* tokenizer-inference test add

* remove extra comments

* add docs

* save intermediate

* finish trivia_qa conversion

* small update to forward

* correct qa and layer

* better error message

* BigBird QA ready

* fix rebased

* add triva-qa debugger notebook

* qa setup

* fixed till embeddings

* some issue in q/k/v_layer

* fix bug in conversion-script

* fixed till self-attn

* qa fixed except layer norm

* add qa end2end test

* fix gradient ckpting ; other qa test

* speed-up big bird a bit

* hub_id=google

* clean up

* make quality

* speed up einsum with bmm

* finish perf improvements for big bird

* remove wav2vec2 tok

* fix tokenizer

* include docs

* correct docs

* add helper to auto pad block size

* make style

* remove fast tokenizer for now

* fix some

* add pad test

* finish

* fix some bugs

* fix another bug

* fix buffer tokens

* fix comment and merge from master

* add comments

* make style

* commit some suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Fix typos

* fix some more suggestions

* add another patch
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix copies

* another path
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* update

* update nit suggestions

* make style
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

6dfd0272

29 Mar, 2021 1 commit

Instantiate model only once in pipeline (#10888) · 06a6fea7

Sylvain Gugger authored Mar 29, 2021



* Instantiate model only once in pipeline

* Remove documentation of deprecated method

* Add FutureWarning

* Update src/transformers/pipelines/base.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

06a6fea7

26 Mar, 2021 2 commits

Add ImageFeatureExtractionMixin (#10905) · b0595d33

Sylvain Gugger authored Mar 26, 2021

* Add ImageFeatureExtractionMixin

* Add dummy vision objects

* Add require_vision

* Add tests

* Fix test

b0595d33

Rename NLP library to Datasets library (#10920) · 4b2b50aa
Tomy Hsieh authored Mar 26, 2021
```
* Rename NLP library to Datasets library

* Update github template

* Fix styling
```
4b2b50aa

25 Mar, 2021 2 commits

Layout lm tf 2 (#10636) · 4684bfc7

Amir Tahmasbi authored Mar 25, 2021



* Added embeddings layer

* Added layoutlm layers, main model, maskedlm and token classification classes

* Added model classes to tf auto models

* Added model to PT to TF conversion script

* Added model to doc README

* Added tests

* Removed unused imports

* Added layoutlm model, test, and doc for sequence classification, and fix imports in __init__.py

* Made tests pass!

* Fixed typos in imports and docs

* Fixed a typo in embeddings layer

* Removed imports

* Fixed formatting issues, imports, tests

* Added layoutlm layers, main model, maskedlm and token classification classes

* Added model classes to tf auto models

* Added model to PT to TF conversion script

* Removed unused imports

* Added layoutlm model, test, and doc for sequence classification, and fix imports in __init__.py

* Made tests pass!

* Fixed typos in imports and docs

* Removed imports

* Fixed small formatting issues

* Removed duplicates import from main __init__.py

* Chnaged deafult arg to true for adding  pooling layer to tf layoutlm

* Fixed formatting issues

* Style

* Added copied from to classes copied from bert

* Fixed doc strings examples to work with layoutlm inputs

* Removed PyTorch reference in doc strings example

* Added integration tests

* Cleaned up initialization file

* Updated model checkpoint identifiers

* Fixed imports
Co-authored-by: Amir Tahmasbi <amir@ehsai.ca>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

4684bfc7

make local setup more clearer and added missing links (#10899) · 1a3e0c4f
Philipp Schmid authored Mar 25, 2021

1a3e0c4f

24 Mar, 2021 1 commit
- Add notebook on fine-tuning Bart (#10883) · 1f5ea9e0
  Eliza Szczechla authored Mar 24, 2021
```
Co-authored-by: Eliza <eliza@habanero.tiger.com.pl>
```
  1f5ea9e0
23 Mar, 2021 1 commit

Amazon SageMaker Documentation (#10867) · 77ffd5ed

Philipp Schmid authored Mar 23, 2021

* added finished documentation

* changed version from 1.6 to 1.6.0 for distributed

* updated versions

* updated urls

77ffd5ed

22 Mar, 2021 1 commit
- [Generate] Add save mode logits processor to remove nans and infs if necessary (#10769) · 77bf3fe7
  Patrick von Platen authored Mar 23, 2021
```
* push

* finish

* finish

* make fix copies

* change name
```
  77bf3fe7
21 Mar, 2021 1 commit

Add new community notebook - wav2vec2 with GPT (#10794) · be87b842

Eric Lam authored Mar 21, 2021



* Add new community notebook - wav2vec2 with GPT

* Update:community.md, new nb add
* feat: notebook of wav2vec xlsr ctc decoding with gpt logit adjustment
* Update: Wav2vec2 CTC decoding with gpt2 adjustment

* Update docs/source/community.md
Co-authored-by: Suraj Patil <surajp815@gmail.com>

be87b842

18 Mar, 2021 1 commit
- Document v4.4.2 · dcebe254
  Sylvain Gugger authored Mar 18, 2021
  
  dcebe254
17 Mar, 2021 1 commit

[doc] [testing] extend the pytest -k section with more examples (#10761) · 8715d20c

Stas Bekman authored Mar 17, 2021

* [doc] [testing] extend -k section

This PR adds more examples on using `pytest -k` - I always forget that I want to use `-k A OR B` when I want several tests - I keep trying AND and it doesn't match any.

* style

8715d20c

16 Mar, 2021 6 commits

[Deepspeed] Allow HF optimizer and scheduler to be passed to deepspeed (#10464) · c83fbc5f

Cheng Li authored Mar 16, 2021



* pass hf optimizer and scheduler to deepspeed if not specified in ds config

* pass hf optimizer and scheduler to deepspeed if not specified in ds config

* update

* make init_deepspeed support config dict

* fix docstring formatting

* clean up trainer's comments

* add new tests

* fix type

* composit argparse doesn't work

* style

* add a new test, rename others

* document new functionality

* complete tests, add docs

* style

* correct level

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add new methods to the doc

* must tell DS we are using a non-native optimizer

* add protection against cpu_offload + HF optimizer combo

* fix the cli overrides

* sync docs + tests

* restore AdamW

* better docs

* need new version

* no longer needed

* remove outdate information

* refactor duplicated code
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c83fbc5f

Docs for v4.4.1 · 73fe4089
Lysandre authored Mar 16, 2021

73fe4089
Development on v4.5.0dev0 · 1b5ce1e6
Lysandre authored Mar 16, 2021

1b5ce1e6
Release v4.4.0 · c988db5a
Lysandre authored Mar 16, 2021

c988db5a
fix M2M100 example (#10745) · d3d388b9
Suraj Patil authored Mar 16, 2021

d3d388b9
Fix S2T example (#10741) · 5dcc08f1
Lysandre Debut authored Mar 16, 2021

5dcc08f1

15 Mar, 2021 1 commit

split seq2seq script into summarization & translation (#10611) · 6f840990

Théo Matussière authored Mar 15, 2021



* split seq2seq script, update docs

* needless diff

* fix readme

* remove test diff

* s/summarization/translation
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* cr

* fix arguments & better mbart/t5 refs

* copyright
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* reword readme
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* s/summarization/translation

* short script names

* fix tests

* fix isort, include mbart doc

* delete old script, update tests

* automate source prefix

* automate source prefix for translation

* s/translation/trans
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* fix script name (short version)

* typos
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* exact parameter
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* remove superfluous source_prefix calls in docs

* rename scripts & warn for source prefix

* black

* flake8
Co-authored-by: theo <theo@matussie.re>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

6f840990

12 Mar, 2021 1 commit
- AdamW is now supported by default (#9624) · 4c32f9f2
  Stas Bekman authored Mar 12, 2021
  
  4c32f9f2