Commits · 4a18337baed89e8cfd524c4b307a93b451ea1ef6 · chenpangpang / transformers

11 Oct, 2021 1 commit

Honor existing attention mask in tokenzier.pad (#13926) · 4a18337b

Sylvain Gugger authored Oct 11, 2021

* Honor existing attention mask in tokenzier.pad

* Fix initialization of attention mask

* Roll the implem on all subclasses

* Fix tests

4a18337b

08 Oct, 2021 3 commits

[Generation] Fix max_new_tokens (#13919) · c8b07612
Patrick von Platen authored Oct 08, 2021
```
* up

* Update src/transformers/generation_stopping_criteria.py

* finish
```
c8b07612
Adding support for tokens being suffixes or part of each other. (#13918) · d70919e6
Nicolas Patry authored Oct 08, 2021
```
* Adding support for tokens being suffixes or part of each other.

* Better test name.
```
d70919e6

Image Segmentation pipeline (#13828) · 026866df

Mishig Davaadorj authored Oct 08, 2021



* Implement img seg pipeline

* Update src/transformers/pipelines/image_segmentation.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/pipelines/image_segmentation.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update output shape with individual masks

* Rm dev change

* Remove loops in test
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

026866df

07 Oct, 2021 2 commits
- Fix incorrect output shapes for TF/PT LED (#13882) · 61cf2ea9
  Matt authored Oct 07, 2021
```
* Fix issues with LED model

* Style pass

* Bugfixes

* correct attentions as well
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  61cf2ea9
- [Wav2Vec2] Fix mask_feature_prob (#13921) · 0f5488f7
  Patrick von Platen authored Oct 07, 2021
```
* up

* overwrite hubert
```
  0f5488f7
06 Oct, 2021 2 commits
- Fixing Backward compatiblity for zero-shot (#13855) · 013bdc6d
  Nicolas Patry authored Oct 06, 2021
```
Fixes #13846
```
  013bdc6d
- Fixing GPU for token-classification in a better way. (#13856) · e7b16f33
  Nicolas Patry authored Oct 06, 2021
```
Co-authored-by: Pierre Snell <pierre.snell@botpress.com>
Co-authored-by: Pierre Snell <pierre.snell@botpress.com>
```
  e7b16f33
05 Oct, 2021 5 commits

Fixing question-answering with long contexts (#13873) · 0ddadbf0

Nicolas Patry authored Oct 05, 2021

* Tmp.

* Fixing BC for question answering with long context.

* Capping model_max_length to avoid tf overflow.

* Bad workaround bugged roberta.

* Fixing name.

0ddadbf0

Allow dataset to be an optional argument for (Distributed)LengthGroupedSampler (#13820) · 1b74af76
Zhaofeng Wu authored Oct 05, 2021
```
* Allow dataset to be an optional argument for (Distributed)LengthGroupedSampler

* Fix
```
1b74af76

Initial support for symbolic tracing with torch.fx allowing dynamic axes (#13579) · d4e4efce

Michael Benayoun authored Oct 05, 2021



* Symbolic trace dynamic axes support for BERT like models (albert, bert, distilbert, mobilebert, electra, megatron-bert)
* Sanity checks before tracing that make sure the model to trace is supported
* Adapted to PyTorch 1.9
Co-authored-by: Michael Benayoun <michael@huggingface.co>

d4e4efce

Fixing empty prompts for text-generation when BOS exists. (#13859) · 3a9c0f23

Nicolas Patry authored Oct 05, 2021

* Fixing empty prompts for text-generation when BOS exists.

* Fixing odd case with Pegasus.

* Fixing Bert is Assertion Error.

3a9c0f23

Fixing 1-length special tokens cut. (#13862) · 7079a99e
Nicolas Patry authored Oct 05, 2021

7079a99e

04 Oct, 2021 2 commits

Update no_* argument (HfArgumentParser) (#13865) · 12b4d66a

Bram Vanroy authored Oct 04, 2021

* update no_* argument

Changes the order so that the no_* argument is created after the original argument AND sets the default for this no_* argument to False

* import copy

* update test

* make style

* Use kwargs to set default=False

* make style

12b4d66a

Add Mistral GPT-2 Stability Tweaks (#13573) · 3a8de58c

Sidd Karamcheti authored Oct 04, 2021



* Add layer-wise scaling

* Add reorder & upcasting argument

* Add OpenAI GPT-2 weight initialization scheme

* start `layer_idx` count at zero for consistency

* disentangle attn and reordered and upscaled attn function

* rename `scale_attn_by_layer` to `scale_attn_by_layer_id`

* make autocast from amp compatible with pytorch<1.6

* fix docstring

* style fixes

* Add fixes from PR feedback, style tweaks

* Fix doc whitespace

* Reformat

* First pass scale_attn_by_layer_idx and reorder_and_upcast_attn tests

* Rename scale_attn_by_layer_idx, add tip

* Remove extra newline

* add test for weight initialization

* update code format

* add assert check weights are fp32

* remove assert

* Fix incorrect merge

* Fix shape mismatch in baddbmm

* Add generation test for Mistral flags
Co-authored-by: leandro <leandro.vonwerra@spoud.io>
Co-authored-by: Keshav Santhanam <keshav2@stanford.edu>
Co-authored-by: J38 <jebolton@stanford.edu>

3a8de58c

30 Sep, 2021 2 commits
- skip gptj slow generate tests for now (#13809) · 8bbb53e2
  Suraj Patil authored Oct 01, 2021
  
  8bbb53e2
- [DPR] Correct init (#13796) · 41436d3d
  Patrick von Platen authored Sep 30, 2021
```
* update

* add to docs and init

* make fix-copies
```
  41436d3d
29 Sep, 2021 2 commits
- Fix length of IterableDatasetShard and add test (#13792) · 63cc5bda
  Sylvain Gugger authored Sep 29, 2021
```
* Fix length of IterableDatasetShard and add test

* Add comments
```
  63cc5bda
- Enable readme link synchronization (#13785) · 7d84c3a4
  Li-Huai (Allan) Lin authored Sep 29, 2021
```
* Enable readme link synchronization

* Style

* Reuse regex pattern

* Apply suggestions

* Update
```
  7d84c3a4
26 Sep, 2021 1 commit
- [Tests] Cast Hubert test models to fp16 (#13755) · e0d31a89
  Anton Lozhkov authored Sep 26, 2021
  
  e0d31a89
25 Sep, 2021 1 commit
- finish (#13743) · 067413fb
  Patrick von Platen authored Sep 25, 2021
  
  067413fb
24 Sep, 2021 2 commits
- up (#13729) · e579f855
  Patrick von Platen authored Sep 24, 2021
  
  e579f855
- Fixing zero-shot backward compatiblity (#13725) · 0eabe492
  Nicolas Patry authored Sep 24, 2021
```
Fixes #13697
```
  0eabe492
23 Sep, 2021 1 commit

Add SigOpt HPO to transformers trainer api (#13572) · 6a3a197f

kding1 authored Sep 23, 2021



* add sigopt hpo to transformers.
Signed-off-by: Ding, Ke <ke.ding@intel.com>

* extend sigopt changes to test code and others..
Signed-off-by: Ding, Ke <ke.ding@intel.com>

* Style.

* fix style for sigopt integration.
Signed-off-by: Ding, Ke <ke.ding@intel.com>

* Add necessary information to run unittests on SigOpt.
Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>

6a3a197f

22 Sep, 2021 4 commits

Fix torchscript tests (#13701) · ca257a06
Lysandre Debut authored Sep 22, 2021

ca257a06

[GPT-J] Use the `float16` checkpoints in integration tests (#13676) · 7c7d2ec9

Anton Lozhkov authored Sep 22, 2021

* Use fp16 checkpoints

* Style

* Fix outputs and disable OOM tests

* Correct another output

* Use a random smaller model for generation tests

* repo quickfix

* fix gradient checkpointing

7c7d2ec9

Make gradient_checkpointing a training argument (#13657) · 27d46397

Sylvain Gugger authored Sep 22, 2021



* Make gradient_checkpointing a training argument

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/configuration_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Fix tests

* Style

* document Gradient Checkpointing as a performance feature

* Small rename

* PoC for not using the config

* Adapt BC to new PoC

* Forgot to save

* Rollout changes to all other models

* Fix typo
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>

27d46397

[Wav2Vec2FeatureExtractor] Fix `extractor.pad()` dtype backwards compatibility (#13693) · 75f6641e
Anton Lozhkov authored Sep 22, 2021
```
* Force dtype, add tests

* Local torch imports

* Remove unused logic (always ndarray)
```
75f6641e

21 Sep, 2021 8 commits

[AutoTokenizer] Allow creation of tokenizers by tokenizer type (#13668) · 8e908c8c
Patrick von Platen authored Sep 22, 2021
```
* up

* up
```
8e908c8c
up (#13688) · 2608944d
Patrick von Platen authored Sep 22, 2021

2608944d
Skip FlaxWav2Vec2 test until fixed · d16bec95
Sylvain Gugger authored Sep 21, 2021

d16bec95

Layoutlm onnx support (Issue #13300) (#13562) · ddd4d02f

Nishant Prabhu authored Sep 22, 2021



* Add support for exporting PyTorch LayoutLM to ONNX

* Added tests for converting LayoutLM to ONNX

* Add support for exporting PyTorch LayoutLM to ONNX

* Added tests for converting LayoutLM to ONNX

* cleanup

* Removed regression/ folder

* Add support for exporting PyTorch LayoutLM to ONNX

* Added tests for converting LayoutLM to ONNX

* cleanup

* Fixed import error

* Remove unnecessary import statements

* Changed max_2d_positions from class variable to instance variable of the config class

* Add support for exporting PyTorch LayoutLM to ONNX

* Added tests for converting LayoutLM to ONNX

* cleanup

* Add support for exporting PyTorch LayoutLM to ONNX

* cleanup

* Fixed import error

* Changed max_2d_positions from class variable to instance variable of the config class

* Use super class generate_dummy_inputs method
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

* Add support for Masked LM, sequence classification and token classification
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

* Removed uncessary import and method

* Fixed code styling

* Raise error if PyTorch is not installed

* Remove unnecessary import statement
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

ddd4d02f

[SequenceFeatureExtractor] Rewrite padding logic from pure python to numpy (#13650) · 1417978c

Anton Lozhkov authored Sep 21, 2021

* Test np padding

* Pass feature extraction tests

* Update type hints

* Fix flaky integration tests

* Try a more stable waveform

* Add to_numpy jax support

* int32 attention masks

* Refactor normalization tests

1417978c

Typo "UNKWOWN" -> "UNKNOWN" (#13675) · 8d533e6a
Kamal Raj authored Sep 21, 2021

8d533e6a

beit-flax (#13515) · a2dec768

Kamal Raj authored Sep 21, 2021

* beit-flax

* updated FLAX_BEIT_MLM_DOCSTRING

* removed bool_masked_pos from classification

* updated Copyright

* code refactoring: x -> embeddings

* updated test: rm from_pt

* Update docs/source/model_doc/beit.rst

* model code dtype updates and
other changes according to review

* relative_position_bias
revert back to pytorch design

a2dec768

Add Speech AutoModels (#13655) · 48fa42e5
Patrick von Platen authored Sep 21, 2021
```
* upload

* correct

* correct

* correct

* finish

* up

* up

* up again
```
48fa42e5

20 Sep, 2021 2 commits

Dynamically load model code from the Hub (#13467) · 002a078a

Sylvain Gugger authored Sep 20, 2021



* Dynamic model

* Use defensive flag

* Style

* Doc and arg rename

* Arg rename

* Add tests

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

002a078a

Add FNet (#13045) · d8049331

Gunjan Chhablani authored Sep 20, 2021



* Init FNet

* Update config

* Fix config

* Update model classes

* Update tokenizers to use sentencepiece

* Fix errors in model

* Fix defaults in config

* Remove position embedding type completely

* Fix typo and take only real numbers

* Fix type vocab size in configuration

* Add projection layer to embeddings

* Fix position ids bug in embeddings

* Add minor changes

* Add conversion script and remove CausalLM vestiges

* Fix conversion script

* Fix conversion script

* Remove CausalLM Test

* Update checkpoint names to dummy checkpoints

* Add tokenizer mapping

* Fix modeling file and corresponding tests

* Add tokenization test file

* Add PreTraining model test

* Make style and quality

* Make tokenization base tests work

* Update docs

* Add FastTokenizer tests

* Fix fast tokenizer special tokens

* Fix style and quality

* Remove load_tf_weights vestiges

* Add FNet to  main README

* Fix configuration example indentation

* Comment tokenization slow test

* Fix style

* Add changes from review

* Fix style

* Remove bos and eos tokens from tokenizers

* Add tokenizer slow test, TPU transforms, NSP

* Add scipy check

* Add scipy availabilty check to test

* Fix tokenizer and use correct inputs

* Remove remaining TODOs

* Fix tests

* Fix tests

* Comment Fourier Test

* Uncomment Fourier Test

* Change to google checkpoint

* Add changes from review

* Fix activation function

* Fix model integration test

* Add more integration tests

* Add comparison steps to MLM integration test

* Fix style

* Add masked tokenization fix

* Improve mask tokenization fix

* Fix index docs

* Add changes from review

* Fix issue

* Fix failing import in test

* some more fixes

* correct fast tokenizer

* finalize

* make style

* Remove additional tokenization logic

* Set do_lower_case to False

* Allow keeping accents

* Fix tokenization test

* Fix FNet Tokenizer Fast

* fix tests

* make style

* Add tips to FNet docs
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

d8049331

17 Sep, 2021 2 commits
- Fix GPT2Config parameters in GPT2ModelTester (#13630) · b518aaf1
  calpt authored Sep 17, 2021
  
  b518aaf1
- Updated tiny distilbert models (#13631) · 300ee0c7
  Lysandre Debut authored Sep 17, 2021
  
  300ee0c7