- 18 Nov, 2021 4 commits
-
-
NielsRogge authored
* First draft
* More improvements
* Improve conversion script
* Fix init weights for layer norm
* Fix correct model for conversion script
* Don't tie input and output embeddings
* Add print statements for debugging
* Add print statements for debugging
* Fix vocab size of model
* Improve documentation, remove fast tokenizer
* Add ImageGPTForImageClassification, improve docs
* Fix docs issue
* Set verbosity level back to info
* Improve tests
* Fix tests and add figure
* Delete tokenizer file
* Remove ImageGPTTokenizer from init files
* Remove ImageGPTLayer from init files
* Remove ImageGPT tokenizer from docs
* First draft of ImageGPTFeatureExtractor
* Fix typo
* Fix bug
* More improvements
* Apply suggestions from code review, add tests for feature extractor
* Fix layernorm
* Update save_pretrained method
* Fix issue
* Make all tests of ImageGPTFeatureExtractor pass
* Update code examples
* Rename model inputs to pixel_values
* Improve code examples
* Update init_weights to post_init
* Fix post_init
-
Sylvain Gugger authored
* Add a post init method to all models
* Fix tests
* Fix last tests
* Fix templates
* Add comment
* Forgot to save
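A minimal sketch of the pattern this PR introduces (toy classes, not the actual Transformers implementation): every model calls a shared `post_init` hook at the end of its `__init__`, so post-construction steps such as weight initialization run through one uniform entry point instead of being repeated by hand in each model.

```python
class PreTrainedSketch:
    """Illustrative base class; subclasses call post_init() at the end of __init__."""

    def post_init(self):
        # Centralizes steps that previously every model invoked individually,
        # e.g. weight initialization and backward-compatibility setup.
        self.init_weights()

    def init_weights(self):
        self.initialized = True


class ToyModel(PreTrainedSketch):
    def __init__(self):
        self.initialized = False
        # ... build submodules here ...
        self.post_init()  # single, uniform hook after construction


model = ToyModel()
print(model.initialized)  # True
```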
-
NielsRogge authored
-
William Held authored
-
- 17 Nov, 2021 6 commits
-
-
N authored
* test: make sure model configs are jsonifiable
* fix: return python dict instead of config object
* fix: accept pretrained config and use correct class
* Re-enabling slow tests and applying them to core models only
* Re-enabling slow tests and applying them to core models only
* Add new test file to fetcher
* Remove tooslow tests from test_modeling_tf_common.py
* make style
* Style fixes
* Style fixes
* Style fixes
* Style fixes
* Adding core tests to GPT2 and BART
* Removing unused imports

Co-authored-by: niklas.fruehauf <niklas.fruehauf@sovanta.com>
Co-authored-by: matt <rocketknight1@gmail.com>
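A rough illustration of the "return python dict instead of config object" fix (the class and field names here are assumptions, not the real config code): serializing a config object directly fails, but exposing its state as a plain dict makes it JSON-serializable.

```python
import json


class ToyConfig:
    """Hypothetical stand-in for a model config object."""

    def __init__(self, hidden_size=32, num_layers=2):
        self.hidden_size = hidden_size
        self.num_layers = num_layers

    def to_dict(self):
        # Return a plain python dict rather than the config object itself,
        # so callers can hand the result straight to json.dumps().
        return dict(self.__dict__)


config = ToyConfig()
serialized = json.dumps(config.to_dict())   # jsonifiable: plain dict
restored = ToyConfig(**json.loads(serialized))
print(restored.hidden_size, restored.num_layers)  # 32 2
```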
-
Patrick von Platen authored
-
Antonio Carlos Falcão Petri authored
Co-authored-by: Stas Bekman <stas@stason.org>
-
Lysandre authored
-
NielsRogge authored
* Improve tests
* Improve documentation
* Add ignore_index attribute
* Add semantic_ignore_index to BEiT model
* Add segmentation maps argument to BEiTFeatureExtractor
* Simplify SegformerFeatureExtractor and corresponding tests
* Improve tests
* Apply suggestions from code review
* Minor docs improvements
* Streamline segmentation map tests of SegFormer and BEiT
* Improve reduce_labels docs and test
* Fix code quality
* Fix code quality again
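As a rough illustration of what a `reduce_labels` option does in these feature extractors (the ignore value of 255 is a common segmentation convention and an assumption here, not a quote of the BEiT/SegFormer code): the background class 0 is mapped to the ignore index, and all remaining labels are shifted down by one.

```python
IGNORE_INDEX = 255  # conventional "ignored" value for segmentation losses


def reduce_labels(segmentation_map, ignore_index=IGNORE_INDEX):
    """Shift labels down by 1 so that class 0 (background) is ignored.

    Illustrative only: operates on a nested list rather than a real image array.
    """
    return [
        [ignore_index if label == 0 else label - 1 for label in row]
        for row in segmentation_map
    ]


seg_map = [[0, 1, 2],
           [3, 0, 1]]
print(reduce_labels(seg_map))  # [[255, 0, 1], [2, 255, 0]]
```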
-
Patrick von Platen authored
* add new wav2vec2 translation
* correct
* up
* add tests
* correct end copy
* correct more
* up
* correct unispeech sat
* finish
* finalize
* finish
* up
-
- 16 Nov, 2021 5 commits
-
-
Sylvain Gugger authored
* Create branch for tests
* Pin first upgrade
* Really pin
* Polish fix
-
Lysandre authored
-
Valentin authored
* Stop training when a finite IterableDataset is exhausted

  When using an iterable dataset, num_epochs is set to sys.maxsize to make sure all data is consumed. Likewise, we want to set max_steps high enough, but still stop when all data is consumed.

  (cherry picked from commit 6f0e1d6363153da9051e93acffe1cbab3a3f3b12)
* fix typo flase -> false
* add test for stopping training on exhausted finite iterable dataset
* remove redundant gradient_accumulation_steps
* run make style, reformat training_args docstring
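A simplified sketch of the stopping logic described above (a toy loop, not the actual Trainer code): with an iterable dataset the epoch count is effectively unbounded, so training must also stop when the iterator itself runs dry, not only when max_steps is reached.

```python
import sys


def train_on_iterable(batches, max_steps=sys.maxsize):
    """Consume batches until max_steps is hit *or* the finite iterable is
    exhausted, whichever comes first. Returns the number of steps taken."""
    step = 0
    iterator = iter(batches)
    while step < max_steps:
        try:
            next(iterator)  # fetch the next batch; the "training" is elided
        except StopIteration:
            break  # finite dataset exhausted -> stop instead of looping forever
        step += 1
    return step


print(train_on_iterable(range(10), max_steps=100))  # 10: stopped at exhaustion
print(train_on_iterable(range(10), max_steps=4))    # 4: stopped at max_steps
```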
-
Sylvain Gugger authored
* Add forward method to dummy models
* Fix quality
-
Sylvain Gugger authored
* Fix gradient_checkpointing backward compatibility
* Remove needless line
* make sure mask prob is big enough and length small enough
* Fix tests

Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
-
- 15 Nov, 2021 8 commits
-
-
Lysandre Debut authored
* Allow per-version configurations
* Update tests/test_configuration_common.py
* Update tests/test_configuration_common.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Patrick von Platen authored
* [Wav2Vec2] Make sure that gradient checkpointing is only run if needed
* make fix-copies
-
Eldar Kurtic authored
Running Movement Pruning experiments with the newest Hugging Face Transformers would crash due to the non-existent BertLayerNorm.
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
-
Patrick von Platen authored
* [Speech2Text2] Enable tokenizers
* minor fix
* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Matt authored
-
Stas Bekman authored
* [doc] performance and parallelism doc update
* improve
* improve
-
- 14 Nov, 2021 1 commit
-
-
nbertagnolli authored
* Raise exceptions instead of using asserts for control flow in modeling_openai #12789
* reformatted file
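The general pattern this commit applies (a sketch; the function name below is hypothetical, the real change is in `modeling_openai.py`): `assert` statements are stripped when Python runs with `-O`, so user-facing input validation should raise exceptions instead.

```python
def check_head_mask_old(head_mask, num_layers):
    # Before: this check disappears entirely under `python -O`
    assert len(head_mask) == num_layers, "head_mask has wrong number of layers"


def check_head_mask(head_mask, num_layers):
    # After: always enforced, with an informative error type and message
    if len(head_mask) != num_layers:
        raise ValueError(
            f"head_mask should have {num_layers} layers, got {len(head_mask)}"
        )


check_head_mask([None, None], num_layers=2)  # valid input passes silently
try:
    check_head_mask([None], num_layers=2)
except ValueError as err:
    print(err)
```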
-
- 13 Nov, 2021 2 commits
-
-
Suraj Patil authored
* add return_tensors parameter
* fix test
* Apply suggestions from code review
* style

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Suraj Patil authored
-
- 12 Nov, 2021 4 commits
-
-
Li-Huai (Allan) Lin authored
* Add normalizer to FNetConverter
* Style
* Directly use AlbertConverter
-
Patrick von Platen authored
* improve some stuff
* finish
* correct last
-
Suraj Patil authored
-
Nicolas Patry authored
* Adding support for raw python `generator` in addition to `Dataset`

  The main goal is to ease the creation of streaming data into the pipeline. `Dataset` is more involved and PyTorch-specific. This PR provides a way to use a plain Python iterator too. This enabled #14250 but can be proposed as a standalone PR.

  ```python
  from transformers import pipeline

  def read_data(filename):
      with open(filename, 'r') as f:
          for line in f:
              yield line

  pipe = pipeline("text-classification")
  for classified in pipe(read_data("large_file.txt")):
      print("Success!", classified)
  ```

  The main caveat is the interaction with `DataLoader` when `num_workers > 1`. With multiple workers, each receives a copy of the generator (as with `IterableDataset`), so a naive iterator fails: every worker iterates over all items of the generator. There are ways to do clever "skipping", but they can still be costly, since every worker must pass through all items (ignoring the ones it does not handle). Using `num_workers=1` is the simplest fix and, if the cost of loading your data is small enough, should be good enough. In the example above, smart tricks to skip some lines are unlikely to be a net positive. If there are better ways to do "jumps" on some data, then using `Dataset` is more advisable (different workers can then jump by themselves).

* Adding iterator support for `tf` too.
-
- 11 Nov, 2021 7 commits
-
-
Stas Bekman authored
-
Suraj Patil authored
* fix loading flax bf16 weights in pt
* fix clip test
* fix t5 test
* add logging statement
* Update src/transformers/modeling_flax_pytorch_utils.py
* switch back to native any
* fix check for bf16 weights

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Matt authored
* Fixing requirements for TF LM models and use correct model mappings
* make style
-
Matt authored
* Experimenting with adding proper get_config() and from_config() methods
* Adding a test for get/from config
* Fix test for get/from config
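A minimal sketch of the Keras-style `get_config()` / `from_config()` contract these commits implement (a toy class, not the actual TF model code): `get_config` returns a plain dict of constructor arguments, and `from_config` rebuilds an equivalent object from it, enabling serialization round trips.

```python
class ToyTFModel:
    """Hypothetical stand-in for a TF model following the Keras config contract."""

    def __init__(self, vocab_size=100, hidden_size=16):
        self.vocab_size = vocab_size
        self.hidden_size = hidden_size

    def get_config(self):
        # Everything needed to re-create the object, as plain python values
        return {"vocab_size": self.vocab_size, "hidden_size": self.hidden_size}

    @classmethod
    def from_config(cls, config):
        return cls(**config)


model = ToyTFModel(vocab_size=50)
clone = ToyTFModel.from_config(model.get_config())
print(clone.vocab_size, clone.hidden_size)  # 50 16
```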
-
Suraj Patil authored
-
Suraj Patil authored
* fix inits
* fix embed dtype
* fix embed dtype
* add test to check default dtype
* quality
* add type conversion methods for flax models
* more robust casting
* cast sinusoidal positions
* update pegasus
* update albert
* update test
* make sure dtype is passed to every module
* style
* fix electra dense
* fix t5
* quality
* add more tests
* better name
* use the dtype for lm head computation
* fix albert
* style
* fix albert embed dtype
* more tests
* fix vision enc-dec
* cleanup
* fix embed dtype pegasus
* fix default param test
* doc
* update template
* fix final_logits_bias dtype
* Apply suggestions from code review
* fix doc
* fix doc
* add detailed docstring for dtype parameter
* remove un-necessary import

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Stas Bekman authored
-
- 10 Nov, 2021 3 commits
-
-
Li-Huai (Allan) Lin authored
* Fix index out of range when padding
* Apply suggestions from code review
* Style
-
Chang Wang authored
-
Ella Charlaix authored
* Add notebook applying Intel Neural Compressor quantization for text classification tasks
* Add Optimum notebooks section
-