Commits · a4553e6c6474e19b6b19f98687869773d9c7781e · chenpangpang / transformers

22 Nov, 2021 1 commit
- Moving pipeline tests from `Narsil` to `hf-internal-testing`. (#14463) · a4553e6c
  Nicolas Patry authored Nov 22, 2021
```
* Moving everything to `hf-internal-testing`.

* Fixing test values.

* Moving to other repo.

* Last touch?
```
  a4553e6c
21 Nov, 2021 2 commits
- Fix dummy objects for quantization (#14478) · 1a92bc57
  Sylvain Gugger authored Nov 21, 2021
```
* Fix dummy objects for quantization

* Add more models
```
  1a92bc57
- add Tuple as possible type hint for EvalPredictions label_ids (#14473) · c9d2cf85
  Alexander Measure authored Nov 21, 2021
```
* Update trainer_utils.py

* add Tuple type hints to all label_ids outputs

affects EvalLoopOutput and PredicctionOutput
```
  c9d2cf85
19 Nov, 2021 5 commits

Add QDQBert model and quantization examples of SQUAD task (#14066) · a59e7c1e

Shang Zhang authored Nov 19, 2021



* clean up branch for add-qdqbert-model

* README update for QAT example; update docstrings in modeling_qdqbert.py

* Update qdqbert.rst

* Update README.md

* Update README.md

* calibration data using traning set; QAT example runs in fp32

* re-use BERTtokenizer for qdqbert

* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove qdqbert tokenizer

* Update qdqbert.rst

* update evaluate-hf-trt-qa.py

* update configuration_qdqbert.py

* update modeling_qdqbert.py: add copied statement; replace assert with ValueError

* update copied from statement

* add is_quantization_available; run make fix-copies

* unittest add require_quantization

* add backend dependency to qdqbert model

* update README; update evaluate script; make style

* lint

* docs qdqbert update

* circleci build_doc add pytorch-quantization for qdqbert

* update README

* update example readme with instructions to upgrade TensorRT to 8.2

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* change quantization to pytorch_quantization for backend requirement

* feed_forward_chunking not supported in QDQBert

* make style

* update model docstrings and comments in testing scripts

* rename example to quantization-qdqbert; rename example scripts from qat to quant

* Update src/transformers/models/qdqbert/modeling_qdqbert.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* rm experimental functions in quant_trainer

* qa cleanup

* make fix-copies for docs index.rst

* fix doctree; use post_init() for qdqbert

* fix early device assignment for qdqbert

* fix CI:Model templates runner
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a59e7c1e

Adding support for `hidden_states` and `attentions` in unbatching (#14420) · 81fe8afa
Nicolas Patry authored Nov 19, 2021
```
support.
```
81fe8afa
[Generation] Allow `inputs_embeds` as an input (#14443) · f25a9332
Patrick von Platen authored Nov 19, 2021
```
* up

* finalize

* finalize

* finish

* Update src/transformers/generation_utils.py

* apply feedback
```
f25a9332
[ImageGPT] Small fixes (#14460) · 0490b988
NielsRogge authored Nov 19, 2021
```
* Add integration test

* Fix typo
```
0490b988
Add GitPython to quality tools (#14459) · 331c3d2a
Lysandre Debut authored Nov 19, 2021
```
* Update setup.py

* Update setup.py

* Update setup.py

* Remove GitPython install
```
331c3d2a

18 Nov, 2021 7 commits

[Speech Recognition] More examples · efea0f86
Patrick von Platen authored Nov 18, 2021
```
Add more XLS-R training runs to the official examples
```
efea0f86
[Bert, et al] fix early device assignment (#14447) · 72a6bf33
Stas Bekman authored Nov 18, 2021
```
* fix early device assignment

* more models
```
72a6bf33
Fix finite IterableDataset test on multiple GPUs (#14445) · 83ef8bca
Sylvain Gugger authored Nov 18, 2021

83ef8bca

Add ImageGPT (#14240) · da36c557

NielsRogge authored Nov 18, 2021

* First draft

* More improvements

* Improve conversion script

* Fix init weights for layer norm

* Fix correct model for conversion script

* Don't tie input and output embeddings

* Add print statements for debugging

* Add print statements for debugging

* Fix vocab size of model

* Improve documentation, remove fast tokenizer

* Add ImageGPTForImageClassification, improve docs

* Fix docs issue

* Set verbosity level back to info

* Improve tests

* Fix tests and add figure

* Delete tokenizer file

* Remove ImageGPTTokenizer from init files

* Remove ImageGPTLayer from init files

* Remove ImageGPT tokenizer from docs

* First draft of ImageGPTFeatureExtractor

* Fix typo

* Fix bug

* More improvements

* Apply suggestions from code review, add tests for feature extractor

* Fix layernorm

* Update save_pretrained method

* Fix issue

* Make all tests of ImageGPTFeatureExtractor pass

* Update code examples

* Rename model inputs to pixel_values

* Improve code examples

* Update init_weights to post_init

* Fix post_init

da36c557

Add a post init method to all models (#14431) · d83b0e0c

Sylvain Gugger authored Nov 18, 2021

* Add a post init method to all models

* Fix tests

* Fix last tests

* Fix templates

* Add comment

* Forgot to save

d83b0e0c

Fix code example (#14441) · 08816de1
NielsRogge authored Nov 18, 2021

08816de1
Recover Deleted XNLI Instructions (#14437) · 01f8e639
William Held authored Nov 18, 2021

01f8e639

17 Nov, 2021 6 commits

[WIP] Ensure TF model configs can be converted to proper JSON (#14415) · 1991da07

N authored Nov 17, 2021



* test: make sure model configs are jsonifiable

* fix: return python dict instead of config object

* fix: accept pretrained config and use correct class

* Re-enabling slow tests and applying them to core models only

* Re-enabling slow tests and applying them to core models only

* Add new test file to fetcher

* Remove tooslow tests from test_modeling_tf_common.py

* make style

* Style fixes

* Style fixes

* Style fixes

* Style fixes

* Adding core tests to GPT2 and BART

* Removing unused imports
Co-authored-by: niklas.fruehauf <niklas.fruehauf@sovanta.com>
Co-authored-by: matt <rocketknight1@gmail.com>

1991da07

[Bart] Fix docs (#14434) · 754202de
Patrick von Platen authored Nov 17, 2021

754202de
[Gradient checkpoining] Update Wav2Vec scripts (#14036) · 7544efc9
Antonio Carlos Falcão Petri authored Nov 17, 2021
```
Co-authored-by: Stas Bekman <stas@stason.org>
```
7544efc9
Docs for version v4.12.5 · c6c07554
Lysandre authored Nov 17, 2021

c6c07554

Improve semantic segmentation models (#14355) · a2864a50

NielsRogge authored Nov 17, 2021

* Improve tests

* Improve documentation

* Add ignore_index attribute

* Add semantic_ignore_index to BEiT model

* Add segmentation maps argument to BEiTFeatureExtractor

* Simplify SegformerFeatureExtractor and corresponding tests

* Improve tests

* Apply suggestions from code review

* Minor docs improvements

* Streamline segmentation map tests of SegFormer and BEiT

* Improve reduce_labels docs and test

* Fix code quality

* Fix code quality again

a2864a50

[Wav2Vec2] Add New Wav2Vec2 Translation (#14392) · 700a748f

Patrick von Platen authored Nov 17, 2021

* add new wav2vec2 translation

* correct

* up

* add tests

* correct end copy

* correct more

* up

* correct unispeech sat

* finish

* finalize

* finish

* up

700a748f

16 Nov, 2021 5 commits

Debug doc (#14424) · b567510c

Sylvain Gugger authored Nov 16, 2021

* Create branch for tests

* Pin first upgrade

* Really pin

* Polish fix

b567510c

Docs for v4.12.4 · 888fb211
Lysandre authored Nov 16, 2021

888fb211

Avoid looping when data exhausted (#14413) · a33168aa

Valentin authored Nov 16, 2021

* stop training when a finite IterableDataset is exhausted

when using an iterable dataset num_epochs is set to
sys.maxsize to make sure all data is consumed
likewise we want to set max_steps high enough
but still stop when all data is consumed

(cherry picked from commit 6f0e1d6363153da9051e93acffe1cbab3a3f3b12)

* fix typo flase -> false

* add test for stopping training on exhausted finite iterable dataset

* remove redundant gradient_accumulation_steps

* run make style

reformat training_args docstring

a33168aa

Add forward method to dummy models (#14419) · 3e8d17e6
Sylvain Gugger authored Nov 16, 2021
```
* Add forward method to dummy models

* Fix quality
```
3e8d17e6

Fix gradient_checkpointing backward compatibility (#14408) · 040fd471

Sylvain Gugger authored Nov 16, 2021



* Fix gradient_checkpointing backward compatibility

* Remove needless line

* make sure mask prob is big enough and length small enough

* Fix tests
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

040fd471

15 Nov, 2021 8 commits

Allow per-version configurations (#14344) · 1cc453d3

Lysandre Debut authored Nov 15, 2021



* Allow per-version configurations

* Update tests/test_configuration_common.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/test_configuration_common.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

1cc453d3

[Wav2Vec2] Make sure that gradient checkpointing is only run if needed (#14407) · 76d0d41e
Patrick von Platen authored Nov 15, 2021
```
* [Wav2Vec2] Make sure that gradient checkpointing is only run if needed

* make fix-copies
```
76d0d41e

Replace BertLayerNorm with LayerNorm (#14385) · 9fd937ea

Eldar Kurtic authored Nov 15, 2021

Running Movement pruning experiments with the newest HuggingFace would crash due to non-existing BertLayerNorm.

9fd937ea

Fix weight loading issue (#14016) · a67d47b4
Yih-Dar authored Nov 15, 2021
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a67d47b4
Fix test and docs (#14399) · 74e6111b
NielsRogge authored Nov 15, 2021

74e6111b

[Speech2Text2] Enable tokenizers (#14390) · 4ce74edf

Patrick von Platen authored Nov 15, 2021



* [Speech2Text2] Enable tokenizers

* minor fix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

4ce74edf

Quick fix to TF summarization example (#14401) · 267867e8
Matt authored Nov 15, 2021

267867e8
[doc] performance and parallelism updates (#14391) · 29dfb2db
Stas Bekman authored Nov 14, 2021
```
* [doc] performance and parallelism doc update

* improve

* improve
```
29dfb2db

14 Nov, 2021 1 commit
- Raise exceptions instead of using asserts in modeling_openai #12789 (#14386) · 790cdc2e
  nbertagnolli authored Nov 13, 2021
```
* Raise exceptions instead of using asserts for control flow in modeling_openai #12789

* reformatted file
```
  790cdc2e
13 Nov, 2021 2 commits

[M2M100Tokenizer] fix _build_translation_inputs (#14382) · 2e60276b

Suraj Patil authored Nov 13, 2021



* add return_tensors paramter

* fix test

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

2e60276b

support wmt21 tokenizer in m2m100 tokenizer (#14376) · 31659304
Suraj Patil authored Nov 13, 2021

31659304

12 Nov, 2021 3 commits
- Use `AlbertConverter` for FNet instead of using FNet's own converter (#14365) · 280a811e
  Li-Huai (Allan) Lin authored Nov 13, 2021
```
* Add normalizer to FNetConverter

* Style

* Directly use AlbertConverter
```
  280a811e
- [Wav2Vec2 Example] Improve fine-tuning script (#14373) · 55f49c5f
  Patrick von Platen authored Nov 12, 2021
```
* improve some stuff

* finish

* correct last
```
  55f49c5f
- fix docs (#14377) · 21546e59
  Suraj Patil authored Nov 12, 2021
  
  21546e59