Commits · 14cc50d081c320331d850a64a54f1d732fa557ea · chenpangpang / transformers

30 Nov, 2021 1 commit

Kamal Raj authored Nov 30, 2021

* TF Tapas first commit

* updated docs

* updated logger message

* updated pytorch weight conversion
script to support scalar array

* added use_cache to tapas model config to
work properly with tf input_processing

* 1. rm embeddings_sum
2. added # Copied
3. + TFTapasMLMHead
4. and lot other small fixes

* updated docs

* + test for tapas

* updated testing_utils to check
is_tensorflow_probability_available

* converted model logits post processing using
numpy to work with both PT and TF models

* + TFAutoModelForTableQuestionAnswering

* added TF support

* added test for
TFAutoModelForTableQuestionAnswering

* added test for
TFAutoModelForTableQuestionAnswering pipeline

* updated auto model docs

* fixed typo in import

* added tensorflow_probability to run tests

* updated MLM head

* updated tapas.rst with TF  model docs

* fixed optimizer import in docs

* updated convert to np
data from pt model is not
`transformers.tokenization_utils_base.BatchEncoding`
after pipeline upgrade

* updated pipeline:
1. with torch.no_gard removed, pipeline forward handles
2. token_type_ids converted to numpy

* updated docs.

* removed `use_cache` from config

* removed floats_tensor

* updated code comment

* updated Copyright Year and
logits_aggregation Optional

* updated docs and comments

* updated docstring

* fixed model weight loading

* make fixup

* fix indentation

* added tf slow pipeline test

* pip upgrade

* upgrade python to 3.7

* removed from_pt from tests

* revert commit f18cfa9

c468a87a

24 Nov, 2021 1 commit
- [Tests] Improve vision tests (#14458) · 3772af49
  NielsRogge authored Nov 24, 2021
```
* Improve tests

* Install vision for tf tests
```
  3772af49
19 Nov, 2021 2 commits

Add QDQBert model and quantization examples of SQUAD task (#14066) · a59e7c1e

Shang Zhang authored Nov 19, 2021



* clean up branch for add-qdqbert-model

* README update for QAT example; update docstrings in modeling_qdqbert.py

* Update qdqbert.rst

* Update README.md

* Update README.md

* calibration data using traning set; QAT example runs in fp32

* re-use BERTtokenizer for qdqbert

* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove qdqbert tokenizer

* Update qdqbert.rst

* update evaluate-hf-trt-qa.py

* update configuration_qdqbert.py

* update modeling_qdqbert.py: add copied statement; replace assert with ValueError

* update copied from statement

* add is_quantization_available; run make fix-copies

* unittest add require_quantization

* add backend dependency to qdqbert model

* update README; update evaluate script; make style

* lint

* docs qdqbert update

* circleci build_doc add pytorch-quantization for qdqbert

* update README

* update example readme with instructions to upgrade TensorRT to 8.2

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* change quantization to pytorch_quantization for backend requirement

* feed_forward_chunking not supported in QDQBert

* make style

* update model docstrings and comments in testing scripts

* rename example to quantization-qdqbert; rename example scripts from qat to quant

* Update src/transformers/models/qdqbert/modeling_qdqbert.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* rm experimental functions in quant_trainer

* qa cleanup

* make fix-copies for docs index.rst

* fix doctree; use post_init() for qdqbert

* fix early device assignment for qdqbert

* fix CI:Model templates runner
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a59e7c1e

Add GitPython to quality tools (#14459) · 331c3d2a
Lysandre Debut authored Nov 19, 2021
```
* Update setup.py

* Update setup.py

* Update setup.py

* Remove GitPython install
```
331c3d2a

17 Nov, 2021 1 commit
- Docs for version v4.12.5 · c6c07554
  Lysandre authored Nov 17, 2021
  
  c6c07554
16 Nov, 2021 1 commit
- Docs for v4.12.4 · 888fb211
  Lysandre authored Nov 16, 2021
  
  888fb211
03 Nov, 2021 1 commit

Quality explain (#14264) · f0d6e952

Sylvain Gugger authored Nov 03, 2021



* Start PR doc

* Cleanup the quality checks and document them

* Add reference in the contributing guide

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Rename file as per review suggestion
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

f0d6e952

29 Oct, 2021 4 commits
- Docs for v4.12.2 · 9fc19517
  Lysandre authored Oct 29, 2021
  
  9fc19517
- Docs for v4.12.1 · 513fa30a
  Lysandre authored Oct 29, 2021
  
  513fa30a
- Torch 1.10 (#14169) · 63d91f44
  Lysandre Debut authored Oct 29, 2021
```
* Torch 1.10

* torch scatter for 1.10

* style

* Skip tests
ok
```
  63d91f44
- Fix pipeline tests env and fetch (#14209) · 4ab6a4a0
  Sylvain Gugger authored Oct 29, 2021
```
* Fix pipeline tests env and fetch

* Fix quality
```
  4ab6a4a0
28 Oct, 2021 1 commit
- v4.13.0.dev0 · b8fad022
  Lysandre authored Oct 28, 2021
  
  b8fad022
14 Oct, 2021 1 commit
- Scatter dummies + skip pipeline tests (#13996) · 5b317f7e
  Lysandre Debut authored Oct 14, 2021
```
* Scatter dummies + skip pipeline tests

* Add torch scatter to build docs
```
  5b317f7e
06 Oct, 2021 1 commit
- Deploy docs for v4.11.3 · 5be59a36
  Lysandre authored Oct 06, 2021
  
  5be59a36
30 Sep, 2021 1 commit
- Update doc for v4.11.2 · 5f25855b
  Sylvain Gugger authored Sep 30, 2021
  
  5f25855b
29 Sep, 2021 1 commit
- Update doc for v4.11.1 · cf4aa359
  Sylvain Gugger authored Sep 29, 2021
  
  cf4aa359
27 Sep, 2021 1 commit
- Docs for version v4.11.0 · 11c69b80
  Lysandre authored Sep 27, 2021
  
  11c69b80
25 Sep, 2021 1 commit
- Update test dependence for torch examples (#13738) · a8ec0029
  Sylvain Gugger authored Sep 25, 2021
  
  a8ec0029
16 Sep, 2021 1 commit

Properly use test_fetcher for examples (#13604) · af5c6ae5

Sylvain Gugger authored Sep 16, 2021

* Properly use test_fetcher for examples

* Fake example modification

* Fake modeling file modification

* Clean fake modifications

* Run example tests for any modification.

af5c6ae5

10 Sep, 2021 1 commit
- Docs for v4.10.1 · 72ec2f3e
  patrickvonplaten authored Sep 10, 2021
  
  72ec2f3e
01 Sep, 2021 2 commits
- Redeploy stable documentation · c1b20e42
  Sylvain Gugger authored Sep 01, 2021
  
  c1b20e42
- Revert "Correct wrong function signatures on the docs website (#13198)" · 85cb4477
  Li-Huai (Allan) Lin authored Aug 30, 2021
```
This reverts commit ffecfea9.
```
  85cb4477
31 Aug, 2021 3 commits
- Re-deploy documentation · e53af030
  Lysandre authored Aug 31, 2021
  
  e53af030
- Docs for v4.10.0 · 5ee67a44
  Lysandre authored Aug 31, 2021
  
  5ee67a44
- [Testing] Add Flax Tests on GPU, Add Speech and Vision to Flax & TF tests (#13313) · 062300ba
  Patrick von Platen authored Aug 31, 2021
```
* up

* finish

* Apply suggestions from code review

* apply Lysandres suggestions

* adapt circle ci as well

* finish

* Update setup.py
```
  062300ba
30 Aug, 2021 2 commits

Correct wrong function signatures on the docs website (#13198) · ffecfea9

Li-Huai (Allan) Lin authored Aug 30, 2021

* Correct outdated function signatures on website.

* Upgrade sphinx to 3.5.4 (latest 3.x)

* Test

* Test

* Test

* Test

* Test

* Test

* Revert unnecessary changes.

* Change sphinx version to 3.5.4"

* Test python 3.7.11

ffecfea9

Add LayoutLMv2 + LayoutXLM (#12604) · b6ddb08a

NielsRogge authored Aug 30, 2021



* First commit

* Make style

* Fix dummy objects

* Add Detectron2 config

* Add LayoutLMv2 pooler

* More improvements, add documentation

* More improvements

* Add model tests

* Add clarification regarding image input

* Improve integration test

* Fix bug

* Fix another bug

* Fix another bug

* Fix another bug

* More improvements

* Make more tests pass

* Make more tests pass

* Improve integration test

* Remove gradient checkpointing and add head masking

* Add integration test

* Add LayoutLMv2ForSequenceClassification to the tests

* Add LayoutLMv2ForQuestionAnswering

* More improvements

* More improvements

* Small improvements

* Fix _LazyModule

* Fix fast tokenizer

* Move sync_batch_norm to a separate method

* Replace dummies by requires_backends

* Move calculation of visual bounding boxes to separate method + update README

* Add models to main init

* First draft

* More improvements

* More improvements

* More improvements

* More improvements

* More improvements

* Remove is_split_into_words

* More improvements

* Simply tesseract - no use of pandas anymore

* Add LayoutLMv2Processor

* Update is_pytesseract_available

* Fix bugs

* Improve feature extractor

* Fix bug

* Add print statement

* Add truncation of bounding boxes

* Add tests for LayoutLMv2FeatureExtractor and LayoutLMv2Tokenizer

* Improve tokenizer tests

* Make more tokenizer tests pass

* Make more tests pass, add integration tests

* Finish integration tests

* More improvements

* More improvements - update API of the tokenizer

* More improvements

* Remove support for VQA training

* Remove some files

* Improve feature extractor

* Improve documentation and one more tokenizer test

* Make quality and small docs improvements

* Add batched tests for LayoutLMv2Processor, remove fast tokenizer

* Add truncation of labels

* Apply suggestions from code review

* Improve processor tests

* Fix failing tests and add suggestion from code review

* Fix tokenizer test

* Add detectron2 CI job

* Simplify CI job

* Comment out non-detectron2 jobs and specify number of processes

* Add pip install torchvision

* Add durations to see which tests are slow

* Fix tokenizer test and make model tests smaller

* Frist draft

* Use setattr

* Possible fix

* Proposal with configuration

* First draft of fast tokenizer

* More improvements

* Enable fast tokenizer tests

* Make more tests pass

* Make more tests pass

* More improvements

* Addd padding to fast tokenizer

* Mkae more tests pass

* Make more tests pass

* Make all tests pass for fast tokenizer

* Make fast tokenizer support overflowing boxes and labels

* Add support for overflowing_labels to slow tokenizer

* Add support for fast tokenizer to the processor

* Update processor tests for both slow and fast tokenizers

* Add head models to model mappings

* Make style & quality

* Remove Detectron2 config file

* Add configurable option to label all subwords

* Fix test

* Skip visual segment embeddings in test

* Use ResNet-18 backbone in tests instead of ResNet-101

* Proposal

* Re-enable all jobs on CI

* Fix installation of tesseract

* Fix failing test

* Fix index table

* Add LayoutXLM doc page, first draft of code examples

* Improve documentation a lot

* Update expected boxes for Tesseract 4.0.0 beta

* Use offsets to create labels instead of checking if they start with ##

* Update expected boxes for Tesseract 4.1.1

* Fix conflict

* Make variable names cleaner, add docstring, add link to notebooks

* Revert "Fix conflict"

This reverts commit a9b46ce9afe47ebfcfe7b45e6a121d49e74ef2c5.

* Revert to make integration test pass

* Apply suggestions from @LysandreJik's review

* Address @patrickvonplaten's comments

* Remove fixtures DocVQA in favor of dataset on the hub
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

b6ddb08a

13 Aug, 2021 1 commit
- Fix CircleCI nightly tests (#13113) · b0a917c4
  Sylvain Gugger authored Aug 13, 2021
  
  b0a917c4
10 Aug, 2021 1 commit

Roll out the test fetcher on push tests (#13055) · 9e9b8f1d

Sylvain Gugger authored Aug 10, 2021

* Use test fetcher for push tests as well

* Force diff with last commit for circleCI on master

* Fix syntax error

* Style

* Schedule nightly tests

9e9b8f1d

09 Aug, 2021 1 commit
- Documentation for patch v4.9.2 · a8bf2fa7
  Lysandre authored Aug 09, 2021
  
  a8bf2fa7
26 Jul, 2021 1 commit
- Update doc · a492aec8
  Sylvain Gugger authored Jul 26, 2021
  
  a492aec8
22 Jul, 2021 1 commit
- Docs for v4.10.0dev0 · 40de2d5a
  Lysandre authored Jul 22, 2021
  
  40de2d5a
20 Jul, 2021 1 commit
- add troubleshooting docs (#12791) · 7fae5350
  Stas Bekman authored Jul 20, 2021
  
  7fae5350
14 Jul, 2021 1 commit

Only test the files impacted by changes in the diff (#12644) · 084873b0

Sylvain Gugger authored Jul 14, 2021



* Base test

* More test

* Fix mistake

* Add a docstring change

* Add doc ignore

* Add changes

* Add recursive dep search

* Add recursive dep search

* save

* Finalize test mapping

* Fix bug

* Print prettier

* Ignore comments and empty lines

* Make script runnable from anywhere

* Need dev install

* Like that

* Adapt

* Add as artifact

* Try on torch tests

* Fix yaml error

* Install GitPython

* Apply everywhere

* Be more defensive

* Revert to all tests if something is wrong

* Install GitPython

* Test if there are tests before launching.

* Fixes

* Fixes

* Fixes

* Fixes

* Bash syntax is horrible

* Be less stupid

* Try differently

* Typo

* Typo

* Typo

* Style

* Better name

* Escape quotes

* Ignore black unhelpful re-formatting

* Not a docstring

* Deal with inits in dependency map

* Run all tests once PR is merged.

* Add last job

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Stronger dependencies gather

* Ignore empty lines too!

* Clean up

* Fix quality
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

084873b0

08 Jul, 2021 1 commit

[RFC] Laying down building stone for more flexible ONNX export capabilities (#11786) · 2aa3cd93

Funtowicz Morgan authored Jul 08, 2021

* Laying down building stone for more flexible ONNX export capabilities

* Ability to provide a map of config key to override before exporting.

* Makes it possible to export BART with/without past keys.

* Supports simple mathematical syntax for OnnxVariable.repeated

* Effectively apply value override from onnx config for model

* Supports export with additional features such as with-past for seq2seq

* Store the output path directly in the args for uniform usage across.

* Make BART_ONNX_CONFIG_* constants and fix imports.

* Support BERT model.

* Use tokenizer for more flexibility in defining the inputs of a model.

* Add TODO as remainder to provide the batch/sequence_length as CLI args

* Enable optimizations to be done on the model.

* Enable GPT2 + past

* Improve model validation with outputs containing nested structures

* Enable Roberta

* Enable Albert

* Albert requires opset >= 12

* BERT-like models requires opset >= 12

* Remove double printing.

* Enable XLM-Roberta

* Enable DistilBERT

* Disable optimization by default

* Fix missing setattr when applying optimizer_features

* Add value field to OnnxVariable to define constant input (not from tokenizers)

* Add T5 support.

* Simplify model type retrieval

* Example exporting token_classification pipeline for DistilBERT.

* Refactoring to package `transformers.onnx`

* Solve circular dependency & __main__

* Remove unnecessary imports in `__init__`

* Licences

* Use @Narsil's suggestion to forward the model's configuration to the ONNXConfig to avoid interpolation.

* Onnx export v2 fixes (#12388)

* Tiny fixes
Remove `convert_pytorch` from onnxruntime-less runtimes
Correct reference to model

* Style

* Fix Copied from

* LongFormer ONNX config.

* Removed optimizations

* Remvoe bad merge relicas.

* Remove unused constants.

* Remove some deleted constants from imports.

* Fix unittest to remove usage of PyTorch model for onnx.utils.

* Fix distilbert export

* Enable ONNX export test for supported model.

* Style.

* Fix lint.

* Enable all supported default models.

* GPT2 only has one output

* Fix bad property name when overriding config.

* Added unittests and docstrings.

* Disable with_past tests for now.

* Enable outputs validation for default export.

* Remove graph opt lvls.

* Last commit with on-going past commented.

* Style.

* Disabled `with_past` for now

* Remove unused imports.

* Remove framework argument

* Remove TFPreTrainedModel reference

* Add documentation

* Add onnxruntime tests to CircleCI

* Add test

* Rename `convert_pytorch` to `export`

* Use OrderedDict for dummy inputs

* WIP Wav2Vec2

* Revert "WIP Wav2Vec2"

This reverts commit f665efb04c92525c3530e589029f0ae7afdf603e.

* Style

* Use OrderedDict for I/O

* Style.

* Specify OrderedDict documentation.

* Style :)
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

2aa3cd93

06 Jul, 2021 1 commit
- Bump CircleCI machine sizes · 2870fd19
  Lysandre authored Jul 06, 2021
  
  2870fd19
30 Jun, 2021 1 commit
- Document patch release v4.8.2 · 89073a95
  Lysandre authored Jun 30, 2021
  
  89073a95
28 Jun, 2021 1 commit

[CI] add dependency table sync verification (#12364) · d25ad34c

Stas Bekman authored Jun 28, 2021

* add dependency table sync verification

* improve the message

* improve the message

* revert

* ready to merge

d25ad34c

24 Jun, 2021 1 commit
- Document patch release v4.8.1 · 5b1b5635
  Sylvain Gugger authored Jun 24, 2021
  
  5b1b5635
23 Jun, 2021 1 commit
- v4.9.0.dev0 · 2150dfed
  Sylvain Gugger authored Jun 23, 2021
  
  2150dfed