Commits · 6667b0d7bf0a135b9a85d2d21b7eb25ec8ad58cb · chenpangpang / transformers

"vscode:/vscode.git/clone" did not exist on "3502c202f998dc2d0eee4257c219342cc794f46a"

24 Aug, 2022 2 commits

add warning to let the user know that the `__call__` method is faster than... · 6667b0d7

SaulLu authored Aug 24, 2022

add warning to let the user know that the `__call__` method is faster than `encode` + `pad` for a fast tokenizer (#18693)

* add warning to let the user know that the  method is slower that  for a fast tokenizer

* user warnings

* fix layoutlmv2

* fix layout*

* change warnings into logger.warning

6667b0d7

Add TF implementation of `XGLMModel` (#16543) · c72d7d91

Daniel Stancl authored Aug 24, 2022



* Add TFXGLM models 

* Add todo: self.supports_xla_generation = False
Co-authored-by: Daniel Stancl <stancld@Daniels-MacBook-Pro.local>
Co-authored-by: Daniel Stancl <stancld@daniels-mbp.home>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Daniel <daniel.stancl@rossum.ai>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

c72d7d91

22 Aug, 2022 1 commit
- Add missing tokenizer tests - Longformer (#17677) · 0f257a87
  tgadeliya authored Aug 22, 2022
  
  0f257a87
19 Aug, 2022 1 commit
- Generate: add missing `**model_kwargs` in sample tests (#18696) · e95d433d
  Joao Gante authored Aug 19, 2022
  
  e95d433d
18 Aug, 2022 2 commits
- Rename second input dimension from "sequence" to "num_channels" for CV models (#17976) · 76454b08
  regisss authored Aug 18, 2022
  
  76454b08
- Generate: validate model_kwargs on FLAX (and catch typos in generate arguments) (#18653) · a541d974
  Joao Gante authored Aug 18, 2022
  
  a541d974
17 Aug, 2022 2 commits

Update feature extractor methods to enable type cast before normalize (#18499) · 49e44b21

amyeroberts authored Aug 17, 2022

* Update methods to optionally rescale
This is necessary to allow for casting our images / videos to numpy arrays within the feature extractors' call. We want to do this to make sure the behaviour is as expected when flags like  are False. If some transformations aren't applied, then the output type can't be unexpected e.g. a list of PIL images instead of numpy arrays.

* Cast images to numpy arrays in call to enable consistent behaviour with different configs

* Remove accidental clip changes

* Update tests to reflect the scaling logic
We write a generic  function to handle rescaling of our arrays. In order for the API to be intuitive, we take some factor c and rescale the image values by that. This means, the rescaling done in normalize and to_numpy_array are now done with array * (1/255) instead of array / 255. This leads to small differences in the resulting image. When testing, this was in the order of 1e-8, and so deemed OK

49e44b21

Fix Yolos ONNX export test (#18606) · c99e9846

Yih-Dar authored Aug 17, 2022


Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

c99e9846

16 Aug, 2022 1 commit

[bnb] Minor modifications (#18631) · 6d175c11

Younes Belkada authored Aug 17, 2022



* bnb minor modifications

- refactor documentation
- add troubleshooting README
- add PyPi library on DockerFile

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* put in one block

- put bash instructions in one block

* update readme

- refactor a bit hardware requirements

* change text a bit

* Apply suggestions from code review
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* apply suggestions
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* add link to paper

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update tests/mixed_int8/README.md

* Apply suggestions from code review

* refactor a bit

* add instructions Turing & Amperer
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* add A6000

* clarify a bit

* remove small part

* Update tests/mixed_int8/README.md
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

6d175c11

12 Aug, 2022 6 commits

small change (#18584) · 1ccd2515
Younes Belkada authored Aug 12, 2022

1ccd2515
Update BLOOM parameter counts (#18531) · 56ef0ba4
Niklas Muennighoff authored Aug 12, 2022
```
* Update BLOOM parameter counts

* Update BLOOM parameter counts
```
56ef0ba4

Add Donut (#18488) · 2ab790e8

NielsRogge authored Aug 12, 2022



* First draft

* Improve script

* Update script

* Make conversion work

* Add final_layer_norm attribute to Swin's config

* Add DonutProcessor

* Convert more models

* Improve feature extractor and convert base models

* Fix bug

* Improve integration tests

* Improve integration tests and add model to README

* Add doc test

* Add feature extractor to docs

* Fix integration tests

* Remove register_buffer

* Fix toctree and add missing attribute

* Add DonutSwin

* Make conversion script work

* Improve conversion script

* Address comment

* Fix bug

* Fix another bug

* Remove deprecated method from docs

* Make Swin and Swinv2 untouched

* Fix code examples

* Fix processor

* Update model_type to donut-swin

* Add feature extractor tests, add token2json method, improve feature extractor

* Fix failing tests, remove integration test

* Add do_thumbnail for consistency

* Improve code examples

* Add code example for document parsing

* Add DonutSwin to MODEL_NAMES_MAPPING

* Add model to appropriate place in toctree

* Update namespace to appropriate organization
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

2ab790e8

Supporting seq2seq models for `bitsandbytes` integration (#18579) · a5ca56ff

Younes Belkada authored Aug 12, 2022

* Supporting seq2seq models for `bitsandbytes` integration

- `bitsandbytes` integration supports now seq2seq models
- check if a model has tied weights as an additional check

* small modification

- tie the weights before looking at tied weights!

a5ca56ff

Generate: validate `model_kwargs` (and catch typos in generate arguments) (#18261) · ed1924e8
Joao Gante authored Aug 12, 2022
```
* validate generate model_kwargs

* generate tests -- not all models have an attn mask
```
ed1924e8

Load sharded pt to flax (#18419) · bce36ee0

Arthur authored Aug 12, 2022



* initial commit

* add small test

* add cross pt tf flag to test

* fix quality

* style

* update test with new repo

* fix failing test

* update

* fix wrong param ordering

* style

* update based on review

* update related to recent new caching mechanism

* quality

* Update based on review
Co-authored-by: sgugger <sylvain.gugger@gmail.com>

* quality and style

* Update src/transformers/modeling_flax_utils.py
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

bce36ee0

11 Aug, 2022 2 commits
- fix owlvit tests, update docstring examples (#18586) · f28f2408
  Alara Dirik authored Aug 11, 2022
  
  f28f2408
- Skip broken tests · 3f0707b2
  Sylvain Gugger authored Aug 11, 2022
  
  3f0707b2
10 Aug, 2022 3 commits

Adds CLIP to models exportable with ONNX (#18515) · f62cb831

Dhruv Karan authored Aug 11, 2022



* onnx config for clip

* default opset as 14

* changes from the original repo

* input values order fix

* outputs fix

* remove unused import

* ran make fix-copies

* black format

* review comments: forward ref, import fix, model change revert, .to cleanup

* make style

* formatting fixes

* revert groupvit

* comment for cast to int32

* comment fix

* make .T as .t() for onnx conversion

* ran make fix-copies

* remove unneeded comment
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix copies

* remove comment
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

f62cb831

Use commit hash to look in cache instead of calling head (#18534) · 0d0aada5

Sylvain Gugger authored Aug 10, 2022



* Use commit hash to look in cache instead of calling head

* Add tests

* Add attr for local configs too

* Stupid typos

* Fix tests

* Update src/transformers/utils/hub.py
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Address Julien's comments
Co-authored-by: Julien Chaumond <julien@huggingface.co>

0d0aada5

`bitsandbytes` - `Linear8bitLt` integration into `transformers` models (#17901) · 4a51075a

Younes Belkada authored Aug 10, 2022



* first commit

* correct replace function

* add final changes

- works like charm!
- cannot implement tests yet
- tested

* clean up a bit

* add bitsandbytes dependencies

* working version

- added import function
- added bitsandbytes utils file

* small fix

* small fix

- fix import issue

* fix import issues

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* refactor a bit

- move bitsandbytes utils to utils
- change comments on functions

* reformat docstring

- reformat docstring on init_empty_weights_8bit

* Update src/transformers/__init__.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* revert bad formatting

* change to bitsandbytes

* refactor a bit

- remove init8bit since it is useless

* more refactoring

- fixed init empty weights issue
- added threshold param

* small hack to make it work

* Update src/transformers/modeling_utils.py

* Update src/transformers/modeling_utils.py

* revmoe the small hack

* modify utils file

* make style + refactor a bit

* create correctly device map

* add correct dtype for device map creation

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply suggestions

- remove with torch.grad
- do not rely on Python bool magic!

* add docstring

 - add docstring for new kwargs

* add docstring

- comment `replace_8bit_linear` function
- fix weird formatting

* - added more documentation
- added new utility function for memory footprint tracking
- colab demo to add

* few modifs

- typo doc
- force cast into float16 when load_in_8bit is enabled

* added colab link

* add test architecture + docstring a bit

* refactor a bit testing class

* make style + refactor a bit

* enhance checks

- add more checks
- start writing saving test

* clean up a bit

* male style

* add more details on doc

* add more tests

- still needs to fix 2 tests

* replace by "or"

- could not fix it from GitHub GUI
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* refactor a bit testing code + add readme

* make style

* fix import issue

* Update src/transformers/modeling_utils.py
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

* add few comments

* add more doctring + make style

* more docstring

* raise error when loaded in 8bit

* make style

* add warning if loaded on CPU

* add small sanity check

* fix small comment

* add bitsandbytes on dockerfile

* Improve documentation

- improve documentation from comments

* add few comments

* slow tests pass on the VM but not on the CI VM

* Fix merge conflict

* make style

* another test should pass on a multi gpu setup

* fix bad import in testing file

* Fix slow tests

- remove dummy batches
- no more CUDA illegal memory errors

* odify dockerfile

* Update docs/source/en/main_classes/model.mdx

* Update Dockerfile

* Update model.mdx

* Update Dockerfile

* Apply suggestions from code review

* few modifications

- lm head can stay on disk/cpu
- change model name so that test pass

* change test value

- change test value to the correct output
- torch bmm changed to baddmm in bloom modeling when merging

* modify installation guidelines

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* replace `n`by `name`

* merge `load_in_8bit` and `low_cpu_mem_usage`

* first try - keep the lm head in full precision

* better check

- check the attribute `base_model_prefix` instead of computing the number of parameters

* added more tests

* Update src/transformers/utils/bitsandbytes.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Merge branch 'integration-8bit' of https://github.com/younesbelkada/transformers

 into integration-8bit

* improve documentation

- fix typos for installation
- change title in the documentation
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

4a51075a

09 Aug, 2022 3 commits

Adding a new `align_to_words` param to qa pipeline. (#18010) · 9f5fe635

Nicolas Patry authored Aug 09, 2022



* Adding a new `align_to_words` param to qa pipeline.

* Update src/transformers/pipelines/question_answering.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Import protection.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

9f5fe635

Add mt5 onnx config (#18394) · 8cb5ecd9

Thomas Chaigneau authored Aug 09, 2022

* update features

* MT5OnnxConfig added with updated with tests and docs

* fix imports

* fix onnc_config_cls for mt5

Co-authored-by: Thomas Chaigneau <thomas.deeptools.ai>

8cb5ecd9

fix: data2vec-vision Onnx ready-made configuration. (#18427) · fe785730

Niklas Hansson authored Aug 09, 2022

* feat: add the data2vec conf that are missing https://huggingface.co/docs/transformers/serialization

* fix: wrong config

fe785730

08 Aug, 2022 2 commits

Clean up hub (#18497) · 377cdded
Sylvain Gugger authored Aug 08, 2022
```
* Clean up utils.hub

* Remove imports

* More fixes

* Last fix
```
377cdded

[DX fix] Fixing QA pipeline streaming a dataset. (#18516) · a4562552

Nicolas Patry authored Aug 08, 2022

* [DX fix] Fixing QA pipeline streaming a dataset.

QuestionAnsweringArgumentHandler would iterate over the whole dataset
effectively killing all properties of the pipeline.
This restores nice properties when using `Dataset` or `Generator` since
those are meant to be consumed lazily.

* Handling TF better.

a4562552

06 Aug, 2022 1 commit

`transformers-cli login` => `huggingface-cli login` (#18490) · 9129fd03

Julien Chaumond authored Aug 06, 2022

* zero chance anyone's using that constant no?

* `transformers-cli login` => `huggingface-cli login`

* `transformers-cli repo create` => `huggingface-cli repo create`

* `make style`

9129fd03

05 Aug, 2022 6 commits

disable Onnx test for google/long-t5-tglobal-base (#18454) · 38d65604
Yih-Dar authored Aug 05, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
38d65604
Fix `test_dbmdz_english` by updating expected values (#18482) · 280db2e3
Yih-Dar authored Aug 05, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
280db2e3

Use new huggingface_hub tools for download models (#18438) · 5cd40323

Sylvain Gugger authored Aug 05, 2022

* Draft new cached_file

* Initial draft for config and model

* Small fixes

* Fix first batch of tests

* Look in cache when internet is down

* Fix last tests

* Bad black, not fixing all quality errors

* Make diff less

* Implement change for TF and Flax models

* Add tokenizer and feature extractor

* For compatibility with main

* Add utils to move the cache and auto-do it at first use.

* Quality

* Deal with empty commit shas

* Deal with empty etag

* Address review comments

5cd40323

Fix pipeline tests (#18487) · 70fa1a8d
Sylvain Gugger authored Aug 05, 2022
```
* Fix pipeline tests

* Make sure all pipelines tests run with init changes
```
70fa1a8d
Add TF prefix to TF-Res test class (#18481) · 893122f6
Yih-Dar authored Aug 05, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
893122f6

Fixing issue where generic model types wouldn't load properly with the pipeline (#18392) · 586dcf6b

Nicolas Patry authored Aug 05, 2022

* Adding a better error message when the model is improperly configured

within transformers.

* Update src/transformers/pipelines/__init__.py

* Black version.

* Overriding task aliases so that tokenizer+feature_extractor

values are correct.

* Fixing task aliases by overriding their names early

* X.

* Fixing feature-extraction.

* black again.

* Normalizing `translation` too.

* Fixing last few corner cases.

translation need to use its non normalized name (translation_XX_to_YY,
so that the task_specific_params are correctly overloaded).
This can be removed and cleaned up in a later PR.

`speech-encode-decoder` actually REQUIRES to pass a `tokenizer` manually
so the error needs to be discarded when the `tokenizer` is already
there.

* doc-builder fix.

* Fixing the real issue.

* Removing dead code.

* Do not import the actual config classes.

586dcf6b

04 Aug, 2022 3 commits

Add `TF_MODEL_FOR_SEMANTIC_SEGMENTATION_MAPPING` (#18469) · 14928921
Yih-Dar authored Aug 04, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
14928921

Add VideoMAE (#17821) · f9a0008d

NielsRogge authored Aug 04, 2022



* First draft

* Add VideoMAEForVideoClassification

* Improve conversion script

* Add VideoMAEForPreTraining

* Add VideoMAEFeatureExtractor

* Improve VideoMAEFeatureExtractor

* Improve docs

* Add first draft of model tests

* Improve VideoMAEForPreTraining

* Fix base_model_prefix

* Make model take pixel_values of shape (B, T, C, H, W)

* Add loss computation of VideoMAEForPreTraining

* Improve tests

* Improve model testsé

* Make all tests pass

* Add VideoMAE to main README

* Add tests for VideoMAEFeatureExtractor

* Add integration test

* Improve conversion script

* Rename patch embedding class

* Remove VideoMAELayer from init

* Update design of patch embeddings

* Improve comments

* Improve conversion script

* Improve conversion script

* Add conversion of pretrained model

* Add loss verification of pretrained model

* Add loss verification of unnormalized targets

* Add integration test for pretraining model

* Apply suggestions from code review

* Fix bug to make feature extractor resize only shorter edge

* Address more comments

* Improve normalization of videos

* Add doc examples

* Move constants to dedicated script

* Remove scripts

* Transfer checkpoints, fix docs

* Update script

* Update image mean and std

* Fix doc tests

* Set return_tensors to NumPy by default

* Revert the previous change
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

f9a0008d

change shape to support dynamic batch input in tf.function XLA generate for tf serving (#18372) · fc1d841b
nlpcat authored Aug 04, 2022
```
* change shape to support dynamic batch input in tf.generate

* add tests
Co-authored-by: nlpcatcode <nlpcodecat@gmail.com>
```
fc1d841b

03 Aug, 2022 3 commits

fix: keras fit tests for segformer tf and minor refactors. (#18412) · be41eaf5

Sayak Paul authored Aug 03, 2022

* fix: keras fit tests for segformer tf and minor refactors.

* refactor: test_keras_fit to make it simpler using the existing one.

* fix: styling issues.

be41eaf5

Fix failing tests for XLA generation in TF (#18298) · 8fb7c908
Daniel Suess authored Aug 03, 2022
```
* Fix failing test_xla_generate_slow tests

* Fix failing speech-to-text xla_generate tests
```
8fb7c908

support ONNX export of XDropout in deberta{,_v2} and sew_d (#17502) · 9d7b70bc

Gary Miguel authored Aug 03, 2022

* support ONNX export of XDropout in deberta{,_v2}

* black

* copy to sew_d

* add test

* isort

* use pytest.mark.filterwarnings

* review comments

9d7b70bc

02 Aug, 2022 1 commit

Update pipeline word heuristic to work with whitespace in token offsets (#18402) · 042f4203

David authored Aug 02, 2022

* Update pipeline word heuristic to work with whitespace in token offsets

This change checks for whitespace in the input string at either the
character preceding the token or in the first character of the token.
This works with tokenizers that return offsets excluding whitespace
between words or with offsets including whitespace.

fixes #18111

starting

* Use smaller model, ensure expected tokenization

* Re-run CI (please squash)

042f4203

01 Aug, 2022 1 commit

Rewrite push_to_hub to use upload_files (#18366) · 01db72ab

Sylvain Gugger authored Aug 01, 2022

* Rewrite push_to_hub to use upload_files

* Adapt the doc a bit

* Address review comments and clean doc

01db72ab