Commits · f1a6df3210695fa7311b4e8905c520cb738decbb · chenpangpang / transformers

09 Sep, 2022 2 commits

[JAX] Replace all jax.tree_* calls with jax.tree_util.tree_* (#18361) · e6f221c8
Sanchit Gandhi authored Sep 09, 2022
```
* [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_*

* fix double tree_util
```
e6f221c8

add task_type_id to BERT to support ERNIE-2.0 and ERNIE-3.0 models (#18686) · 22f72185

HuYong authored Sep 09, 2022



* add_ernie

* remove Tokenizer in ernie

* polish code

* format code style

* polish code

* fix style

* update doc

* make fix-copies

* change model name

* change model name

* fix dependency

* add more copied from

* rename ErnieLMHeadModel to ErnieForCausalLM
do not expose ErnieLayer
update doc

* fix

* make style

* polish code

* polish code

* fix

* fix

* fix

* fix

* fix

* final fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

22f72185

08 Sep, 2022 1 commit

Add X-CLIP (#18852) · bb6f6d53

NielsRogge authored Sep 08, 2022

* First draft

* Improve conversion script

* Make vision encoder work

* More improvements

* Improve conversion script

* Fix quality

* Add MultiframeIntegrationTransformer

* More improvements

* Make MiT output work

* Fix quality

* Add prompts generator

* Add tests

* Fix some tests

* Fix some more tests

* Fix more tests

* Improve conversion script

* Fix model outputs

* Fix more tests

* Add XClipProcessor

* Use processor in conversion script

* Fix integration test

* Update README, fix docs

* Fix all tests

* Add MIT output to XClipOutput

* Create better variable names

* Rename XClip to XCLIP

* Extend conversion script

* Add support for large models

* Add support for 16 frame models

* Add another model'

* Fix module issue

* Apply suggestions from code review

* Add figure to docs

* Fix CLIPProcessor issue

* Apply suggestions from code review

* Delete file

* Convert more checkpoints

* Convert last checkpoint

* Update nielsr to microsoft

bb6f6d53

07 Sep, 2022 2 commits

Add DocumentQuestionAnswering pipeline (#18414) · 2ef77421

Ankur Goyal authored Sep 07, 2022



* [WIP] Skeleton of VisualQuestionAnweringPipeline extended to support LayoutLM-like models

* Fixup

* Use the full encoding

* Basic refactoring to DocumentQuestionAnsweringPipeline

* Cleanup

* Improve args, docs, and implement preprocessing

* Integrate OCR

* Refactor question_answering pipeline

* Use refactored QA code in the document qa pipeline

* Fix tests

* Some small cleanups

* Use a string type annotation for Image.Image

* Update encoding with image features

* Wire through the basic docs

* Handle invalid response

* Handle empty word_boxes properly

* Docstring fix

* Integrate Donut model

* Fixup

* Incorporate comments

* Address comments

* Initial incorporation of tests

* Address Comments

* Change assert to ValueError

* Comments

* Wrap `score` in float to make it JSON serializable

* Incorporate AutoModeLForDocumentQuestionAnswering changes

* Fixup

* Rename postprocess function

* Fix auto import

* Applying comments

* Improve docs

* Remove extra assets and add copyright

* Address comments
Co-authored-by: Ankur Goyal <ankur@impira.com>

2ef77421

remvoe `_create_and_check_torch_fx_tracing` in specific test files (#18667) · 10c774cf

Yih-Dar authored Sep 07, 2022



* remvoe _create_and_check_torch_fx_tracing defined in specific model test files
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

10c774cf

06 Sep, 2022 2 commits
- Further reduce the number of alls to head for cached objects (#18871) · 71ff88fa
  Sylvain Gugger authored Sep 06, 2022
```
* Further reduce the number of alls to head for cached models/tokenizers/pipelines

* Fix tests

* Address review comments
```
  71ff88fa
- Fix `test_tf_encode_plus_sent_to_model` for `LayoutLMv3` (#18898) · 998a90bc
  Yih-Dar authored Sep 06, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  998a90bc
05 Sep, 2022 2 commits
- Generate: get the correct beam index on eos token (#18851) · d4dbd7ca
  Joao Gante authored Sep 05, 2022
  
  d4dbd7ca
- Correct naming pegasus x (#18896) · badb9d2a
  Patrick von Platen authored Sep 05, 2022
```
* add first generation tutorial

* [Pegasus X] correct naming

* [Generation] Remove
```
  badb9d2a
02 Sep, 2022 4 commits

PEGASUS-X (#18551) · 53e33e6f

Jason Phang authored Sep 02, 2022

* PegasusX Initial commit

* rename

* pegasus X implementation

* pegx update

* pegx fix

* pegasus-x fixes

* pegx updates

* cleanup

* cleanup

* cleanup

* tests

* stylefixes

* Documentation update

* Model hub fix

* cleanup

* update

* update

* testfix

* Check fix

* tweaks for merging

* style

* style

* updates for pr

* style

* change pegasus-x repo

53e33e6f

Generate: validate `model_kwargs` on TF (and catch typos in generate arguments) (#18651) · 9196f48b
Joao Gante authored Sep 02, 2022

9196f48b
Clean up utils.hub using the latest from hf_hub (#18857) · 38c3cd52
Sylvain Gugger authored Sep 02, 2022
```
* Clean up utils.hub using the latest from hf_hub

* Adapt test

* Address review comment

* Fix test
```
38c3cd52
Fix naming issue with ImageToText pipeline (#18864) · 129d7329
OlivierDehaene authored Sep 02, 2022
```
Co-authored-by: Olivier Dehaene <olivier@huggingface.co>
```
129d7329

01 Sep, 2022 3 commits

Add Image To Text Generation pipeline (#18821) · ddb69e5a

OlivierDehaene authored Sep 01, 2022



* Add Image2TextGenerationPipeline to supported pipelines

* Add Flax and Tensorflow support

* Add Flax and Tensorflow small tests

* Add default model for Tensorflow

* Add docstring

* Fix doc style

* Add tiny models for pytorch and flax

* Remove flax from pipeline.
Fix tests

* Use ydshieh/vit-gpt2-coco-en as a default for both PyTorch and Tensorflow

* Fix Tensorflow support
Co-authored-by: Olivier Dehaene <olivier@huggingface.co>

ddb69e5a

TensorFlow MobileViT (#18555) · 954e18ab

Sayak Paul authored Sep 01, 2022



* initial implementation.

* add: working model till image classification.

* add: initial implementation that passes intg tests.
Co-authored-by: Amy <aeroberts4444@gmail.com>

* chore: formatting.

* add: tests (still breaking because of config mismatch).
Coo-authored-by: Yih <2521628+ydshieh@users.noreply.github.com>

* add: corrected tests and remaning changes.

* fix code style and repo consistency.

* address PR comments.

* address Amy's comments.

* chore: remove from_pt argument.

* chore: add full-stop.

* fix: TFLite model conversion in the doc.

* Update src/transformers/models/mobilevit/modeling_tf_mobilevit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_tf_mobilevit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_tf_mobilevit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_tf_mobilevit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_tf_mobilevit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply formatting.

* chore: remove comments from the example block.

* remove identation in the example.
Co-authored-by: Amy <aeroberts4444@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

954e18ab

Generate: smaller TF serving test (#18840) · 6e016634
Joao Gante authored Sep 01, 2022

6e016634

31 Aug, 2022 4 commits

Add SegFormer ONNX support (#18006) · 7e7f7434

NielsRogge authored Aug 31, 2022



* Add ONNX support

* Make height and width dynamic axes
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

7e7f7434

Add an option to `HfArgumentParser.parse_{dict,json_file}` to raise an... · 86387fe8

Felix Schneider authored Aug 31, 2022

Add an option to `HfArgumentParser.parse_{dict,json_file}` to raise an Exception when there extra keys (#18692)

* Update parser to track unneeded keys, off by default

* Fix formatting

* Fix docstrings and defaults in HfArgparser

* Fix formatting

86387fe8

[DETR] Add num_channels attribute (#18714) · 3b6943e7

NielsRogge authored Aug 31, 2022



* Add num_channels attribute

* Fix code quality
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

3b6943e7

Add LayoutLMForQuestionAnswering model (#18407) · 5c4c8690

Ankur Goyal authored Aug 31, 2022



* Add LayoutLMForQuestionAnswering model

* Fix output

* Remove TF TODOs

* Add test cases

* Add docs

* TF implementation

* Fix PT/TF equivalence

* Fix loss

* make fixup

* Fix up documentation code examples

* Fix up documentation examples + test them

* Remove LayoutLMForQuestionAnswering from the auto mapping

* Docstrings

* Add better docstrings

* Undo whitespace changes

* Update tokenizers in comments

* Fixup code and remove `from_pt=True`

* Fix tests

* Revert some unexpected docstring changes

* Fix tests by overriding _prepare_for_class
Co-authored-by: Ankur Goyal <ankur@impira.com>

5c4c8690

30 Aug, 2022 5 commits
- LayoutXLMProcessor: ensure 1-to-1 mapping between samples and images, and add test for it (#18774) · a98f6a1d
  anthony2261 authored Aug 30, 2022
  
  a98f6a1d
- Adds GroupViT to models exportable with ONNX (#18628) · 220da3b8
  Dhruv Karan authored Aug 30, 2022
```
* groupvit to onnx

* dynamic shape for pixel values dim
```
  220da3b8
- Adds OWLViT to models exportable with ONNX (#18588) · 46d0e26a
  Dhruv Karan authored Aug 30, 2022
```
* onnx conversion for owlvit

* .T to .t()

* dynamic shapes for pixel values
```
  46d0e26a
- Run tests if skip condition not met (#18764) · ef91a2d1
  amyeroberts authored Aug 30, 2022
```
* Run tests if skip condition not met

* Update comment - remove outdated ref to TF 2.8
```
  ef91a2d1
- [LayoutLMv3] Add TensorFlow implementation (#18678) · de8548eb
  Christoffer Koo Øhrstrøm authored Aug 30, 2022
```
Co-authored-by: Esben Toke Christensen <esben.christensen@visma.com>
Co-authored-by: Lasse Reedtz <lasse.reedtz@visma.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
```
  de8548eb
29 Aug, 2022 3 commits
- send model to the correct device (#18800) · da5bb292
  Yih-Dar authored Aug 29, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  da5bb292
- Fix mock in `test_cached_files_are_used_when_internet_is_down` (#18804) · 169b8cde
  Lucain authored Aug 29, 2022
  
  169b8cde
- Fix memory leak issue in `torch_fx` tests (#18547) · 8b67f209
  Yih-Dar authored Aug 29, 2022
```
Co-authored-by: Lysandre Debut <hi@lysand.re>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  8b67f209
26 Aug, 2022 2 commits

[Wav2vec2 + LM Test] Improve wav2vec2 with lm tests and make torch version... · 62ceb4d6

Patrick von Platen authored Aug 26, 2022

[Wav2vec2 + LM Test] Improve wav2vec2 with lm tests and make torch version dependent for now (#18749)

* add first generation tutorial

* remove generation

* make version dependent expected values

* Apply suggestions from code review

* Update tests/models/wav2vec2_with_lm/test_processor_wav2vec2_with_lm.py

* fix typo

62ceb4d6

[VisionEncoderDecoder] Add gradient checkpointing (#18697) · 8869bf41
Patrick von Platen authored Aug 26, 2022
```
* add first generation tutorial

* VisionEnocderDecoder gradient checkpointing

* remove generation

* add tests
```
8869bf41

25 Aug, 2022 2 commits

Determine framework automatically before ONNX export (#18615) · fbf382c8

Craig Chan authored Aug 25, 2022



* Automatic detection for framework to use when exporting to ONNX

* Log message change

* Incorporating PR comments, adding unit test

* Adding tf for pip install for run_tests_onnxruntime CI

* Restoring past changes to circleci yaml and test_onnx_v2.py, tests moved to tests/onnx/test_features.py

* Fixup

* Adding test to fetcher

* Updating circleci config to log more

* Changing test class name

* Comment typo fix in tests/onnx/test_features.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* Moving torch_str/tf_str to self.framework_pt/tf

* Remove -rA flag in circleci config
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

fbf382c8

Add ONNX support for Longformer (#17176) · 3223d493

Patrick Deutschmann authored Aug 25, 2022

* Implement ONNX support for Longformer

Fix repo consistency check complaints

Fix value mismatches

Add pooler output for default model

Increase validation atol to accommodate multiple-choice error

Fix copies

Fix chunking for longer sequence lengths

Add future comment

* Fix issue in mask_invalid_locations

* Remove torch imports in configuration_longformer

* Change config access to fix LED

* Push opset version to support tril

* Work in review comments (mostly style)

* Add Longformer to ONNX tests

3223d493

24 Aug, 2022 2 commits

add warning to let the user know that the `__call__` method is faster than... · 6667b0d7

SaulLu authored Aug 24, 2022

add warning to let the user know that the `__call__` method is faster than `encode` + `pad` for a fast tokenizer (#18693)

* add warning to let the user know that the  method is slower that  for a fast tokenizer

* user warnings

* fix layoutlmv2

* fix layout*

* change warnings into logger.warning

6667b0d7

Add TF implementation of `XGLMModel` (#16543) · c72d7d91

Daniel Stancl authored Aug 24, 2022



* Add TFXGLM models 

* Add todo: self.supports_xla_generation = False
Co-authored-by: Daniel Stancl <stancld@Daniels-MacBook-Pro.local>
Co-authored-by: Daniel Stancl <stancld@daniels-mbp.home>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Daniel <daniel.stancl@rossum.ai>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

c72d7d91

22 Aug, 2022 1 commit
- Add missing tokenizer tests - Longformer (#17677) · 0f257a87
  tgadeliya authored Aug 22, 2022
  
  0f257a87
19 Aug, 2022 1 commit
- Generate: add missing `**model_kwargs` in sample tests (#18696) · e95d433d
  Joao Gante authored Aug 19, 2022
  
  e95d433d
18 Aug, 2022 2 commits
- Rename second input dimension from "sequence" to "num_channels" for CV models (#17976) · 76454b08
  regisss authored Aug 18, 2022
  
  76454b08
- Generate: validate model_kwargs on FLAX (and catch typos in generate arguments) (#18653) · a541d974
  Joao Gante authored Aug 18, 2022
  
  a541d974
17 Aug, 2022 2 commits

Update feature extractor methods to enable type cast before normalize (#18499) · 49e44b21

amyeroberts authored Aug 17, 2022

* Update methods to optionally rescale
This is necessary to allow for casting our images / videos to numpy arrays within the feature extractors' call. We want to do this to make sure the behaviour is as expected when flags like  are False. If some transformations aren't applied, then the output type can't be unexpected e.g. a list of PIL images instead of numpy arrays.

* Cast images to numpy arrays in call to enable consistent behaviour with different configs

* Remove accidental clip changes

* Update tests to reflect the scaling logic
We write a generic  function to handle rescaling of our arrays. In order for the API to be intuitive, we take some factor c and rescale the image values by that. This means, the rescaling done in normalize and to_numpy_array are now done with array * (1/255) instead of array / 255. This leads to small differences in the resulting image. When testing, this was in the order of 1e-8, and so deemed OK

49e44b21

Fix Yolos ONNX export test (#18606) · c99e9846

Yih-Dar authored Aug 17, 2022


Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

c99e9846