Commits · dcff504e1806467965e2ac1f1e3864cddabaf31f · chenpangpang / transformers

24 Aug, 2022 3 commits

fixed docstring typos (#18739) · dcff504e

Juyoung Kim authored Aug 24, 2022



* fixed docstring typos

* Added missing colon
Co-authored-by: 김주영 <juyoung@zezedu.com>

dcff504e

Add TF implementation of `XGLMModel` (#16543) · c72d7d91

Daniel Stancl authored Aug 24, 2022



* Add TFXGLM models 

* Add todo: self.supports_xla_generation = False
Co-authored-by: Daniel Stancl <stancld@Daniels-MacBook-Pro.local>
Co-authored-by: Daniel Stancl <stancld@daniels-mbp.home>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Daniel <daniel.stancl@rossum.ai>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

c72d7d91

Add minor doc-string change to include hp_name param in hyperparameter_search (#18700) · a442884b

Constantin Hütterer authored Aug 24, 2022

* Add minor doc-string change to include hp_name

* fix: missing type-information for kwargs

* fix: missing white-space in hyperparameter_search doc-strings

a442884b

23 Aug, 2022 4 commits
- CLI: Don't check the model head when there is no model head (#18733) · 6faf2832
  Joao Gante authored Aug 23, 2022
  
  6faf2832
- improve `add_tokens` docstring (#18687) · 43869808
  SaulLu authored Aug 23, 2022
```
* improve add_tokens documentation

* format
```
  43869808
- Removing warning of model type for `microsoft/tapex-base-finetuned-wtq` (#18711) · 891704b3
  Nicolas Patry authored Aug 23, 2022
```
and friends.
```
  891704b3
- Unpin detectron2 (#18727) · 84beb8a4
  Yih-Dar authored Aug 23, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  84beb8a4
22 Aug, 2022 1 commit

Fix Data2VecVision ONNX test (#18587) · 3fa45dbd

Yih-Dar authored Aug 22, 2022


Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

3fa45dbd

19 Aug, 2022 1 commit

Temp fix for broken detectron2 import (#18699) · 1f3c2282

Patrick von Platen authored Aug 19, 2022

* add first generation tutorial

* [Circle CI] Temporary fix for broken detectron2 import

* remove generation

1f3c2282

18 Aug, 2022 7 commits
- Fix breaking change in `onnxruntime` for ONNX quantization (#18336) · d243112b
  Severin Simmler authored Aug 18, 2022
```
* Fix quantization

* Save model

* Remove unused comments

* Fix formatting
```
  d243112b
- Fix repo consistency (#18682) · 5987c637
  lewtun authored Aug 18, 2022
  
  5987c637
- Rename second input dimension from "sequence" to "num_channels" for CV models (#17976) · 76454b08
  regisss authored Aug 18, 2022
  
  76454b08
- Rename method to avoid clash with property (#18677) · 780253ce
  amyeroberts authored Aug 18, 2022
  
  780253ce
- Generate: validate model_kwargs on FLAX (and catch typos in generate arguments) (#18653) · a541d974
  Joao Gante authored Aug 18, 2022
  
  a541d974
- [LongT5] Correct docs long t5 (#18669) · 0ea53822
  Patrick von Platen authored Aug 18, 2022
```
* add first generation tutorial

* [LongT5 Docs] Correct docs

* correct expected string

* remove incorrect file
```
  0ea53822
- Allow users to force TF availability (#18650) · 582c5371
  Matt authored Aug 18, 2022
```
* Allow users to force TF availability

* Correctly name the envvar!
```
  582c5371
17 Aug, 2022 2 commits

Update feature extractor methods to enable type cast before normalize (#18499) · 49e44b21

amyeroberts authored Aug 17, 2022

* Update methods to optionally rescale
This is necessary to allow for casting our images / videos to numpy arrays within the feature extractors' call. We want to do this to make sure the behaviour is as expected when flags like  are False. If some transformations aren't applied, then the output type can't be unexpected e.g. a list of PIL images instead of numpy arrays.

* Cast images to numpy arrays in call to enable consistent behaviour with different configs

* Remove accidental clip changes

* Update tests to reflect the scaling logic
We write a generic  function to handle rescaling of our arrays. In order for the API to be intuitive, we take some factor c and rescale the image values by that. This means, the rescaling done in normalize and to_numpy_array are now done with array * (1/255) instead of array / 255. This leads to small differences in the resulting image. When testing, this was in the order of 1e-8, and so deemed OK

49e44b21

Fix matmul inputs dtype (#18585) · 86d0b26d
Jingya HUANG authored Aug 17, 2022

86d0b26d

16 Aug, 2022 2 commits

TF: Fix generation repetition penalty with XLA (#18648) · fd9aa82b
Joao Gante authored Aug 16, 2022

fd9aa82b

mac m1 `mps` integration (#18598) · 9cf27468

Sourab Mangrulkar authored Aug 16, 2022



* mac m1 `mps` integration

* Update docs/source/en/main_classes/trainer.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* addressing comments

* Apply suggestions from code review
Co-authored-by: Dan Saattrup Nielsen <47701536+saattrupdan@users.noreply.github.com>

* resolve comment
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Dan Saattrup Nielsen <47701536+saattrupdan@users.noreply.github.com>

9cf27468

14 Aug, 2022 1 commit

Flax Remat for LongT5 (#17994) · d6eeb871

Karim Foda authored Aug 14, 2022



* [Flax] Add remat (gradient checkpointing)

* fix variable naming in test

* flip: checkpoint using a method

* fix naming

* fix class naming

* apply PVP's suggestions from code review

* add gradient_checkpointing to examples

* Add gradient_checkpointing to run_mlm_flax

* Add remat to longt5

* Add gradient checkpointing test longt5

* Fix args errors

* Fix remaining tests

* Make fixup & quality fixes

* replace kwargs

* remove unecessary kwargs

* Make fixup changes

* revert long_t5_flax changes

* Remove return_dict and copy to LongT5

* Remove test_gradient_checkpointing
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>

d6eeb871

12 Aug, 2022 10 commits

[fsmt] deal with -100 indices in decoder ids (#18592) · b3ff7c68

Stas Bekman authored Aug 12, 2022

* [fsmt] deal with -100 indices in decoder ids

Fixes: https://github.com/huggingface/transformers/issues/17945

decoder ids get the default index -100, which breaks the model - like t5 and many other models add a fix to replace -100 with the correct pad index. 

For some reason this use case hasn't been used with this model until recently - so this issue was there since the beginning it seems.

Any suggestions to how to add a simple test here? or perhaps we have something similar already? user's script is quite massive.

* style

b3ff7c68

Update BLOOM parameter counts (#18531) · 56ef0ba4
Niklas Muennighoff authored Aug 12, 2022
```
* Update BLOOM parameter counts

* Update BLOOM parameter counts
```
56ef0ba4

Add Donut (#18488) · 2ab790e8

NielsRogge authored Aug 12, 2022



* First draft

* Improve script

* Update script

* Make conversion work

* Add final_layer_norm attribute to Swin's config

* Add DonutProcessor

* Convert more models

* Improve feature extractor and convert base models

* Fix bug

* Improve integration tests

* Improve integration tests and add model to README

* Add doc test

* Add feature extractor to docs

* Fix integration tests

* Remove register_buffer

* Fix toctree and add missing attribute

* Add DonutSwin

* Make conversion script work

* Improve conversion script

* Address comment

* Fix bug

* Fix another bug

* Remove deprecated method from docs

* Make Swin and Swinv2 untouched

* Fix code examples

* Fix processor

* Update model_type to donut-swin

* Add feature extractor tests, add token2json method, improve feature extractor

* Fix failing tests, remove integration test

* Add do_thumbnail for consistency

* Improve code examples

* Add code example for document parsing

* Add DonutSwin to MODEL_NAMES_MAPPING

* Add model to appropriate place in toctree

* Update namespace to appropriate organization
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

2ab790e8

Supporting seq2seq models for `bitsandbytes` integration (#18579) · a5ca56ff

Younes Belkada authored Aug 12, 2022

* Supporting seq2seq models for `bitsandbytes` integration

- `bitsandbytes` integration supports now seq2seq models
- check if a model has tied weights as an additional check

* small modification

- tie the weights before looking at tied weights!

a5ca56ff

Generate: validate `model_kwargs` (and catch typos in generate arguments) (#18261) · ed1924e8
Joao Gante authored Aug 12, 2022
```
* validate generate model_kwargs

* generate tests -- not all models have an attn mask
```
ed1924e8
Add `TFAutoModelForSemanticSegmentation` to the main `__init__.py` (#18600) · 2156619f
Yih-Dar authored Aug 12, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
2156619f
FSDP bug fix for `load_state_dict` (#18596) · 4eed2bec
Sourab Mangrulkar authored Aug 12, 2022

4eed2bec
typos (#18594) · d344534b
Stas Bekman authored Aug 12, 2022

d344534b
Add type hints for ViLT models (#18577) · 46d09410
Ian Castillo authored Aug 12, 2022
```
* Add type hints for Vilt models

* Add missing return type for TokenClassification class
```
46d09410

Load sharded pt to flax (#18419) · bce36ee0

Arthur authored Aug 12, 2022



* initial commit

* add small test

* add cross pt tf flag to test

* fix quality

* style

* update test with new repo

* fix failing test

* update

* fix wrong param ordering

* style

* update based on review

* update related to recent new caching mechanism

* quality

* Update based on review
Co-authored-by: sgugger <sylvain.gugger@gmail.com>

* quality and style

* Update src/transformers/modeling_flax_utils.py
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

bce36ee0

11 Aug, 2022 9 commits

Return the permuted hidden states if return_dict=True (#18578) · c8b6ae85
amyeroberts authored Aug 11, 2022

c8b6ae85
fix owlvit tests, update docstring examples (#18586) · f28f2408
Alara Dirik authored Aug 11, 2022

f28f2408
Fix docstrings with last version of hf-doc-builder styler (#18581) · c23cbdff
Sylvain Gugger authored Aug 11, 2022
```
* Fix docstrings with last version of hf-doc-builder styler

* Remove empty Parameter block
```
c23cbdff

[FX] _generate_dummy_input supports audio-classification models for labels (#18580) · 42b8940b

Michael Benayoun authored Aug 11, 2022

* Support audio classification architectures for labels generation, as well as provides a flag to print warnings or not

* Use ENV_VARS_TRUE_VALUES

42b8940b

Deberta V2: Fix critical trace warnings to allow ONNX export (#18272) · d53dffec

iiLaurens authored Aug 11, 2022



* Fix critical trace warnings to allow ONNX export

* Force input to `sqrt` to be float type

* Cleanup code

* Remove unused import statement

* Update model sew

* Small refactor
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

* Use broadcasting instead of repeat

* Implement suggestion
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

* Match deberta v2 changes in sew_d

* Improve code quality

* Update code quality

* Consistency of small refactor

* Match changes in sew_d
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

d53dffec

Change BartLearnedPositionalEmbedding's forward method signature to support... · 80468251

Dan Jones authored Aug 11, 2022


Change BartLearnedPositionalEmbedding's forward method signature to support Opacus training (#18486)

* changing BartLearnedPositionalEmbedding forward signature and references to it

* removing debugging dead code (thanks style checker)

* blackened modeling_bart file

* removing copy inconsistencies via make fix-copies

* changing references to copied signatures in Bart variants

* make fix-copies once more

* using expand over repeat (thanks @michaelbenayoun)

* expand instead of repeat for all model copies
Co-authored-by: Daniel Jones <jonesdaniel@microsoft.com>

80468251

Fix LayoutLMv3 documentation (#17932) · 4c8ec66a

Wonseok Lee (Jack) authored Aug 11, 2022

* fix typos

* fix sequence_length docs of LayoutLMv3Model

* delete trailing white spaces

* fix layoutlmv3 docs more

* apply make fixup & quality

* change to two versions of input docstring

* apply make fixup & quality

4c8ec66a

Fix resizing bug in OWL-ViT (#18573) · f762f373

Alara Dirik authored Aug 11, 2022

* Fixes resizing bug in OWL-ViT
* Defaults to square resize if size is set to an int
* Sets do_center_crop default value to False

f762f373

Segformer TF: fix output size in documentation (#18572) · 76568d24

Maxime G authored Aug 11, 2022



* Segformer TF: fix output size in doc

* Segformer pytorch: fix output size in doc
Co-authored-by: Maxime Gardoni <maxime.gardoni@ecorobotix.com>

76568d24