Commits · 544fd9876b3cea64b83e7eeb8e57501cc464b764 · chenpangpang / transformers

07 Mar, 2022 3 commits
- Support modern list type hints in HfArgumentParser (#15951) · 544fd987
  Konstantin Dobler authored Mar 07, 2022
```
* Support modern list type hint in HfArgumentParser

* Fix formatting with black
```
- remove re-definition of FlaxWav2Vec2ForCTCModule (#15965) · 60b81dfa
  Suraj Patil authored Mar 07, 2022
  
- [Bug Fix] Beam search example in docs fails & a fix (integrating `max_length`... · ef9c3ca3
  Chan Woo Kim authored Mar 07, 2022
```
[Bug Fix] Beam search example in docs fails & a fix (integrating `max_length` in `BeamScorer.finalize()`) (#15555)

* added the test and fix

* had left out a comment
```
04 Mar, 2022 12 commits

made MaskFormerModelTest faster (#15942) · 9932ee4b
Francesco Saverio Zuppichini authored Mar 04, 2022

Move dependency to call method (#15941) · e8efaecb
NielsRogge authored Mar 04, 2022


Constrained Beam Search [*With* Disjunctive Decoding] (#15761) · 5c6f57ee

Chan Woo Kim authored Mar 05, 2022



* added classes to get started with constrained beam search

* in progress, think i can directly force tokens now but not yet with the round robin

* think now i have total control, now need to code the bank selection

* technically works as desired, need to optimize and fix design choices leading to undesirable outputs

* complete PR #1 without disjunctive decoding

* removed incorrect tests

* Delete k.txt

* Delete test.py

* Delete test.sh

* revert changes to test scripts

* genutils

* full implementation with testing, no disjunctive yet

* shifted docs

* passing all tests realistically ran locally

* removing accidentally included print statements

* fixed source of error in initial PR test

* fixing the get_device() vs device trap

* fixed documentation docstrings about constrained_beam_search

* fixed tests failing for Speech2TextModel's floating point inputs

* fix cuda long tensor

* added examples and testing for them and found & fixed a bug in beam_search and constrained_beam_search

* deleted accidentally added test halting code with assert False

* code reformat

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py

* fixing based on comments on PR

* took out the testing code that should work but fails without the beam search modification; style changes

* fixing comments issues

* docstrings for ConstraintListState

* typo in PhrasalConstraint docstring

* docstrings improvements

* finished adding what is sort of an opinionated implementation of disjunctive generation, but it revealed errors in inner beam search logic during testing.

* fixed bug found in constrained beam search that used beam_idx that were not global across all the batches

* disjunctive constraint working 100% correctly

* passing all tests

* Accidentally included mlruns

* Update src/transformers/generation_beam_constraints.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/generation_beam_constraints.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* complete overhaul of type complexities and other nits

* strict type checks in generate()

* fixing second round of feedback by narsil

* fixed failing generation test because of type check overhaul

* generation test fail fix

* fixing test fails
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


Tests for MaskFormerFeatureExtractor's post_process*** methods (#15929) · 040c11f6

Francesco Saverio Zuppichini authored Mar 04, 2022



* proper tests for post_process*** methods in feature extractor

* mask th == 0

* Update tests/maskformer/test_feature_extraction_maskformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* make style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


Do not change the output from tuple to list - to match PT's version (#15918) · f0aacc14

Yih-Dar authored Mar 04, 2022



* Do not change the output from tuple to list - to match PT's version

* Fix the same issues for 5 other models and the template
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>


[FlaxT5 Example] fix flax t5 example pretraining (#15835) · 10b76987
Patrick von Platen authored Mar 04, 2022


Add missing support for Flax XLM-RoBERTa (#15900) · 01485cee

Javier de la Rosa authored Mar 04, 2022



* Adding Flax XLM-RoBERTa

* Add Flax to __init__

* Adding doc and dummy objects

* Add tests

* Add Flax XLM-R models autodoc

* Fix tests

* Add Flax XLM-RoBERTa to TEST_FILES_WITH_NO_COMMON_TESTS

* Update src/transformers/models/xlm_roberta/modeling_flax_xlm_roberta.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Update tests/xlm_roberta/test_modeling_flax_xlm_roberta.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Update tests/xlm_roberta/test_modeling_flax_xlm_roberta.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Remove test on large Flax XLM-RoBERTa

* Add tokenizer to the test
Co-authored-by: Suraj Patil <surajp815@gmail.com>


Making MaskFormerForInstanceSegmentation. (#15934) · 89c7d9cf

Nicolas Patry authored Mar 04, 2022

Small adjustments.

Adding in type hint.

Last fix ?

Only include the default dict thing, not the pipelines.


Updating the slow tests: (#15893) · 7ade7c17
Nicolas Patry authored Mar 04, 2022
```
Linked to https://github.com/huggingface/transformers/pull/15826
```

Support CLIPTokenizerFast for CLIPProcessor (#15913) · 6b104c5b

ParkSangJun authored Mar 04, 2022

* Fix to support fast tokenizer with `CLIPProcessor`

* Update CLIPProcessor test for fast tokenizer

* Fix Docstring Style

* Rename into meaningful Variable name in test code


Update README.md · b7147489
Sanchit Gandhi authored Mar 04, 2022

Re-enabling all fast pipeline tests. (#15924) · a6e3b179
Nicolas Patry authored Mar 04, 2022


03 Mar, 2022 13 commits
- Update README.md (#15926) · a7df656f
  Patrick von Platen authored Mar 04, 2022
  
- Fix #15898 (#15928) · c0281feb
  davidleonfdez authored Mar 03, 2022
  
- Add vision models to doc tests (#15905) · 9251427c
  NielsRogge authored Mar 03, 2022
```
* Add vision models to doc tests

* Apply suggestions from code review

* Add more models
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
```
- fix for the output from post_process_panoptic_segmentation (#15916) · 742273a5
  Francesco Saverio Zuppichini authored Mar 03, 2022
  
- Mark slow tests as slow · 7c45fe74
  Sylvain Gugger authored Mar 03, 2022
  
- Enabling MaskFormer in pipelines (#15917) · 3822e4a5
  Nicolas Patry authored Mar 03, 2022
```
* Enabling MaskFormer in pipelines

No AutoModel though :(

* Ooops local file.
```
- v4.18.0.dev.0 · 79d28e80
  Sylvain Gugger authored Mar 03, 2022
  
- [Doctests] Fix ignore bug and add more doc tests (#15911) · 6cbfa7bf
  Patrick von Platen authored Mar 03, 2022
```
* finish speech doc tests

* finish

* boom

* Update src/transformers/models/speech_to_text/modeling_speech_to_text.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
- The tests were not updated after the addition of `torch.diag` (#15890) · b693cbf9
  Nicolas Patry authored Mar 03, 2022
```
in the scoring (which is more correct)
```
- Freeze FlaxWav2Vec2 Feature Encoder (#15873) · 3c4fbc61
  Sanchit Gandhi authored Mar 03, 2022
```
* Freeze FlaxWav2Vec2 Feature Encoder

* add to all module apply

* add backprop test
```
- Fix and improve REALM fine-tuning (#15297) · 7b3bd1f2
  Li-Huai (Allan) Lin authored Mar 03, 2022
```
* Draft

* Add test

* Update src/transformers/models/realm/modeling_realm.py

* Apply suggestion

* Add block_mask

* Update

* Update

* Add block_embedding_to

* Remove no_grad

* Use AutoTokenizer

* Remove model.to overriding
```
- [Fix link in pipeline doc] (#15906) · 439de3f7
  Patrick von Platen authored Mar 03, 2022
  
- Fix a TF Vision Encoder Decoder test (#15896) · 4cd7ed4b
  Yih-Dar authored Mar 03, 2022
```
* send PyTorch inputs to the correct device

* Fix: TypeError: can't convert cuda:0 device type tensor to numpy. Use Tensor.cpu() to copy the tensor to host memory first.
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
02 Mar, 2022 12 commits

Fix doc links in release utils (#15903) · 39249c95
Sylvain Gugger authored Mar 02, 2022


Update delete-dev-doc job to match build-dev-doc (#15891) · 3d224286

Sylvain Gugger authored Mar 02, 2022

* Update delete-dev-doc job to match build-dev-doc

* More debug info

* More debug info

* Stash if needed

* Remove the comment update

* Fix paths

* Wtf is going on..

* Fix git status test

* Try another way

* I don't understand what's happening

* Bash shell

* What's happening now...

* What's happening now...

* Try like this

* Back to trying to use bash

* And like that?

* Refine tests

* Stash after adding new files

* Stash after adding new files

* Proper commit sha and PR number

* Address review comments


Fix SegformerForImageClassification (#15895) · 89be34c3

NielsRogge authored Mar 02, 2022



* Fix reshape

* Apply suggestion from code review
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>


[XGLM] run sampling test on CPU to be deterministic (#15892) · 130b9878
Suraj Patil authored Mar 02, 2022
```
* run sampling test on CPU to be deterministic

* input_ids on CPU
```

TF generate refactor - Sample (#15793) · baab5e7c

Joao Gante authored Mar 02, 2022



* Add TF logits wrappers 

* Add sample method

* add tests for TF logit wrappers

* TF generate sample tests now run on CPU
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>


[SegFormer] Add deprecation warning (#15889) · 96ae92be

NielsRogge authored Mar 02, 2022



* Add deprecation warning

* Remove from docs and hide in kwargs

* Improve implementation
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>


Fix Bug in FlaxWav2Vec2 Slow Test (#15887) · 8fd47310
Sanchit Gandhi authored Mar 02, 2022


Maskformer (#15682) · d83d22f5

Francesco Saverio Zuppichini authored Mar 02, 2022

* maskformer

* conflicts

* minor fixes

* feature extractor test fix

refactor MaskFormerLoss following conversation

MaskFormer related types should not trigger a module time import error

missed one

removed all the types that are not used

update config mapping

minor updates in the doc

resolved conversation that doesn't need a discussion

minor changes

resolved conversations

fixed DetrDecoder

* minor changes

minor changes

fixed mdx file

test feature_extractor return types

functional losses -> classes

removed the return type test for the feature extractor

minor changes + style + quality

* conflicts?

* rebase master

* readme

* added missing files

* deleted poolformer tests that were in the wrong place

* CI

* minor changes

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* resolved conversations

* minor changes

* conversations

[Unispeech] Fix slow tests (#15818)

* remove soundfile old way of loading audio

* Adapt slow test

[Barthez Tokenizer] Fix saving (#15815)

[TFXLNet] Correct tf xlnet generate (#15822)

* [TFXLNet] Correct tf xlnet

* adapt test comment

Fix the push run (#15807)

Fix semantic segmentation pipeline test (#15826)

Fix dummy_inputs() to dummy_inputs in symbolic_trace doc (#15776)

Add model specific output classes to PoolFormer model docs (#15746)

* Added model specific output classes to poolformer docs

* Fixed Segformer typo in Poolformer docs

Adding the option to return_timestamps on pure CTC ASR models. (#15792)

* Adding the option to return_timestamps on pure CTC ASR models.

* Remove `math.prod` which was introduced in Python 3.8

* int are not floats.

* Reworking the PR to support "char" vs "word" output.

* Fixup!

* Update src/transformers/pipelines/automatic_speech_recognition.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Quality.
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

HFTracer.trace should use/return self.graph to be compatible with torch.fx.Tracer (#15824)

Fix tf.concatenate + test past_key_values for TF models (#15774)

* fix wrong method name tf.concatenate

* add tests related to causal LM / decoder

* make style and quality

* clean-up

* Fix TFBertModel's extended_attention_mask when past_key_values is provided

* Fix tests

* fix copies

* More tf.int8 -> tf.int32 in TF test template

* clean-up

* Update TF test template

* revert the previous commit + update the TF test template

* Fix TF template extended_attention_mask when past_key_values is provided

* Fix some styles manually

* clean-up

* Fix ValueError: too many values to unpack in the test

* Fix more: too many values to unpack in the test

* Add a comment for extended_attention_mask when there is past_key_values

* Fix TFElectra extended_attention_mask when past_key_values is provided

* Add tests to other TF models

* Fix for TF Electra test: add prepare_config_and_inputs_for_decoder

* Fix not passing training arg to lm_head in TFRobertaForCausalLM

* Fix tests (with past) for TF Roberta

* add testing for past_key_values for TFElectra model
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

[examples/summarization and translation] fix readme (#15833)

Add ONNX Runtime quantization for text classification notebook (#15817)

Re-enable doctests for the quicktour (#15828)

* Re-enable doctests for the quicktour

* Re-enable doctests for task_summary (#15830)

* Remove &

Framework split model report (#15825)

Add TFConvNextModel (#15750)

* feat: initial implementation of convnext in tensorflow.

* fix: sample code for the classification model.

* chore: added checked for from the classification model.

* chore: set bias initializer in the classification head.

* chore: updated license terms.

* chore: removed unused imports

* feat: enabled argument during using drop_path.

* chore: replaced tf.identity with layers.Activation(linear).

* chore: edited default checkpoint.

* fix: minor bugs in the initializations.

* partial-fix: tf model errors for loading pretrained pt weights.

* partial-fix: call method updated

* partial-fix: cross loading of weights (4x3 variables to be matched)

* chore: removed unneeded comment.

* removed playground.py

* rebasing

* rebasing and removing playground.py.

* fix: renaming TFConvNextStage conv and layer norm layers

* chore: added initializers and other minor additions.

* add: tests for convnext.

* fix: integration tester class.

* fix: issues mentioned in pr feedback (round 1).

* fix: how output_hidden_states arg is propagated inside the network.

* feat: handling of arg for pure cnn models.

* chore: added a note on equal contribution in model docs.

* rebasing

* rebasing and removing playground.py.

* feat: encapsulation for the convnext trunk.

* Fix variable naming; Test-related corrections; Run make fixup

* chore: added Joao as a contributor to convnext.

* rebasing

* rebasing and removing playground.py.

* rebasing

* rebasing and removing playground.py.

* chore: corrected copyright year and added comment on NHWC.

* chore: fixed the black version and ran formatting.

* chore: ran make style.

* chore: removed from_pt argument from test, ran make style.

* rebasing

* rebasing and removing playground.py.

* rebasing

* rebasing and removing playground.py.

* fix: tests in the convnext subclass, ran make style.

* rebasing

* rebasing and removing playground.py.

* rebasing

* rebasing and removing playground.py.

* chore: moved convnext test to the correct location

* fix: locations for the test file of convnext.

* fix: convnext tests.

* chore: applied sgugger's suggestion for dealing w/ output_attentions.

* chore: added comments.

* chore: applied updated quality environment style.

* chore: applied formatting with quality environment.

* chore: revert to the previous tests/test_modeling_common.py.

* chore: revert to the original test_modeling_common.py

* chore: revert to previous states for test_modeling_tf_common.py and modeling_tf_utils.py

* fix: tests for convnext.

* chore: removed output_attentions argument from convnext config.

* chore: revert to the earlier tf utils.

* fix: output shapes of the hidden states

* chore: removed unnecessary comment

* chore: reverting to the right test_modeling_tf_common.py.

* Styling nits
Co-authored-by: ariG23498 <aritra.born2fly@gmail.com>
Co-authored-by: Joao Gante <joao@huggingface.co>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

* minor changes

* doc fix in feature extractor

* doc

* typos

* removed detr logic from config

* removed num_labels

* small fix in the config

* auxilary -> auxiliary

* make style

* some test is failing

* fix a weird char in config preventing doc-builder

* retry to fix the doc-builder issue

* make style

* new try to fix the doc builder

* CI

* change weights to facebook
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: ariG23498 <aritra.born2fly@gmail.com>
Co-authored-by: Joao Gante <joao@huggingface.co>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>


Fix tiny typo (#15884) · e535c389
Ross Johnstone authored Mar 02, 2022


Updates in Trainer to support new features in SM Model Parallel library (#15877) · 2eb7bb15

Rahul Huilgol authored Mar 02, 2022



* Create optimizer after model creation for SMP

* update dp_rank to rdp_rank for opt_state_dict

* update world_size and process_index for smp

* Address comments

* Lint fix
Co-authored-by: Cavdar <dcavdar@a07817b12d7e.ant.amazon.com>


Update TF QA example (#15870) · 05c237ea
Joao Gante authored Mar 02, 2022

Adding timestamps for CTC with LM in ASR pipeline. (#15863) · 6e57a569
Nicolas Patry authored Mar 02, 2022
```
* Adding timestamps for CTC with LM in ASR pipeline.

* Remove print.

* Nit change.
```