Commits · fc21c9be62483d06adae6239ebe6ca77c2cb6269 · chenpangpang / transformers

14 Sep, 2022 6 commits

[CookieCutter] Clarify questions (#18959) · fc21c9be

NielsRogge authored Sep 14, 2022



* Clarify cookiecutter questions

* Update first question
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

fc21c9be

Make AutoProcessor a magic loading class for all modalities (#18963) · 6f8f2f6a
Sylvain Gugger authored Sep 14, 2022
```
* Make AutoProcessor a magic loading class for all modalities

* Quality
```
6f8f2f6a
PyTorch >= 1.7.0 and TensorFlow >= 2.4.0 (#19016) · a2a3afbc
Sylvain Gugger authored Sep 14, 2022

a2a3afbc
Generate: add missing comments after refactoring of generate() (#18981) · 9f4acd05
Ekagra Ranjan authored Sep 14, 2022

9f4acd05

Add Deformable DETR (#17281) · 59407bbe

NielsRogge authored Sep 14, 2022



* First draft

* More improvements

* Improve model, add custom CUDA code

* Import torch before

* Add script that imports custom layer

* Add everything in new ops directory

* Import custom layer in modeling file

* Fix ARCHIVE_MAP typo

* Creating the custom kernel on the fly.

* Import custom layer in modeling file

* More improvements

* Fix CUDA loading

* More improvements

* Improve conversion script

* Improve conversion script

* Make it work until encoder_outputs

* Make forward pass work

* More improvements

* Make logits match original implementation

* Make implementation also support single_scale model

* Add support for single_scale and dilation checkpoint

* Add support for with_box_refine model

* Support also two stage model

* Improve tests

* Fix more tests

* Make more tests pass

* Upload all models to the hub

* Clean up some code

* Improve decoder outputs

* Rename intermediate hidden states and reference points

* Improve model outputs

* Move tests to dedicated folder

* Improve model outputs

* Fix retain_grad test

* Improve docs

* Clean up and make test_initialization pass

* Improve variable names

* Add copied from statements

* Improve docs

* Fix style

* Improve docs

* Improve docs, move tests to model folder

* Fix rebase

* Remove DetrForSegmentation from auto mapping

* Apply suggestions from code review

* Improve variable names and docstrings

* Apply some more suggestions from code review

* Apply suggestion from code review

* better docs and variables names

* hint to num_queries and two_stage confusion

* remove asserts and code refactor

* add exception if two_stage is True and with_box_refine is False

* use f-strings

* Improve docs and variable names

* Fix code quality

* Fix rebase

* Add require_torch_gpu decorator

* Add pip install ninja to CI jobs

* Apply suggestion of @sgugger

* Remove DeformableDetrForObjectDetection from auto mapping

* Remove DeformableDetrModel from auto mapping

* Add model to toctree

* Add model back to mappings, skip model in pipeline tests

* Apply @sgugger's suggestion

* Fix imports in the init

* Fix copies

* Add CPU implementation

* Comment out GPU function

* Undo previous change

* Apply more suggestions

* Remove require_torch_gpu annotator

* Fix quality

* Add logger.info

* Fix logger

* Fix variable names

* Fix initializaztion

* Add missing initialization

* Update checkpoint name

* Add model to doc tests

* Add CPU/GPU equivalence test

* Add Deformable DETR to pipeline tests

* Skip model for object detection pipeline
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

59407bbe

Add Support to Gradient Checkpointing for LongT5 (#18977) · 5a70a77b

Ahmed Elnaggar authored Sep 14, 2022

FlaxLongT5PreTrainedModel is missing "enable_gradient_checkpointing" function. This gives an error if someone tries to enable gradient checkpointing for longt5.
This pull request fixes it.

5a70a77b

13 Sep, 2022 10 commits
- new length penalty docstring (#19006) · 4157e3cd
  Joao Gante authored Sep 13, 2022
  
  4157e3cd
- Re-add support for single url files in objects download (#19014) · f89f16a5
  Sylvain Gugger authored Sep 13, 2022
  
  f89f16a5
- add missing `require_tf` for `TFOPTGenerationTest` (#19010) · ad5045e3
  Yih-Dar authored Sep 13, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  ad5045e3
- add DDP HPO support for optuna (#19002) · d14af22c
  Wang, Yi authored Sep 13, 2022
```
only main_process will have HPO, and pass argument to other process
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
```
  d14af22c
- Fixed bug which caused overwrite_cache to always be True (#19000) · 00fc9217
  Rahul A R authored Sep 13, 2022
```
* fixed bug which caused overwrite_cache to always be True (#18967).

* reformatting changes
```
  00fc9217
- Update default revision for document-question-answering (#18938) · 420f6c5e
  Ankur Goyal authored Sep 13, 2022
```
Co-authored-by: Ankur Goyal <ankur@impira.com>
```
  420f6c5e
- Fix tokenizer for XLMRobertaXL (#19004) · 2886f7f0
  Yih-Dar authored Sep 13, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  2886f7f0
- Add type hints for M2M (#18998) · 2848c9ce
  Partho authored Sep 13, 2022
```
* added type hints

* fixed typo
```
  2848c9ce
- Generate: add model class validation (#18902) · 4bd36f18
  Joao Gante authored Sep 13, 2022
  
  4bd36f18
- Fix MaskFormerFeatureExtractor instance segmentation preprocessing bug (#18997) · 69df33f1
  Alara Dirik authored Sep 13, 2022
```
* fix preprocessing for instance segmentation maps

* add support for per-image instance2class_id mapping

* edit docstrings for clarity
```
  69df33f1
12 Sep, 2022 14 commits
- Removed issue in wav2vec link (#18945) · 470799b3
  Chris Emezue authored Sep 12, 2022
```
Fix connected to [this issue](https://github.com/huggingface/transformers/issues/18944)
```
  470799b3
- Fixed typo (#18921) · 4c2e983f
  Tobias Nusser authored Sep 12, 2022
```
Fixed typo itmes --> items
```
  4c2e983f
- TF: TF 2.10 unpin + related onnx test skips (#18995) · 1182b945
  Joao Gante authored Sep 12, 2022
  
  1182b945
- added type hints (#18996) · 7f4708e1
  Partho authored Sep 12, 2022
  
  7f4708e1
- fix checkpoint name for wav2vec2 conformer (#18994) · 39b5bb79
  Yih-Dar authored Sep 12, 2022
```
* fix checkpoint name for wav2vec2 conformer
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  39b5bb79
- TF: correct TFBart embeddings weights name when load_weight_prefix is passed (#18993) · 8a6928e2
  Joao Gante authored Sep 12, 2022
  
  8a6928e2
- Fix tflongformer int dtype (#18907) · c126a239
  Matt authored Sep 12, 2022
```
* Use int64 throughout TFLongFormer

* make style

* Do some more fixed casting in TFLongFormer

* Fix some wonky "is None" conditionals

* Cast all the dtypes, salt the earth

* Fix copies to TFLED as well and do some casting there

* dtype fix in TFLongformer test

* Make fixup

* Expand tolerances on the LED tests too (I think this is a TF32 thing)

* Expand test tolerances for LED a tiny bit (probably a Tensorfloat thing again)
```
  c126a239
- Align try_to_load_from_cache with huggingface_hub (#18966) · f7ceda34
  Sylvain Gugger authored Sep 12, 2022
```
* Align try_to_load_from_cache with huggingface_hub

* Fix tests
```
  f7ceda34
- Fix TF start docstrings (#18991) · cf450b77
  Matt authored Sep 12, 2022
```
* Update our TF 2.0 input format tip across all models

* make style
```
  cf450b77
- Remove dropout in embedding layer of OPT (#18845) · adbf3a40
  Shijie Wu authored Sep 12, 2022
  
  adbf3a40
- create Past CI results as tables for GitHub issue (#18953) · 36702600
  Yih-Dar authored Sep 12, 2022
```
* create Past CI results as tables for GitHub issue
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  36702600
- Remove `decoder_position_ids` from `check_decoder_model_past_large_inputs` (#18980) · 0b369703
  Yih-Dar authored Sep 12, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  0b369703
- add DDP HPO support for sigopt (#18931) · a86acb75
  Wang, Yi authored Sep 12, 2022
```
only main_process will have HPO, and pass argument to other process
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
```
  a86acb75
- remove unused activation dropout (#18842) · 9faa9f9d
  Shijie Wu authored Sep 12, 2022
  
  9faa9f9d
10 Sep, 2022 3 commits
- Revert "TF: unpin maximum TF version (#18917)" (#18972) · a2611477
  Sylvain Gugger authored Sep 10, 2022
```
This reverts commit d8cf3b20.
```
  a2611477
- TF: unpin maximum TF version (#18917) · d8cf3b20
  Joao Gante authored Sep 10, 2022
  
  d8cf3b20
- RFC: Replace custom TF embeddings by Keras embeddings (#18939) · 00cbadb8
  Joao Gante authored Sep 10, 2022
  
  00cbadb8
09 Sep, 2022 7 commits

update black target version (#18955) · 855dcae8

Bram Vanroy authored Sep 09, 2022

* update black target version

* add comment

as per https://github.com/huggingface/transformers/pull/18955#issuecomment-1242081649

* revert change

Will only update to 3.7 after black 2023 upgrade in January

855dcae8

Exit early in load if no weights are in the sharded state dict (#18937) · 645f1742
Sylvain Gugger authored Sep 09, 2022

645f1742

Fix train_step, test_step and tests for CLIP (#18684) · 660e0b97

Matt authored Sep 09, 2022

* Fix train_step and test_step, correctly enable CLIP fit test

* Stop using get_args on older Python versions

* Don't use get_origin either

* UnionType is actually even newer, don't use that either

* Apply the same fix to test_loss_computation

* Just realized I was accidentally skipping a bunch of tests!

* Fix test_loss_computation for models without separable labels

* Fix scalar losses in test_step and train_step

* Stop committing your breakpoints

* Fix Swin loss shape

* Fix Tapas loss shape

* Shape fixes for TAPAS, DeIT, HuBERT and ViTMAE

* Add loss computation to TFMobileBertForPreTraining

* make fixup and move copied from statement

* make fixup and move copied from statement

* Correct copied from

* Add labels and next_sentence_label inputs to TFMobileBERT

* Make sure total_loss is always defined

* Update tests/test_modeling_tf_common.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users....

660e0b97

Generate: Simplify is_pad_token_not_equal_to_eos_token_id (#18933) · f1a6df32
Ekagra Ranjan authored Sep 09, 2022

f1a6df32

Neptune.ai integration improvements (#18934) · 85125fcf

Rafał Jankowski authored Sep 09, 2022



* NeptuneCallback improvements

* After review suggestions and deduplication of initial run

* Added volatile checkpoints support due to missing post-rebase commit

* Update README per review comments

- Remove list formatting
- Correct Neptune docs link
Co-authored-by: Sabine <sabine.nyholm@neptune.ai>

85125fcf

[JAX] Replace all jax.tree_* calls with jax.tree_util.tree_* (#18361) · e6f221c8
Sanchit Gandhi authored Sep 09, 2022
```
* [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_*

* fix double tree_util
```
e6f221c8

add task_type_id to BERT to support ERNIE-2.0 and ERNIE-3.0 models (#18686) · 22f72185

HuYong authored Sep 09, 2022



* add_ernie

* remove Tokenizer in ernie

* polish code

* format code style

* polish code

* fix style

* update doc

* make fix-copies

* change model name

* change model name

* fix dependency

* add more copied from

* rename ErnieLMHeadModel to ErnieForCausalLM
do not expose ErnieLayer
update doc

* fix

* make style

* polish code

* polish code

* fix

* fix

* fix

* fix

* fix

* final fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

22f72185