Commits · 4eb36f2921fed7d57aa9ff27a05942bd9402c6f0 · chenpangpang / transformers

14 Sep, 2022 14 commits

Mark right save_load test as slow (#19031) · 4eb36f29
Sylvain Gugger authored Sep 14, 2022

4eb36f29

Add support for Japanese GPT-NeoX-based model by ABEJA, Inc. (#18814) · f5f430e5

Shinya Otani authored Sep 14, 2022

* add gpt-neox-japanese model and tokenizer as new model

* Correction to PR's comment for GPT NeoX Japanese
- Fix to be able to use gpu
- Add comment # Copied... at the top of RotaryEmbedding
- Implement nn.Linear instead of original linear class
- Add generation test under @slow

* fix bias treatment for gpt-neox-japanese

* Modidy gpt-neox-japanese following PR
- add doc for bias_dropout_add
- style change following a PR comment

* add document for gpt-neox-japanese

* remove unused import from gpt-neox-japanese

* fix README for gpt-neox-japanese

f5f430e5

Fix `DocumentQuestionAnsweringPipelineTests` (#19023) · 6a9726ec

Yih-Dar authored Sep 14, 2022



* Fix DocumentQuestionAnsweringPipelineTests
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6a9726ec

Typo fix · 1207deb8
Sylvain Gugger authored Sep 14, 2022

1207deb8
Making save_load test slow as it times out · e1224a2a
Sylvain Gugger authored Sep 14, 2022

e1224a2a
Add Document QA pipeline metadata (#19028) · 0b567aa4
Sylvain Gugger authored Sep 14, 2022

0b567aa4

Fix CI for `PegasusX` (#19025) · 77b18783

Yih-Dar authored Sep 14, 2022



* Skip test_torchscript_output_attentions for PegasusXModelTest

* fix test_inference_no_head

* fix test_inference_head

* fix test_seq_to_seq_generation
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

77b18783

added type hints (#19015) · 77ea35b9
Partho authored Sep 14, 2022

77ea35b9

[CookieCutter] Clarify questions (#18959) · fc21c9be

NielsRogge authored Sep 14, 2022



* Clarify cookiecutter questions

* Update first question
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

fc21c9be

Make AutoProcessor a magic loading class for all modalities (#18963) · 6f8f2f6a
Sylvain Gugger authored Sep 14, 2022
```
* Make AutoProcessor a magic loading class for all modalities

* Quality
```
6f8f2f6a
PyTorch >= 1.7.0 and TensorFlow >= 2.4.0 (#19016) · a2a3afbc
Sylvain Gugger authored Sep 14, 2022

a2a3afbc
Generate: add missing comments after refactoring of generate() (#18981) · 9f4acd05
Ekagra Ranjan authored Sep 14, 2022

9f4acd05

Add Deformable DETR (#17281) · 59407bbe

NielsRogge authored Sep 14, 2022



* First draft

* More improvements

* Improve model, add custom CUDA code

* Import torch before

* Add script that imports custom layer

* Add everything in new ops directory

* Import custom layer in modeling file

* Fix ARCHIVE_MAP typo

* Creating the custom kernel on the fly.

* Import custom layer in modeling file

* More improvements

* Fix CUDA loading

* More improvements

* Improve conversion script

* Improve conversion script

* Make it work until encoder_outputs

* Make forward pass work

* More improvements

* Make logits match original implementation

* Make implementation also support single_scale model

* Add support for single_scale and dilation checkpoint

* Add support for with_box_refine model

* Support also two stage model

* Improve tests

* Fix more tests

* Make more tests pass

* Upload all models to the hub

* Clean up some code

* Improve decoder outputs

* Rename intermediate hidden states and reference points

* Improve model outputs

* Move tests to dedicated folder

* Improve model outputs

* Fix retain_grad test

* Improve docs

* Clean up and make test_initialization pass

* Improve variable names

* Add copied from statements

* Improve docs

* Fix style

* Improve docs

* Improve docs, move tests to model folder

* Fix rebase

* Remove DetrForSegmentation from auto mapping

* Apply suggestions from code review

* Improve variable names and docstrings

* Apply some more suggestions from code review

* Apply suggestion from code review

* better docs and variables names

* hint to num_queries and two_stage confusion

* remove asserts and code refactor

* add exception if two_stage is True and with_box_refine is False

* use f-strings

* Improve docs and variable names

* Fix code quality

* Fix rebase

* Add require_torch_gpu decorator

* Add pip install ninja to CI jobs

* Apply suggestion of @sgugger

* Remove DeformableDetrForObjectDetection from auto mapping

* Remove DeformableDetrModel from auto mapping

* Add model to toctree

* Add model back to mappings, skip model in pipeline tests

* Apply @sgugger's suggestion

* Fix imports in the init

* Fix copies

* Add CPU implementation

* Comment out GPU function

* Undo previous change

* Apply more suggestions

* Remove require_torch_gpu annotator

* Fix quality

* Add logger.info

* Fix logger

* Fix variable names

* Fix initializaztion

* Add missing initialization

* Update checkpoint name

* Add model to doc tests

* Add CPU/GPU equivalence test

* Add Deformable DETR to pipeline tests

* Skip model for object detection pipeline
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

59407bbe

Add Support to Gradient Checkpointing for LongT5 (#18977) · 5a70a77b

Ahmed Elnaggar authored Sep 14, 2022

FlaxLongT5PreTrainedModel is missing "enable_gradient_checkpointing" function. This gives an error if someone tries to enable gradient checkpointing for longt5.
This pull request fixes it.

5a70a77b

13 Sep, 2022 10 commits
- new length penalty docstring (#19006) · 4157e3cd
  Joao Gante authored Sep 13, 2022
  
  4157e3cd
- Re-add support for single url files in objects download (#19014) · f89f16a5
  Sylvain Gugger authored Sep 13, 2022
  
  f89f16a5
- add missing `require_tf` for `TFOPTGenerationTest` (#19010) · ad5045e3
  Yih-Dar authored Sep 13, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  ad5045e3
- add DDP HPO support for optuna (#19002) · d14af22c
  Wang, Yi authored Sep 13, 2022
```
only main_process will have HPO, and pass argument to other process
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
```
  d14af22c
- Fixed bug which caused overwrite_cache to always be True (#19000) · 00fc9217
  Rahul A R authored Sep 13, 2022
```
* fixed bug which caused overwrite_cache to always be True (#18967).

* reformatting changes
```
  00fc9217
- Update default revision for document-question-answering (#18938) · 420f6c5e
  Ankur Goyal authored Sep 13, 2022
```
Co-authored-by: Ankur Goyal <ankur@impira.com>
```
  420f6c5e
- Fix tokenizer for XLMRobertaXL (#19004) · 2886f7f0
  Yih-Dar authored Sep 13, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  2886f7f0
- Add type hints for M2M (#18998) · 2848c9ce
  Partho authored Sep 13, 2022
```
* added type hints

* fixed typo
```
  2848c9ce
- Generate: add model class validation (#18902) · 4bd36f18
  Joao Gante authored Sep 13, 2022
  
  4bd36f18
- Fix MaskFormerFeatureExtractor instance segmentation preprocessing bug (#18997) · 69df33f1
  Alara Dirik authored Sep 13, 2022
```
* fix preprocessing for instance segmentation maps

* add support for per-image instance2class_id mapping

* edit docstrings for clarity
```
  69df33f1
12 Sep, 2022 14 commits
- Removed issue in wav2vec link (#18945) · 470799b3
  Chris Emezue authored Sep 12, 2022
```
Fix connected to [this issue](https://github.com/huggingface/transformers/issues/18944)
```
  470799b3
- Fixed typo (#18921) · 4c2e983f
  Tobias Nusser authored Sep 12, 2022
```
Fixed typo itmes --> items
```
  4c2e983f
- TF: TF 2.10 unpin + related onnx test skips (#18995) · 1182b945
  Joao Gante authored Sep 12, 2022
  
  1182b945
- added type hints (#18996) · 7f4708e1
  Partho authored Sep 12, 2022
  
  7f4708e1
- fix checkpoint name for wav2vec2 conformer (#18994) · 39b5bb79
  Yih-Dar authored Sep 12, 2022
```
* fix checkpoint name for wav2vec2 conformer
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  39b5bb79
- TF: correct TFBart embeddings weights name when load_weight_prefix is passed (#18993) · 8a6928e2
  Joao Gante authored Sep 12, 2022
  
  8a6928e2
- Fix tflongformer int dtype (#18907) · c126a239
  Matt authored Sep 12, 2022
```
* Use int64 throughout TFLongFormer

* make style

* Do some more fixed casting in TFLongFormer

* Fix some wonky "is None" conditionals

* Cast all the dtypes, salt the earth

* Fix copies to TFLED as well and do some casting there

* dtype fix in TFLongformer test

* Make fixup

* Expand tolerances on the LED tests too (I think this is a TF32 thing)

* Expand test tolerances for LED a tiny bit (probably a Tensorfloat thing again)
```
  c126a239
- Align try_to_load_from_cache with huggingface_hub (#18966) · f7ceda34
  Sylvain Gugger authored Sep 12, 2022
```
* Align try_to_load_from_cache with huggingface_hub

* Fix tests
```
  f7ceda34
- Fix TF start docstrings (#18991) · cf450b77
  Matt authored Sep 12, 2022
```
* Update our TF 2.0 input format tip across all models

* make style
```
  cf450b77
- Remove dropout in embedding layer of OPT (#18845) · adbf3a40
  Shijie Wu authored Sep 12, 2022
  
  adbf3a40
- create Past CI results as tables for GitHub issue (#18953) · 36702600
  Yih-Dar authored Sep 12, 2022
```
* create Past CI results as tables for GitHub issue
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  36702600
- Remove `decoder_position_ids` from `check_decoder_model_past_large_inputs` (#18980) · 0b369703
  Yih-Dar authored Sep 12, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  0b369703
- add DDP HPO support for sigopt (#18931) · a86acb75
  Wang, Yi authored Sep 12, 2022
```
only main_process will have HPO, and pass argument to other process
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
```
  a86acb75
- remove unused activation dropout (#18842) · 9faa9f9d
  Shijie Wu authored Sep 12, 2022
  
  9faa9f9d
10 Sep, 2022 2 commits
- Revert "TF: unpin maximum TF version (#18917)" (#18972) · a2611477
  Sylvain Gugger authored Sep 10, 2022
```
This reverts commit d8cf3b20.
```
  a2611477
- TF: unpin maximum TF version (#18917) · d8cf3b20
  Joao Gante authored Sep 10, 2022
  
  d8cf3b20