Commits · 119e3c0fc83db5803d20d0749eef1220f27cfdc8 · chenpangpang / transformers

07 Jun, 2022 8 commits

Chan Woo Kim authored Jun 08, 2022



* added cbs to notebooks, made copy-paste error fix in generation_utils

* initial push for mctc model

* mctc feature extractor done

* added processor, tokenizer and their tests for MCTC. Have added an MCTC modeling test, adjusting model code accordingly.

* added processor, tokenizer and their tests for MCTC. Have added an MCTC modeling test, adjusting model code accordingly.

* passing attention, now struggling to figure out how attention masks make sense here

* works when excluding attention masks. ask later how one would integrate attention maskshere

* bizarre configuration error (model prefix comes first in config dict json and messes up the order)

* all passing but bizzarre config dict ordering issue when to_dict

* passing all major tests

* feature extraction, processor, tokenizer added & tests passing

* style & consistency & other logistical fixes

* copy paste fix

* model after feature extraction working

* commiting final feature extraction results; need to fix normalization

* feature extraction passing tests; probably should add tests on the specific flashlight-copied functions?

* delete print ; format code a bit

* fixing tests

* passing major tests

* fixing styles

* completed tokenization test with real example; not sure if these values are entirely correct.

* last test fixes from local

* reverting accidentally included custom setup configs

* remove load tf weights; fix config error

* testing couldnt import featureextractor

* fix docs

* fix docs

* resolving comments

* style fixes

* style fixes

* Update to MCTCConv1dSubSampler
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* relposemb fixes

* conv1d name issue; expecting config fail with paraentheses

* fix config issue

* fix config issue

* fix config issue

* change everything to MCTCT

* fixing naming change errors

* archive list

* copyrights and docs

* copyrights and docs

* copyrights and docs

* merge resolution

* move tests, fix to changed optionaldependency structure

* test directories changed

* fixing tests

* how to avoid tf tests?

* how to avoid tf tests?

* tests passing locally

* allow mctctprocessor imported any env

* allow mctctprocessor imported any env

* fixed second round of feedback, need to fix docs

* doc changes not being applied

* all fixed

* style fix

* feedback fixes

* fix copies and feature extraction style fix

* Update tests/models/visual_bert/test_modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* copy paste huggingface:main visual bert

* added eof newline to visual bert; all tests are passing otherwise

* fix slow tests by adding attention mask

* change model id to speechbrain

* make fix-copies

* fix readme unwanted deletes

* fixing readmes, make fix-copies

* consistent M-CTC-T naming

* Update src/transformers/models/mctct/__init__.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* all fixed but variable naming

* adjust double quotes

* fixed variable names

* copyright and mr quilter

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* correct slow tests

* make fix-copies

* Update src/transformers/models/mctct/configuration_mctct.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mctct/configuration_mctct.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* m-ctc-t not mctct
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

119e3c0f

quicktour.mdx en -> pt translation (#17074) · 706bb836

Vítor Fróis authored Jun 07, 2022



* Quicktour Portuguese Translation

Translated quicktour.mdx until line 161

* Finished translating quicktour.mdx

Ready to upload and adjust eventual .mdx or translation mistakes.

* Add _toctree.yml and fix nits

* Fixed pt-br mdx syntax problem

Closed <frameworkcontent> instance

* Changed </frameworkcontent> line

* Copied missing block from english version of quicktour.mdx

* Reviwed the entire file once again. It should be working now.
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

706bb836

Fx support for Deberta-v[1-2], Hubert and LXMERT (#17539) · 5c8f6010

Michael Benayoun authored Jun 07, 2022

* Support for deberta and deberta-v2

* Support for LXMert

* Support for Hubert

* Fix for pt1.11

* Trigger CI

5c8f6010

Add examples telemetry (#17552) · 3cab9027

Sylvain Gugger authored Jun 07, 2022

* Add examples telemetry

* Alternative approach

* Add to all other examples

* Add to templates as well

* Put framework separately

* Same for TensorFlow

3cab9027

Skip disk offload test for T5 · 9e72eb44
Sylvain Gugger authored Jun 07, 2022

9e72eb44
Fix gendered sentence in Spanish translation(#17558) · b1187307
Omar U. Espejel authored Jun 07, 2022

b1187307
Fix circular import in onnx.utils (#17577) · b6a65ae5
Sylvain Gugger authored Jun 07, 2022
```
* Fix circular import in onnx.utils

* Add comment for test fetcher

* Here too

* Style
```
b6a65ae5
Use latest stable PyTorch/DeepSpeed for Push & Scheduled CI (#17417) · 9aa230aa
Yih-Dar authored Jun 07, 2022
```
* update versions
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
9aa230aa

06 Jun, 2022 8 commits

Remove circular imports in layoutlm/__init__.py (#17576) · ad719652
regisss authored Jun 06, 2022

ad719652

Add magic method to our TF models to convert datasets with column inference (#17160) · 19a8a303

Matt authored Jun 06, 2022



* Add method to call to_tf_dataset() with column inference

* Add test for dataset creation

* Add a default arg for data collator

* Fix test

* Fix call with non-dev version of datasets

* Test correct column removal too

* make fixup

* More tests to make sure we remove unwanted columns

* Fix test to avoid predicting on unbuilt models

* Fix test to avoid predicting on unbuilt models

* Fix test to remove unwanted head mask columns from inputs

* Stop pushing your debug breakpoints to the main repo of the $2bn company you work for

* Skip the test in convnext because no grouped conv support

* Drop bools from the dataset dict

* Make style

* Skip the training test for models whose input dicts don't give us labels

* Skip transformerXL in the test because it doesn't return a simple loss

* Skip TFTapas because of some odd NaN losses

* make style

* make fixup

* Add docstring

* fixup

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Remove breakpoint from tests

* Fix assert, add requires_backends

* Protect tokenizer import with if TYPE_CHECKING

* make fixup

* Add noqa, more fixup

* More rearranging for ~* aesthetics *~

* Adding defaults for shuffle and batch_size to match to_tf_dataset()

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

19a8a303

[deepspeed / testing] reset global state (#17553) · d28b7aa8
Stas Bekman authored Jun 06, 2022
```
* [deepspeed] fix load_best_model test

* [deepspeed] add state reset on unittest tearDown
```
d28b7aa8

Translation/italian: added pipeline_tutorial.mdx [Issue: #17459] (#17507) · 34a886fc

Nicola Procopio authored Jun 06, 2022

* added toctree.yml file

* first translation

* added pipeline_tutorial.mdx translation

added pipeline_tutorial.mdx
updated _toctree.yml

* updated pipeline_tutorial.mdx

* updated _toctree.yml

Updated preprocessing and training

* updated preprocessing.mdx

start translation

* Update _toctree.yml

* Delete preprocessing.mdx

* Update _toctree.yml

* updated _toctree.yml

* added preprocessing

* Update _toctree.yml

* updated _toctree.yml

* undo

* Revert "undo"

This reverts commit 5d38d768752dc80918bf60ada9d185f98b742520.

* Revert "Revert "undo""

This reverts commit 8aa0830b587f915ca7d154ebca282b782e82bd92.

34a886fc

Remove RuntimeErrors for NaN-checking in 20B (#17563) · 2e37ef35
Jason Phang authored Jun 06, 2022

2e37ef35

Add installation.mdx Italian translation (#17530) · f6ad0e05

Martina Fumanelli authored Jun 06, 2022

* Add the Italian translation of the file installation.mdx and edit _toctree

* Add the Italian translation of the file installation.mdx and edit _toctree

f6ad0e05

Adding the Portuguese version of the tasks/token_classification.mdx documentation (#17492) · 4aed1dc8

Jonatas Grosman authored Jun 06, 2022

* add tasks/token_classification pt doc structure

* add tasks/token_classification pt doc translation

* add tasks/token_classification pt doc translation

4aed1dc8

fix integration test levit (#17555) · da71df1a
Anugunj Naman authored Jun 06, 2022

da71df1a

03 Jun, 2022 9 commits

[deepspeed] fix load_best_model test (#17550) · 26e5e129
Stas Bekman authored Jun 03, 2022

26e5e129

Update index.mdx (#17547) · 72f5b949

Britney Muller authored Jun 03, 2022

This PR updates our Expert Acceleration Program image with a new image featuring our experts.

This is similar to our Transformers/README.md image update that has proven to be successful.

72f5b949

Clean imports to fix test_fetcher (#17531) · c4e58cd8

Sylvain Gugger authored Jun 03, 2022



* Clean imports to fix test_fetcher

* Add dependencies printer

* Update utils/tests_fetcher.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* Fix Perceiver import
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

c4e58cd8

Update run_glue_no_trainer.py (#17546) · 254d9c06
bhuang authored Jun 03, 2022

254d9c06
Fix all offload and MP tests (#17533) · 83439012
Sylvain Gugger authored Jun 03, 2022

83439012
Fix bug - layer names and activation from previous refactor (#17524) · 1c57242d
amyeroberts authored Jun 03, 2022
```
* Fix activation and layers in MLP head

* Remove unused import
```
1c57242d

Add support for Perceiver ONNX export (#17213) · babeff55

Patrick Deutschmann authored Jun 03, 2022



* Start adding perceiver support for ONNX

* Fix pad token bug for fast tokenizers

* Fix formatting

* Make get_preprocesor more opinionated (processor priority, otherwise tokenizer/feature extractor)

* Clean docs format

* Minor cleanup following @sgugger's comments

* Fix typo in docs

* Fix another docs typo

* Fix one more typo in docs

* Update src/transformers/onnx/utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/onnx/utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/onnx/utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

babeff55

Allow from transformers import TypicalLogitsWarper (#17477) · 5c17918f

Robert Dargavel Smith authored Jun 03, 2022

* Allow from transformers import TypicalLogitsWarper

* Added TypicalLogitsWarper

* Allow from transformers import TypicalLogitsWarper

* Allow from transformers import TypicalLogitsWarper

* Allow from transformers import TypicalLogitsWarper

* Allow from transformers import TypicalLogitsWarper

Added TypicalLogitsWarper

Allow from transformers import TypicalLogitsWarper

Allow from transformers import TypicalLogitsWarper

Allow from transformers import TypicalLogitsWarper

5c17918f

Add Gated-SiLU to T5 (#17420) · 607acd4f

DanielHesslow authored Jun 03, 2022



* Add gated-silu to t5 architecture to support UL2

* Fix error message

* formatting

* formatting again

* refactor

* fix classnames in _init_weights

* remove is_gated

* add test

* fix test

* Try without the test?

* Add back the test.

* Improve error message.
Co-authored-by: Daniel Hesslow <daniel@lighton.ai>

607acd4f

02 Jun, 2022 11 commits
- Update URL for Hub PR docs (#17532) · 1c220ced
  lewtun authored Jun 02, 2022
  
  1c220ced
- fix OPT-Flax CI tests (#17512) · 013462c5
  Arthur authored Jun 02, 2022
  
  013462c5
- [trainer/deepspeed] load_best_model (reimplement re-init) (#17151) · 2f59ad16
  Stas Bekman authored Jun 02, 2022
```
* [trainer/deepspeed] load_best_model

* to sync with DS PR #1947

* simplify

* rework load_best_model test

* cleanup

* bump deepspeed>=0.6.5
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
```
  2f59ad16
- Implemented loss for training AudioFrameClassification (#17513) · 046c5ea9
  Moreno La Quatra authored Jun 02, 2022
```
* Implemented loss for training AudioFrameClassification

* reported changes in wav2vec2 main class and used make copies to propagate

* running black for code formatting
```
  046c5ea9
- Update configuration_auto.py (#17527) · 085321c9
  Kamal Raj authored Jun 02, 2022
  
  085321c9
- Check list of models in the main README and sort it (#17517) · 048dd73b
  Sylvain Gugger authored Jun 02, 2022
```
* Script for README

* Fix copies

* Complete error message
```
  048dd73b
- Fix when Accelerate is not installed (#17518) · 588d8f1f
  Sylvain Gugger authored Jun 02, 2022
  
  588d8f1f
- Clean README in post release job as well. (#17519) · f128ccb9
  Sylvain Gugger authored Jun 02, 2022
  
  f128ccb9
- Fix CI tests hang forever (#17471) · 216499bf
  Yih-Dar authored Jun 02, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  216499bf
- Print more library versions in CI (#17384) · 659b27fd
  Yih-Dar authored Jun 02, 2022
```
* print more lib. versions and just befor test runs

* update print_env_pt.py

* rename to print_env

* Disable warning + better job name

* print python version
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  659b27fd
- Split push CI into 2 workflows (#17369) · 0932adb3
  Yih-Dar authored Jun 02, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  0932adb3
01 Jun, 2022 4 commits
- Fix Tapas tests (#17510) · 58fb3c9f
  Yih-Dar authored Jun 01, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  58fb3c9f
- CLI: tool to convert PT into TF weights and open hub PR (#17497) · ca1f1c86
  Joao Gante authored Jun 01, 2022
  
  ca1f1c86
- Fix flakey no-trainer test (#17515) · 3766df4f
  Zachary Mueller authored Jun 01, 2022
  
  3766df4f
- Deal with the error when task is regression (#16330) · 028d4b7c
  fireindark707 authored Jun 01, 2022
  
  028d4b7c