Commits · 90f6fe9155d2f477588d9ba2d7c697a1933e205a · chenpangpang / transformers

07 Sep, 2022 14 commits

Skip some doctests in quicktour (#18927) · 90f6fe91

Steven Liu authored Sep 07, 2022

* skip some code examples for doctests

* make style

* fix code snippet formatting

* separate code snippet into two blocks

90f6fe91

Add image height and width to ONNX dynamic axes (#18915) · 6519150c
lewtun authored Sep 07, 2022

6519150c

Starts on a list of external deps required for dev (#18929) · 737f6ad1

Colin Dean authored Sep 07, 2022



* Starts on a list of external deps required for dev

I've found that I need to install MeCab manually on my AS Mac.

* Generalizes OS nascent dependency list
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

737f6ad1

Fix XLA fp16 and bf16 error checking (#18913) · 63942218

Yanming Wang authored Sep 07, 2022



* Fix XLA fp16 and bf16 error checking

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

63942218

pin TF 2.9.1 for self-hosted CIs (#18925) · 6690ba3f
Yih-Dar authored Sep 07, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
6690ba3f

Add DocumentQuestionAnswering pipeline (#18414) · 2ef77421

Ankur Goyal authored Sep 07, 2022



* [WIP] Skeleton of VisualQuestionAnsweringPipeline extended to support LayoutLM-like models

* Fixup

* Use the full encoding

* Basic refactoring to DocumentQuestionAnsweringPipeline

* Cleanup

* Improve args, docs, and implement preprocessing

* Integrate OCR

* Refactor question_answering pipeline

* Use refactored QA code in the document qa pipeline

* Fix tests

* Some small cleanups

* Use a string type annotation for Image.Image

* Update encoding with image features

* Wire through the basic docs

* Handle invalid response

* Handle empty word_boxes properly

* Docstring fix

* Integrate Donut model

* Fixup

* Incorporate comments

* Address comments

* Initial incorporation of tests

* Address Comments

* Change assert to ValueError

* Comments

* Wrap `score` in float to make it JSON serializable

* Incorporate AutoModelForDocumentQuestionAnswering changes

* Fixup

* Rename postprocess function

* Fix auto import

* Applying comments

* Improve docs

* Remove extra assets and add copyright

* Address comments
Co-authored-by: Ankur Goyal <ankur@impira.com>

2ef77421
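
The step "Wrap `score` in float to make it JSON serializable" can be illustrated with a minimal sketch. It assumes the original score was some numeric type that `json` does not know how to encode; the commit does not say which type, so `fractions.Fraction` is used here purely as a stand-in, and `to_json_answer` is a hypothetical helper, not part of the pipeline.

```python
import json
from fractions import Fraction


def to_json_answer(score, answer):
    """Serialize a QA result, converting the score to a built-in float first.

    Hypothetical helper for illustration only; not the pipeline's actual code.
    """
    return json.dumps({"score": float(score), "answer": answer})


# Stand-in for a numeric type that json cannot encode (an assumption).
raw_score = Fraction(1, 2)

try:
    json.dumps({"score": raw_score})
except TypeError as err:
    print(f"without float(): {err}")

print(to_json_answer(raw_score, "42"))
```

Converting at the point where the result dictionary is built keeps the rest of the code free of serialization concerns.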

[DeepSpeed ZeRO3] Fix performance degradation in sharded models (#18911) · 3059d80d

Olatunji Ruwase authored Sep 07, 2022



* [DeepSpeed] Fix performance degradation in sharded models

* style

* polish
Co-authored-by: Stas Bekman <stas@stason.org>

3059d80d

Remove `_create_and_check_torch_fx_tracing` in specific test files (#18667) · 10c774cf

Yih-Dar authored Sep 07, 2022



* Remove _create_and_check_torch_fx_tracing defined in specific model test files
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

10c774cf

TF: final bias as a layer in seq2seq models (replicate TFMarian fix) (#18903) · 0eabab09
Joao Gante authored Sep 07, 2022

0eabab09

Update TF fine-tuning docs (#18654) · 2b9513fd

Matt authored Sep 07, 2022



* Update TF fine-tuning docs

* Fix formatting

* Add some section headers so the right sidebar works better

* Squiggly it

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/training.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Explain things in the text, not the comments

* Make the two dataset creation methods into a list

* Move the advice about collation out of a <Tip>

* Edits for clarity

* Edits for clarity

* Edits for clarity

* Replace `to_tf_dataset` with `prepare_tf_dataset` in the fine-tuning pages

* Restructure the page a little bit

* Restructure the page a little bit

* Restructure the page a little bit
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

2b9513fd

update the train_batch_size in case HPO changes batch_size_per_device (#18918) · d842f2d5
Wang, Yi authored Sep 07, 2022
```
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
```
d842f2d5
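
The kind of problem this commit title describes can be sketched as follows: a value derived from the per-device batch size is cached once, then goes stale when a hyperparameter search overwrites the per-device value. `ToyArgs`, `ToyTrainer`, and `apply_trial` are hypothetical names invented for this illustration and are not the Trainer's actual code.

```python
from dataclasses import dataclass


@dataclass
class ToyArgs:
    """Hypothetical stand-in for training arguments."""

    per_device_train_batch_size: int
    n_gpu: int = 1

    @property
    def train_batch_size(self) -> int:
        # Total batch size across devices (at least one device is assumed).
        return self.per_device_train_batch_size * max(1, self.n_gpu)


class ToyTrainer:
    """Hypothetical trainer that caches the total batch size."""

    def __init__(self, args: ToyArgs):
        self.args = args
        self._train_batch_size = args.train_batch_size  # cached once

    def apply_trial(self, params: dict) -> None:
        # A hyperparameter search trial overwrites some arguments...
        for name, value in params.items():
            setattr(self.args, name, value)
        # ...so the cached value has to be refreshed afterwards.
        self._train_batch_size = self.args.train_batch_size


trainer = ToyTrainer(ToyArgs(per_device_train_batch_size=8, n_gpu=2))
trainer.apply_trial({"per_device_train_batch_size": 4})
print(trainer._train_batch_size)
```

Without the refresh at the end of `apply_trial`, the cached value would still reflect the batch size from before the trial.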

Accelerator end training (#18910) · 4f299b24

Nicholas Broad authored Sep 07, 2022

* add accelerator.end_training()

Some trackers need this to end their runs.

* fixup and quality

* add space

* add space again ?!?

4f299b24

Add checks for more workflow jobs (#18905) · 7a811894

Yih-Dar authored Sep 07, 2022



* add check for scheduled CI

* Add check to other CIs
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

7a811894

[VideoMAE] Improve code examples (#18919) · c25f27fa
NielsRogge authored Sep 07, 2022
```
* Simplify code example

* Add seed
```
c25f27fa

06 Sep, 2022 8 commits

Fix incorrect size of input for 1st strided window length in `Perplexity of... · 0a632f07

Ekagra Ranjan authored Sep 07, 2022

Fix incorrect size of input for 1st strided window length in `Perplexity of fixed-length models` (#18906)

* update the PPL for stride 512

* fix 1st strided window size

* linting

* fix typo

* styling

0a632f07
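
For context, a strided-window evaluation can be sketched in plain Python. This is a hedged illustration of one way to size the windows so that the first one scores all of its tokens, not necessarily the exact code changed in this commit; `strided_windows` and its parameter names are assumptions made for the example.

```python
def strided_windows(seq_len: int, max_length: int, stride: int):
    """Return (begin, end, target_len) for each evaluation window.

    target_len counts only the tokens not already scored by an earlier
    window, so the first window scores all of its tokens and later
    windows score at most `stride` new tokens each.
    """
    windows = []
    prev_end = 0
    for begin in range(0, seq_len, stride):
        end = min(begin + max_length, seq_len)
        target_len = end - prev_end  # may differ from stride on first/last window
        windows.append((begin, end, target_len))
        prev_end = end
        if end == seq_len:
            break
    return windows


for window in strided_windows(seq_len=10, max_length=4, stride=2):
    print(window)
```

Because each token is scored exactly once, the target lengths add up to the sequence length.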

unpin slack_sdk version (#18901) · 7d5fde99
Yih-Dar authored Sep 06, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
7d5fde99

Further reduce the number of calls to head for cached objects (#18871) · 71ff88fa

Sylvain Gugger authored Sep 06, 2022

* Further reduce the number of calls to head for cached models/tokenizers/pipelines

* Fix tests

* Address review comments

71ff88fa

fixes bugs to handle non-dict output (#18897) · 6678350c
Alara Dirik authored Sep 06, 2022

6678350c
Fix `test_tf_encode_plus_sent_to_model` for `LayoutLMv3` (#18898) · 998a90bc
Yih-Dar authored Sep 06, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
998a90bc

Fix decoder_input_ids to bare T5Model and improve doc (#18791) · f85acb4d

Ekagra Ranjan authored Sep 06, 2022



* use tokenizer to output tensor

* add preprocessing for decoder_input_ids for bare T5Model

* add preprocessing to tf and flax

* linting

* linting

* Update src/transformers/models/t5/modeling_flax_t5.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/models/t5/modeling_tf_t5.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/models/t5/modeling_t5.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

f85acb4d

updating gather function with gather_for_metrics in run_wav2vec2_pretraining (#18877) · 3b19c031
arun99481 authored Sep 06, 2022
```
Co-authored-by: Arun Rajaram <arunrajaram@Aruns-MacBook-Pro.local>
```
3b19c031

Mask t5 relative position bias when heads are pruned (#17968) · 734b7e2a

Had authored Sep 06, 2022



* add position bias head masking if heads pruned

* fix pruning function in t5 encoder

* make style

* make fix-copies

* Revert added folder
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

734b7e2a

05 Sep, 2022 7 commits
- Generate: get the correct beam index on eos token (#18851) · d4dbd7ca
  Joao Gante authored Sep 05, 2022
  
  d4dbd7ca
- Update Chinese documentation (#18893) · c6d3daba
  zkep authored Sep 06, 2022
```
* update the translation
```
  c6d3daba
- Add type hints to XLM-Roberta-XL models (#18475) · cfd623a8
  Sofia Oliveira authored Sep 05, 2022
```
* Add type hints to XLM-Roberta-XL models

* Format
```
  cfd623a8
- Update perf_train_gpu_one.mdx (#18442) · 17c634fd
  Surya Prakash Sahu authored Sep 05, 2022
  
  17c634fd
- Correct naming pegasus x (#18896) · badb9d2a
  Patrick von Platen authored Sep 05, 2022
```
* add first generation tutorial

* [Pegasus X] correct naming

* [Generation] Remove
```
  badb9d2a
- Mention TF and Flax checkpoints (#18894) · 591cfc6c
  Lysandre Debut authored Sep 05, 2022
  
  591cfc6c
- TF: TFMarianMTModel final logits bias as a layer (#18833) · 7f27e002
  Joao Gante authored Sep 05, 2022
```
* bias as a layer

* alias the bias (hah, it rhymes)

* add comment with info
```
  7f27e002
02 Sep, 2022 11 commits

Add Trainer to quicktour (#18723) · 65fb71bc

Steven Liu authored Sep 02, 2022

* 📝 update quicktour

* 📝 add trainer section

* 🖍 markdown table, apply feedbacks

* ✨ make style

* add tf training section

* make style

65fb71bc

Finetune guide for semantic segmentation (#18640) · ae32f3af

Steven Liu authored Sep 02, 2022

* 📝 first draft

* oops add to toctree

* make style

* 📝 add inference section

* 🖍 make style

* 📝 add images

* 🖍 apply feedbacks

* remove num_labels and pytorch block

* apply feedbacks, add colab notebook
Co-authored-by: Steven <stevhliu@gmail.com>

ae32f3af

Update docs landing page (#18590) · bf9d5061

Steven Liu authored Sep 02, 2022

* 📝 update docs landing page

* 🖍 apply feedbacks

* apply feedbacks

* apply feedbacks, use <br> for list

bf9d5061

PEGASUS-X (#18551) · 53e33e6f

Jason Phang authored Sep 02, 2022

* PegasusX Initial commit

* rename

* pegasus X implementation

* pegx update

* pegx fix

* pegasus-x fixes

* pegx updates

* cleanup

* cleanup

* cleanup

* tests

* stylefixes

* Documentation update

* Model hub fix

* cleanup

* update

* update

* testfix

* Check fix

* tweaks for merging

* style

* style

* updates for pr

* style

* change pegasus-x repo

53e33e6f

Remove cached torch_extensions on CI runners (#18868) · ecdf9b06
Yih-Dar authored Sep 02, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
ecdf9b06
A script to download artifacts and perform CI error statistics (#18865) · 4e29b3f8
Yih-Dar authored Sep 02, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
4e29b3f8
Generate: validate `model_kwargs` on TF (and catch typos in generate arguments) (#18651) · 9196f48b
Joao Gante authored Sep 02, 2022

9196f48b
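
The idea of catching typos in keyword arguments can be sketched with the standard library. This is a generic illustration under stated assumptions, not the validation code added in this commit; `validate_model_kwargs` and `toy_forward` are hypothetical names.

```python
import inspect


def validate_model_kwargs(forward_fn, model_kwargs: dict) -> None:
    """Raise ValueError for keyword arguments that forward_fn would not accept.

    Hypothetical helper: compares the supplied names against the function
    signature and skips the check if the function takes **kwargs.
    """
    params = inspect.signature(forward_fn).parameters
    if any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()):
        return  # anything is accepted, so nothing can be flagged
    unused = [name for name in model_kwargs if name not in params]
    if unused:
        raise ValueError(f"These model_kwargs are not used by the model: {unused}")


def toy_forward(input_ids, attention_mask=None):
    """Stand-in for a model's forward function."""
    return input_ids


validate_model_kwargs(toy_forward, {"attention_mask": [1, 1]})  # passes silently

try:
    validate_model_kwargs(toy_forward, {"attention_maks": [1, 1]})  # typo
except ValueError as err:
    print(err)
```

Failing early with the offending names listed is usually easier to debug than a misspelled argument being silently ignored.
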
postpone bnb load until it's needed (#18859) · c5be7cae
Stas Bekman authored Sep 02, 2022

c5be7cae
Fix number of examples for iterable datasets in multiprocessing (#18856) · 9e346f74
Sylvain Gugger authored Sep 02, 2022
```
* Fix number of examples for iterable datasets in multiprocessing

* Add stronger check
```
9e346f74
pin Slack SDK to 3.18.1 to avoid failing issue (#18869) · 0ab465a5
Yih-Dar authored Sep 02, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
0ab465a5
Clean up utils.hub using the latest from hf_hub (#18857) · 38c3cd52
Sylvain Gugger authored Sep 02, 2022
```
* Clean up utils.hub using the latest from hf_hub

* Adapt test

* Address review comment

* Fix test
```
38c3cd52