Commits · dc9147ff362e4e69829f64d28178c77cab4bef6f · chenpangpang / transformers

19 Jul, 2022 6 commits

Sylvain Gugger authored Jul 19, 2022



* Initial work

* More work

* Add tests for custom pipelines on the Hub

* Protect import

* Make the test work for TF as well

* Last PyTorch specific bit

* Add documentation

* Style

* Title in toc

* Bad names!

* Update docs/source/en/add_new_pipeline.mdx
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Auto stash before merge of "custom_pipeline" and "origin/custom_pipeline"

* Address review comments

* Address more review comments

* Update src/transformers/pipelines/__init__.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

dc9147ff

[From pretrained] Allow download from subfolder inside model repo (#18184) · 3bb6356d

Patrick von Platen authored Jul 19, 2022



* add first generation tutorial

* [from_pretrained] Allow loading models from subfolders

* remove gen file

* add doc strings

* allow download from subfolder

* add tests

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply comments

* correct doc string
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

3bb6356d

Update docs README with instructions on locally previewing docs (#18196) · ce015281

Snehan Kekre authored Jul 19, 2022

* Update docs README with instructions on locally previewing docs

* Add instructions to install `watchdog` before previewing the docs

ce015281

bugfix: div-->dim (#18135) · 79838446
orgoro authored Jul 19, 2022

79838446
Add vision example to README (#18194) · e630dad5
Sylvain Gugger authored Jul 19, 2022

e630dad5
Remove use_auth_token from the from_config method (#18192) · 4bea6584
Duong A. Nguyen authored Jul 19, 2022
```
* remove use_auth_token from from_config

* restore use_auth_token from_pretrained run_t5_mlm_flax
```
4bea6584

18 Jul, 2022 17 commits

Use smaller variant of BLOOM for doc to fix tests · 29fd4715
Sylvain Gugger authored Jul 18, 2022

29fd4715

FSDP integration enhancements and fixes (#18134) · bc8e30ba

Sourab Mangrulkar authored Jul 19, 2022

* FSDP integration enhancements and fixes

* resolving comments

* fsdp fp16 mixed precision requires `ShardedGradScaler`

bc8e30ba

Translation/training: italian translation training.mdx (#17662) · 8e445ca5

Nicola Procopio authored Jul 18, 2022



* added training.mdx

* updated training.mdx

* updated training.mdx

* updated training.mdx

* updated _toctree.yml

* fixed typos after review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8e445ca5

BLOOM minor fixes small test (#18175) · 6a1b1bf7

Younes Belkada authored Jul 18, 2022



* minor fixes

- add correct revision
- corrected dosctring for test
- removed a test

* contrib credits
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>

6a1b1bf7

Translation italian: multilingual.mdx (#17768) · c4cc8940

Nicola Procopio authored Jul 18, 2022

* added multilingual.mdx

* updated multilingual.mdx

* italian translation multilingual.mdx

* updated _toctree.yml

* fixed typos _toctree.yml

* fixed typos after review

* fixed error after review

c4cc8940

Added preprocessing.mdx italian translation (#17600) · 0a5b61d0

Nicola Procopio authored Jul 18, 2022

* updated _toctree.yml

* added preprocessing

* updated preprocessing.mdx

* updated preprocessing.mdx

updated after review

0a5b61d0

fix typo inside bloom documentation (#18187) · ced1f1f5
SaulLu authored Jul 18, 2022

ced1f1f5
Better default for offload_state_dict in from_pretrained (#18183) · edadfc58
Sylvain Gugger authored Jul 18, 2022

edadfc58
Fix template for new models in README (#18182) · aeeab1ff
Sylvain Gugger authored Jul 18, 2022

aeeab1ff
FIX: Typo (#18156) · 45255814
Ayan Sengupta authored Jul 18, 2022

45255814

Update TF(Vision)EncoderDecoderModel PT/TF equivalence tests (#18073) · 6561fbcc

Yih-Dar authored Jul 18, 2022


Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6561fbcc

Fix expected loss values in some (m)T5 tests (#18177) · cb19c2af
Yih-Dar authored Jul 18, 2022
```
* fix expected loss values
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
cb19c2af

[HPO] update to sigopt new experiment api (#18147) · 7417f3ac

Wang, Yi authored Jul 18, 2022

* [HPO] update to sigopt new experiment api
* follow https://docs.sigopt.com/experiments

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* [HPO] use new API if sigopt version >= 8.0.0
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

7417f3ac

add ONNX support for LeVit (#18154) · 8c14b342
gcheron authored Jul 18, 2022
```
Co-authored-by: Guilhem Chéron <guilhemc@authentifier.com>
```
8c14b342

NLLB tokenizer (#18126) · c1c79b06

Lysandre Debut authored Jul 18, 2022



* NLLB tokenizer

* Apply suggestions from code review - Thanks Stefan!
Co-authored-by: Stefan Schweter <stefan@schweter.it>

* Final touches

* Style :)

* Update docs/source/en/model_doc/nllb.mdx
Co-authored-by: Stefan Schweter <stefan@schweter.it>

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* PR reviews

* Auto models
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c1c79b06

Fix incorrect type hint for lang (#18161) · a4f97e6c
John Giorgi authored Jul 18, 2022

a4f97e6c
Fix check for falsey inputs in run_summarization (#18155) · c46d39f3
John Giorgi authored Jul 18, 2022

c46d39f3

15 Jul, 2022 2 commits

Adding support for `device_map` directly in `pipeline(..)` function. (#17902) · ccc08978

Nicolas Patry authored Jul 15, 2022

* Adding support for `device_map` directly in `pipeline(..)` function.

* Updating the docstring.

* Adding a better docstring

* Put back type hints.

* Blacked. (`make fixup` didn't work ??!!)

ccc08978

Fixing a hard to trigger bug for `text-generation` pipeline. (#18131) · fca66ec4
Nicolas Patry authored Jul 15, 2022
```
* Fixing a bug where attention mask was not passed to generate.

* Fixing zero-size prompts.

* Comment on top.
```
fca66ec4

13 Jul, 2022 9 commits

Add TF DeiT implementation (#17806) · 8581a798

amyeroberts authored Jul 13, 2022



* Initial TF DeiT implementation

* Fix copies naming issues

* Fix up + docs

* Properly same main layer

* Name layers properly

* Initial TF DeiT implementation

* Fix copies naming issues

* Fix up + docs

* Properly same main layer

* Name layers properly

* Fixup

* Fix import

* Fix import

* Fix import

* Fix weight loading for tests whilst not on hub

* Add doc tests and remove to_2tuple

* Add back to_2tuple
Removing to_2tuple results in many downstream changes needed because of the copies checks

* Incorporate updates in Improve vision models #17731 PR

* Don't hard code num_channels

* Copy PyTorch DeiT embeddings and remove pytorch operations with mask

* Fix patch embeddings & tidy up

* Update PixelShuffle to move logic into class layer

* Update doc strings - remove PT references

* Use NHWC format in internal layers

* Fix up

* Use linear activation layer

* Remove unused import

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Move dataclass to top of file

* Remove from_pt now weights on hub

* Fixup
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Amy Roberts <amyeroberts@users.noreply.github.com>

8581a798

Enable torchdynamo with torch_tensorrt(fx path) (#17765) · 7ea6ccc2

Wei authored Jul 13, 2022



* enable fx2trt

* Update perf_train_gpu_one.mdx

* Update perf_train_gpu_one.mdx

* add lib check

* update

* format

* update

* fix import check

* fix isort

* improve doc

* refactor ctx manager

* fix isort

* black format

* isort fix

* fix format

* update args

* update black

* cleanups

* Update perf_train_gpu_one.mdx

* code refactor

* code refactor to init

* remove redundancy

* isort

* replace self.args with args
Co-authored-by: Stas Bekman <stas@stason.org>

7ea6ccc2

Make sharded checkpoints work in offline mode (#18125) · 37aeb578
Sylvain Gugger authored Jul 13, 2022
```
* Make sharded checkpoints work in offline mode

* Add test
```
37aeb578
Revert "Make sharded checkpoints work in offline mode" · 0a21a485
Sylvain Gugger authored Jul 13, 2022
```
This reverts commit 3564c657.
```
0a21a485
Make sharded checkpoints work in offline mode · 3564c657
Sylvain Gugger authored Jul 13, 2022

3564c657

add dataset split and config to model-index in TrainingSummary.from_trainer (#18064) · 56e6487c

lmagne authored Jul 13, 2022



* added metadata to training summary

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

56e6487c

Add summarization name mapping for MultiNews (#18117) · fde22c75
John Giorgi authored Jul 13, 2022
```
* Add summarization name mapping for MultiNews

* Add summarization name mapping for MultiNews
```
fde22c75

supported python versions reference (#18116) · 19513336

Sebastian Sosa authored Jul 13, 2022



* supported python versions reference

* Update CONTRIBUTING.md

removing commit hash from link
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

19513336

TF: unpack_inputs decorator independent from main_input_name (#18110) · 20509ab0
Joao Gante authored Jul 13, 2022

20509ab0

12 Jul, 2022 6 commits

TF: remove graph mode distinction when processing boolean options (#18102) · fcefa200
Joao Gante authored Jul 12, 2022

fcefa200

Fix BLOOM dtype (#17995) · bc34c211

Niklas Muennighoff authored Jul 12, 2022

* Add fp16 option

* Fix BLOOM dtype

* Formatting

* Remove torch_dtype arg

* Revert formatting

* Apply formatting

* Add n_embed backward compat

bc34c211

CLI: reenable `pt_to_tf` test (#18108) · 981714ef
Joao Gante authored Jul 12, 2022

981714ef

Report value for a step instead of epoch. (#18095) · f5221c06

wei zhao authored Jul 12, 2022



* Report value for a step instead of epoch.

Report an objective function value for a step instead of epoch to optuna.
I made this modification for the following reason:
If "eval_steps" is less than steps per epoch, there maybe warnings like this: "optuna/trial/_trial.py:592: UserWarning: The reported value is ignored because this `step` 0 is already reported.". So "step" are more appropriate than "epoch" here.

* MOD: make style.
Co-authored-by: zhaowei01 <zhaowei01@yuanfudao.com>

f5221c06

speed up test (#18106) · d4ebd4e1
Sijun He authored Jul 12, 2022

d4ebd4e1

Enhance IPEX integration in Trainer (#18072) · b7d8bd37

jianan-gu authored Jul 12, 2022



* enhance ipex import

* refine codes

* refine style

* add link

* style
Co-authored-by: Stas Bekman <stas@stason.org>

b7d8bd37