Commits · cd918492c694bcf4fe8f5ca403f00d1d40ae46ac · chenpangpang / transformers

03 Jan, 2023 14 commits

Fix race condition on cleaning checkpoints when save_total_limit set to 1 (#20989) · cd918492
radcheb authored Jan 03, 2023
```
* Update trainer.py

* fix style
Co-authored-by: Radhwane Chebaane <rchebaane.external@epo.org>
```
cd918492
Improve OWL-ViT postprocessing (#20980) · cd245780
Alara Dirik authored Jan 03, 2023
```
* add post_process_object_detection method

* style changes
```
cd245780
Fix for LXMERT (#20986) · e901914d
Yih-Dar authored Jan 03, 2023
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
e901914d

Avoid CI runs under users' own CircleCI personal account (#20981) · 8f09dd89

Yih-Dar authored Jan 03, 2023



* Avoid null CI

* Avoid null CI

* rename

* more clear error message

* Update .circleci/config.yml
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* clean up
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8f09dd89

Ignore errors when deleting old checkpoints in trainer (#20984) · 7b0727a4
Anna Krogager authored Jan 03, 2023

7b0727a4
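
The commit title above says only that errors are ignored when old checkpoints are deleted. A minimal sketch of that idea follows, assuming the deletion goes through `shutil.rmtree`; the helper name `rotate_checkpoints` and the `checkpoint-<step>` directory layout are hypothetical and are not taken from the commit itself.

```python
import shutil
import tempfile
from pathlib import Path
from typing import List


def rotate_checkpoints(output_dir: Path, save_total_limit: int) -> List[Path]:
    """Delete the oldest checkpoint-<step> directories beyond save_total_limit.

    Hypothetical helper for illustration only; not the Trainer's actual code.
    Assumes every directory is named checkpoint-<integer step>.
    """
    checkpoints = sorted(
        (p for p in output_dir.glob("checkpoint-*") if p.is_dir()),
        key=lambda p: int(p.name.split("-")[-1]),
    )
    to_delete = checkpoints[: max(0, len(checkpoints) - save_total_limit)]
    for path in to_delete:
        # ignore_errors=True: a directory that another process already removed
        # (or that is only partly removable) does not raise and stop the run.
        shutil.rmtree(path, ignore_errors=True)
    return to_delete


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    for step in (500, 1000, 1500):
        (root / f"checkpoint-{step}").mkdir()
    removed = rotate_checkpoints(root, save_total_limit=1)
    print([p.name for p in removed])              # ['checkpoint-500', 'checkpoint-1000']
    print(sorted(p.name for p in root.iterdir())) # ['checkpoint-1500']
```

Swallowing deletion errors trades strictness for robustness: a leftover directory costs disk space, whereas an exception during cleanup would interrupt training.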

Enable `decoder_attention_mask` in `generate` function (#20726) · 15c68c67

samuelpullely authored Jan 03, 2023

* Enable `decoder_attention_mask` in `generate` function

* Make style corrections

* Run `make repo-consistency`

* Add integration test

15c68c67

Fix valid ratio for Deformable Detr (#20958) · a9653400

JeongYeon Nam authored Jan 03, 2023



* fix: valid ratio has right value

* chore: remove unnecessary line
Co-authored-by: Jeongyeon Nam <jy.nam@navercorp.com>

a9653400

[run_clm example] add torch_dtype option for model load. (#20971) · 9c9fe89f

Wang, Yi authored Jan 03, 2023



* [run_clm example] add torch_dtype option for model load.
For the BLOOM 175B model, peak memory for inference is reduced by about 350 GB, since the BLOOM weights on the model hub are stored in bfloat16.
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* add other type in option

* fix style
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

9c9fe89f
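
The figure of about 350 GB quoted in the commit message above can be checked with simple arithmetic. The sketch below assumes the comparison is between loading the weights as float32 (4 bytes per parameter) and as bfloat16 (2 bytes per parameter), and counts weights only; the function name `weight_memory_gb` is hypothetical.

```python
# Bytes needed to store one parameter in each dtype.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2}


def weight_memory_gb(num_params: int, dtype: str) -> float:
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


num_params = 175_000_000_000  # "BLOOM 175B", as stated in the commit message

fp32_gb = weight_memory_gb(num_params, "float32")
bf16_gb = weight_memory_gb(num_params, "bfloat16")

print(fp32_gb)            # 700.0
print(bf16_gb)            # 350.0
print(fp32_gb - bf16_gb)  # 350.0
```

Under these assumptions, keeping the weights in their stored bfloat16 format instead of upcasting to float32 halves the weight footprint, which matches the saving the commit message reports.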

Remove more unused attributes in config classes (#20858) · e697c912

Yih-Dar authored Jan 03, 2023



Remove more unused attributes in config classes
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e697c912

Add GIT (GenerativeImage2Text) (#20295) · 9c6f7485

NielsRogge authored Jan 03, 2023



* First draft

* Make model instantiation work

* Fix copied from statement

* More fixes

* Add correct output head

* Improve configuration

* Add conversion script

* Improve conversion script

* Remove token_type_ids

* Fix conversion of projection layers

* Convert all weights

* Use cats image

* Make logits match

* Generate caption on cats image

* Add GITProcessor

* Update conversion script

* Add support for more checkpoints

* Fix conversion script

* Add initial tests

* Remove cross-attention

* More improvements

* Remove is_decoder

* Improve model tests

* Improve tests

* Improve model outputs

* Fix model outputs equivalence

* Fix more tests

* Remove unused code

* Use generate to generate text, no use of cache for now

* Use generate more appropriately

* Fix config tests

* Fix style

* Add support for use_cache
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Fix style

* Fix GIT vision encoder

* Update README

* Fix integration test

* Set bos and eos token ids

* Improve docs

* Improve code

* Add support for provided attention_mask

* Add copied from statement

* Fix gradient checkpointing test

* Set model_input_names

* Investigate model_input_names

* Remove script

* Fix model inputs

* Fix docstring

* Rename GIT to Git

* Support more models

* Add support for textvqa model

* Add video support

* Extend conversion script for video

* Add support for large variant

* Add support for more models

* Fix config archive map

* Update integration test

* Fix README

* Fix CLIP mean and std

* Update processor

* Fix use_cache for video, thanks @gante

* Remove print statements

* Remove assertion

* Add processor tests

* Fix model_input_names

* Use Auto API for processor

* Fix processor tests

* Fix integration test

* Fix pipeline test

* Make tests faster

* Update conversion script

* Update conversion script

* Convert more checkpoints

* Update conversion script

* Fix typo

* Update docstrings

* Improve code snippets

* Fix doc tests

* Add more code examples

* Fix doc tests

* Add integration tests

* Fix unused variable

* revert

* Add GIT to Japanese README
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

9c6f7485

Fix post_process_object_detection method descriptions (#20977) · 305f41e4
Alara Dirik authored Jan 03, 2023
```
fix post_process_object_detection descriptions
```
305f41e4

`MinNewTokensLengthLogitsProcessor` for `.generate` method #20814 (#20892) · 367fdf33

Konstantin Kotik authored Jan 03, 2023



* feat: add min new length logit processor

* test: add min new length logit processor

* docs: add MinNewTokensLengthLogitsProcessor

* feat: import MinNewTokensLengthLogitsProcessor

* fix: update pytorch dummy objects

* refactor & fix: rename attributes and var and get rid of dynamic attribute

* tests: align test with new interface

* docs: fix typo

* docs: minor clarification

* Empty-Commit

* empty commit

* run automated quality edits
Co-authored-by: Joao Gante <joao@huggingface.co>

367fdf33
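
The processor's name suggests that it enforces a minimum number of newly generated tokens, not counting the prompt. A minimal sketch of that idea in plain Python follows; it is not the library's implementation, and the function name, its parameters, and the use of a plain list of scores are all assumptions made for illustration.

```python
import math
from typing import List


def suppress_eos_until_min_new_tokens(
    scores: List[float],
    input_length: int,
    prompt_length: int,
    min_new_tokens: int,
    eos_token_id: int,
) -> List[float]:
    """Return next-token scores with EOS masked out until enough new tokens exist.

    Hypothetical stand-in for a logits processor: `scores` holds one score per
    vocabulary entry, and only tokens generated after the prompt are counted.
    """
    new_tokens = input_length - prompt_length
    if new_tokens < min_new_tokens:
        scores = list(scores)  # copy so the caller's list is not mutated
        scores[eos_token_id] = -math.inf  # EOS can no longer be selected
    return scores


scores = [0.1, 0.5, 0.4]  # toy vocabulary of three tokens, EOS has id 1

# One new token so far, three required: EOS is masked.
print(suppress_eos_until_min_new_tokens(scores, 5, 4, 3, eos_token_id=1))
# [0.1, -inf, 0.4]

# Three new tokens so far: scores pass through unchanged.
print(suppress_eos_until_min_new_tokens(scores, 7, 4, 3, eos_token_id=1))
# [0.1, 0.5, 0.4]
```

Counting from the end of the prompt, instead of from the start of the sequence, keeps the minimum independent of how long the prompt is.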

Generate: delete unused TF `_reorder_cache` (#20964) · 4fd89e49
Joao Gante authored Jan 03, 2023

4fd89e49
Fix T5 docstring (#20957) · a3e8d3cb
ivanllt authored Jan 03, 2023
```
Fix start_docstring for deparallelize method
```
a3e8d3cb

02 Jan, 2023 1 commit

Generate: TF XLA beam sample (#20927) · 588faad1

Joao Gante authored Jan 02, 2023

* beam sample in beam search

* rag now works with the updated beam search

* delete legacy (non-XLA) generation code related to beam sample

588faad1

31 Dec, 2022 4 commits
- update pyknp to rhoknp (#20890) · 375801d5
  Hao Wang authored Dec 31, 2022
```
* update pyknp to rhoknp

* fix linter

* fix linter

* fix linter

* fix linter

* fix linter

* support rhoknp==1.1.0, fix testcase
```
  375801d5
- Add generate kwargs to `AutomaticSpeechRecognitionPipeline` (#20952) · 092d4d49
  bofeng huang authored Dec 31, 2022
```
* Add generate kwargs to AutomaticSpeechRecognitionPipeline

* Add test for generation kwargs
```
  092d4d49
- Add generate kwargs to `AutomaticSpeechRecognitionPipeline` (#20952) · 47c9b22d
  bofeng huang authored Dec 31, 2022
```
* Add generate kwargs to AutomaticSpeechRecognitionPipeline

* Add test for generation kwargs
```
  47c9b22d
- [trainer: `distributed_concat`] ensure `all_gather`'s inputs are contiguous (#20951) · 9e6da0a7
  Stas Bekman authored Dec 30, 2022
```
[trainer: distributed_concat] ensure all_gather's inputs are contiguous
```
  9e6da0a7
30 Dec, 2022 3 commits
- Fixing DistilBert error message (#20945) · 17292440
  Samuel Xu authored Dec 30, 2022
```
Fixing error message
```
  17292440
- Fix error message in `WhisperFeatureExtractor` (#20936) · 881fa716
  bofeng huang authored Dec 30, 2022
```
* Fix error message

* Fix code quality
```
  881fa716
- Adds type checking to PreTrainedConfig. (#20926) · 491a33d1
  Matthew McDermott authored Dec 30, 2022
  
  491a33d1
29 Dec, 2022 4 commits

Remove Bert tokenizer dependency from DistilBert (slow/fast) tokenizers (#20933) · 8637316e
ivanllt authored Dec 29, 2022

8637316e

Fix FP16 inference in TextGenerationPipeline (#20913) · fe65657d

bofeng huang authored Dec 29, 2022



* add torch_dtype attribute to Pipeline

* Use torch_dtype to cast input tensor type in AutomaticSpeechRecognitionPipeline

* Fix code quality

* Add TextGenerationPipeline fp16 test

* Fix code quality

* Remove useless require in tests
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

fe65657d

Load the state dict on CPU to prevent unnecessary GPU memory surge (#20920) · 11c49ed2
Harsh Trivedi authored Dec 29, 2022
```
load the state dict on cpu.
```
11c49ed2

Remove non-breaking spaces (#20929) · 0b686a8a

Alex Hedges authored Dec 29, 2022

* Remove non-breaking space in comment

It was likely added unintentionally.

* Remove remaining non-breaking spaces

0b686a8a

28 Dec, 2022 2 commits

Generate: correctly detect default max length (#20911) · bbcd9618
Joao Gante authored Dec 28, 2022
```
correctly detect default max length
```
bbcd9618

Avoid collisions in writing metrics via 2 APIs - azureml + mlflow (#20837) · 5f9b2ce0

Akshaya Annavajhala authored Dec 27, 2022

* Avoid collisions in writing metrics via 2 APIs - azureml + mlflow

The MLflow tracking API is enabled by default in AzureML, and the HF MLflow integration is more fully featured. I'd remove the AzureML integration, but I'm leaving the current behavior in place for backwards compatibility (though it should really be removed).

* Trigger CI

5f9b2ce0

27 Dec, 2022 3 commits
- [Past CI] 🔥 Leave Past CI failures in the past 🔥 (#20861) · 5fa0b17c
  Yih-Dar authored Dec 27, 2022
```
* torch.jit._state

* Fix past CI

* Fix for perceiver

* Fix REALM

* Fix for Bloom

* Fix for SwinModel

* Fix for TrajectoryTransformerModel

* Fix for test_wav2vec2_with_lm

* make style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  5fa0b17c
- fix docs typos in "add_new_model" (#20900) · e35bc46a
  Eli Simhayev authored Dec 27, 2022
```
fix Jupyter typos
```
  e35bc46a
- Update flan-t5 original model link (#20897) · d1b30112
  Kamal Raj Kanakarajan authored Dec 27, 2022
```
Update flan-t5.mdx
```
  d1b30112
26 Dec, 2022 2 commits

[`T5`] fix fp16 loading issue (#20878) · accad48e

Younes Belkada authored Dec 26, 2022

* fix fp16 loading issue

* add backward compatibility

* better refactor

* better readability

- remove `force_upcast_dtype` as it is used once
- use `inspect`
- add `TODO`

accad48e

typo fix (#20891) · 47146721
Nathan Barry authored Dec 26, 2022

47146721

24 Dec, 2022 1 commit
- Fixes typo in the help text for --max_length (#20883) · 3830b3f7
  Márton Makrai authored Dec 24, 2022
  
  3830b3f7
23 Dec, 2022 6 commits

[RobertaPreLayerNorm] Fixes the CI daily test (#20886) · a081f292
Arthur authored Dec 23, 2022
```
get correct checkpoint
```
a081f292

Add japanese translation of template (#20870) · cab7799f

Younes Belkada authored Dec 23, 2022



* add japanese translation of template

* fix japanese translation

- fix special cases
- fix typos
- manually translate special cases
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

cab7799f

Add script to convert T5X T5 (v1.0 and v1.1) checkpoints to PyTorch (#20801) · efed8a27

Jasmijn Bastings authored Dec 23, 2022

* Add script to convert T5X T5 (v1.0 and v1.1) checkpoints to PyTorch

* Remove unnecessary check and update docstring

* Format docstring

* Fix whitespace in docstring

efed8a27

Adding support for `fp16` for asr pipeline. (#20864) · f7f0ec2f

Nicolas Patry authored Dec 23, 2022

* Supporting `fp16` for asr pipeline

* Adding test.

* Style.

* Oops.

* Flake8 update ?

* Fixing flake8 ?

* Revert "Flake8 update ?"

This reverts commit 0b917fcb520e5f34d1933d9d37d8f32b64553048.

* Style (accidentally deleted flake8 F401.)

* Move to a bigger test (no small whisper model, and s2t doesn't seem to accept torch_dtype=fp16).

Also we need to use a GPU to actually compute on fp16.

* Using BatchFeature capability.

f7f0ec2f

Add Onnx Config for PoolFormer (#20868) · 15bc776f
Syed Abdul Gaffar Shakhadri authored Dec 23, 2022
```
poolformer onnx
Co-authored-by: syed <syed.abdul@sandlogic.com>
```
15bc776f
having new model entries in Hindi for Hindi README (#20869) · 4a4cd6cd
Sourab Mangrulkar authored Dec 23, 2022

4a4cd6cd