Commits · 3cb9309024aeeeba5cf820539119f2f12aa4eac7 · chenpangpang / transformers

19 May, 2023 11 commits

[`Blip`] Remove redundant shift right (#23153) · 3cb93090
Younes Belkada authored May 19, 2023
```
* remove redundant shit right

* fix failing tests

* this time fix tests
```
3cb93090

Fix: Change tensors to integers for torch.dynamo and torch.compile compatibility (#23475) · 847e5691

Dennis Loevlie authored May 19, 2023

* Fix: Change tensors to integers in torch.split() for torch.dynamo and torch.compile compatibility

* Applied the suggested fix to the utils/check_copies.py test

* Applied the suggested fix by changing the original function that gets copied

847e5691

Fix PretrainedConfig `min_length` docstring (#23471) · 389bdba6
joaoareis authored May 19, 2023

389bdba6

Fix parallel mode check (#23409) · b455ad0a

Zachary Mueller authored May 19, 2023

* Fix sagemaker/distributed state

* Fix correctly

* Bring back -1

* Bring back local rank for distributed check

* better version

* Cleanest option

b455ad0a

Fix `transformers`' DeepSpeed CI job (#23463) · db4d7652
Yih-Dar authored May 19, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
db4d7652
Use config to set name and description if not present (#23473) · 2aa0cc2c
Sylvain Gugger authored May 19, 2023
```
Use config to set name and descriptiob if not present
```
2aa0cc2c
[`RWKV`] Rwkv fix for 8bit inference (#23468) · 21bd3be1
Younes Belkada authored May 19, 2023
```
* rwkv fix for 8bit inference

* add comment
```
21bd3be1

TF port of the Segment Anything Model (SAM) (#22970) · 1c460a52

Matt authored May 19, 2023



* First commit

* Add auto-translation with GPT-4

* make fixup

* Add a functional layernorm for TF

* Add all the auxiliary imports etc.

* Add the extra processor and tests

* rebase to main

* Add all the needed fixes to the GPT code

* make fixup

* Make convolutions channels-last so they run on CPU

* make fixup

* Fix final issues

* Fix other models affected by test change

* Clarify comment on the sparse_prompt_embeddings check

* Refactor functional_layernorm, use shape_list in place of .shape in some places

* Remove deprecated torch-alike code

* Update tests/models/sam/test_modeling_tf_sam.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/sam/test_modeling_tf_sam.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Refactor processor with common methods and separated private methods

* make fixup

* Quietly delete the file that didn't do anything (sorry Sylvain)

* Refactor the processor tests into one file

* make fixup

* Clean up some unnecessary indirection

* Fix TF mask postprocessing

* Add more processor equivalence tests

* Refactor generate_crop_boxes to use framework-neutral np code

* Make the serving output correctly conditional

* Fix error message line length

* Use dict keys rather than indices internally in both TF and PT SAM call/forward

* Return dicts internally in the call/forward methods

* Revert changes to common tests and just override check_pt_tf_outputs

* Revert changes to other model tests

* Clarify comments for functional layernorm

* Add missing transpose from PT code

* Removed unused copied from in PT code

* Remove overrides for tests that don't exist in TF

* Fix transpose and update tests for PT and TF to check pred_masks

* Add training flag

* Update tests to use TF checkpoints

* Update index.mdx

* Add missing cross-test decorator

* Remove optional extra asterisks

* Revert return_dict changes in PT code

* Update src/transformers/models/sam/modeling_tf_sam.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Remove None return annotations on init methods

* Update tests/models/sam/test_processor_sam.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Fix input_boxes shapes

* make fixup

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

1c460a52

Remove .data usages in optimizations.py (#23417) · 8aa8513f
Jiewen Tan authored May 19, 2023
```
Patched the optimizers
```
8aa8513f

README: Fix affiliation for MEGA (#23394) · 3cf01b20

Julien Chaumond authored May 19, 2023



* README: Fix affiliation for MEGA

* Fix quality

---------
Co-authored-by: Lysandre <lysandre@huggingface.co>

3cf01b20

feat: Whisper prompting (#22496) · 2acedf47

Connor Henderson authored May 19, 2023

* initial working additions

* clean and rename, add cond stripping initial prompt to decode

* cleanup, edit create_initial_prompt_ids, add tests

* repo consistency, flip order of conditional

* fix error, move the processor fn to the tokenizer

* repo consistency, update test ids to corresponding tokenizer

* use convert_tokens_to_ids not get_vocab...

* use actual conditional in generate

* make sytle

* initial address comments

* initial working add new params to pipeline

* first draft of sequential generation for condition_on_previous_text

* add/update tests, make compatible with timestamps

* make compatible with diff. input kwargs and max length

* add None check

* add temperature check

* flip temp check operand

* refocusing to prev pr scope

* remove the params too

* make style

* edits, move max length incorporating prompt to whisper

* address comments

* remove asr pipeline prompt decoding, fix indexing

* address comments (more tests, validate prompt)

* un-comment out tests (from debug)

* remove old comment

* address comments

* fix typo

* remove timestamp token from test

* make style

* cleanup

* copy method to fast tokenizer, set max_new_tokens for test

* prompt_ids type just pt

* address Amy's comments

* make style

2acedf47

18 May, 2023 15 commits
- fix bug in group_texts function, that was inserting short batches (#23429) · a7920065
  Boda Sadallah authored May 18, 2023
```
* fix bug in group_texts function, that was inserting short batches

* fully exclude short batches and return empty dict instead

* fix style
```
  a7920065
- Clean up CUDA kernels (#23455) · b7b81d93
  Sylvain Gugger authored May 18, 2023
  
  b7b81d93
- Add an option to log result from the Agent (#23454) · 40ed18ae
  Sylvain Gugger authored May 18, 2023
  
  40ed18ae
- add cleanlab to awesome-transformers tools list (#23440) · f69589d1
  Jonas Mueller authored May 18, 2023
```
* add tool to awesome-transformers list

* add keyword list

* sgugger wording suggestion
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  f69589d1
- Properly guard PyTorch stuff (#23452) · 167aa76c
  Sylvain Gugger authored May 18, 2023
```
* Properly guard PyTorch stuff

* [all-test]

* [all-test] Fix model imports as well

* Making sure StoppingCriteria is always defined

* [all-test]
```
  167aa76c
- Update tiny models and pipeline tests (#23446) · ffad4f13
  Yih-Dar authored May 18, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  ffad4f13
- Less flaky `test_assisted_decoding_matches_greedy_search` (#23451) · 2406dbdc
  Yih-Dar authored May 18, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  2406dbdc
- Make `RwkvModel` accept `attention_mask` but discard it internally (#23442) · 21f7e81b
  Yih-Dar authored May 18, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  21f7e81b
- Add local agent (#23438) · cf432008
  Sylvain Gugger authored May 18, 2023
```
* Add local agent

* Document LocalAgent
```
  cf432008
- TF: GPT2 with native embedding layers (#23436) · db136341
  Joao Gante authored May 18, 2023
  
  db136341
- Fix DecisionTransformerConfig doctring (#23450) · c618ab4f
  joaoareis authored May 18, 2023
  
  c618ab4f
- Fix (skip) a pipeline test for `RwkvModel` (#23444) · 5777c3cb
  Yih-Dar authored May 18, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  5777c3cb
- 🌐 [i18n-KO] Translated `tasks/zero_shot_object_detection.mdx` to Korean (#23430) · 8cfae440
  Nayeon Han authored May 18, 2023
```
docs: ko: zero_shot_object_detection
```
  8cfae440
- remove unnecessary print in gpt neox sequence classifier (#23433) · f2d2880b
  Chris Hammill authored May 18, 2023
  
  f2d2880b
- Generate: skip left-padding tests on old models (#23437) · aea7b23b
  Joao Gante authored May 18, 2023
  
  aea7b23b
17 May, 2023 14 commits

Fix device issue in... · a8732e09

Yih-Dar authored May 17, 2023


Fix device issue in `SwiftFormerModelIntegrationTest::test_inference_image_classification_head` (#23435)

fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

a8732e09

Remove hardcoded prints in Trainer (#23432) · 0f2c7382
Hugo Abonizio authored May 17, 2023

0f2c7382
Encoder-Decoder: add informative exception when the decoder is not compatible (#23426) · a574de30
Joao Gante authored May 17, 2023

a574de30

Update Bigbird Pegasus tests (#23431) · 939a65ab

Yih-Dar authored May 17, 2023



* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

939a65ab

TF: embeddings out of bounds check factored into function (#23427) · cf9e7cb0
Joao Gante authored May 17, 2023

cf9e7cb0
Update error message when Accelerate isn't installed (#23373) · 45e3d649
Zachary Mueller authored May 17, 2023
```
Update error
```
45e3d649
Small fixes and link in the README (#23428) · ea0eb156
Lysandre Debut authored May 17, 2023
```
Fix + link
```
ea0eb156

Top 100 (#22912) · 5ba0c332

Lysandre Debut authored May 17, 2023

* Awesome Transformers

* Update

* Update

* Keywords

* Keywords

* Complete document

* Add lm-evaluation-harness

* Edit txtai according to David's comments

* Update awesome-transformers.md

5ba0c332

Add Missing tokenization test [electra] (#22997) · ebb649a4

IMvision12 authored May 17, 2023



* Create test_tokenization_electra.py

* Update tests/models/electra/test_tokenization_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

ebb649a4

[Reland] search model buffers for dtype as the last resort (#23319) · a2789add
cyy authored May 17, 2023
```
search model buffers for dtype as the last resort
```
a2789add

Return early once stop token is found. (#23421) · 3d764fe8

Taras Tsugrii authored May 17, 2023

Previously even after finding a stop token, other stop tokens were considered, which is unnecessary and slows down processing.

Currently, this unnecessary overhead is negligible since there are usually 2 stop tokens considered and they are fairly short, but in future it may become more expensive.

3d764fe8

[`SAM`] fix sam slow test (#23376) · 3d3c7d42
Younes Belkada authored May 17, 2023
```
* fix sam slow test

* oops

* fix error message
```
3d3c7d42
Update 3 docker files to use cu118 (#23406) · 22a07699
Yih-Dar authored May 17, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
22a07699

Use dict.items to avoid unnecessary lookups. (#23415) · a6c9643c

Taras Tsugrii authored May 17, 2023

It's more efficient to iterate over key, value dict pairs instead of iterating over keys and performing value lookups on each iteration. It's also more idiomatic.

a6c9643c