Commits · 98dba52ccd713b6a821a597ac2aa50b6d145dcdf · chenpangpang / transformers

08 Jan, 2024 4 commits

Bugfix / ffmpeg input device (mic) not working on Windows (#27051) · 98dba52c

Ondrej Major authored Jan 08, 2024

* fix input audio device for windows.

* ffmpeg audio device Windows

* Fixes wrong input device assignment in Windows

* Fixed getting mic on Windows systems by adding _get_microphone_name() function.

98dba52c

remove two deprecated function (#28220) · 7d9d5cea
Hz, Ji authored Jan 08, 2024

7d9d5cea
Fix building alibi tensor when num_heads is not a power of 2 (#28380) · 0c2121f9
Mohamed Abu El-Nasr authored Jan 08, 2024
```
* Fix building alibi tensor when num_heads is not a power of 2

* Remove print function
```
0c2121f9
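
For readers unfamiliar with the problem named in the commit title above, here is a minimal sketch of one commonly used way to build ALiBi slopes when the head count is not a power of 2. It is not the code from this commit; the function name `alibi_slopes` is hypothetical, and the interleaving scheme (extra heads take every other slope from the next power of 2) is assumed from the widely circulated ALiBi reference approach.

```python
import math
from typing import List


def alibi_slopes(num_heads: int) -> List[float]:
    """Per-head ALiBi slopes, including head counts that are not powers of 2.

    Hypothetical helper for illustration; not taken from the commit.
    """
    # Largest power of 2 that does not exceed num_heads.
    closest_power_of_2 = 2 ** math.floor(math.log2(num_heads))
    # Geometric sequence for the first closest_power_of_2 heads.
    base = 2.0 ** (-(2.0 ** -(math.log2(closest_power_of_2) - 3)))
    slopes = [base ** i for i in range(1, closest_power_of_2 + 1)]

    if closest_power_of_2 != num_heads:
        # The remaining heads take every other slope from the sequence that a
        # model with 2 * closest_power_of_2 heads would use.
        extra_base = 2.0 ** (-(2.0 ** -(math.log2(2 * closest_power_of_2) - 3)))
        num_remaining = min(closest_power_of_2, num_heads - closest_power_of_2)
        slopes += [extra_base ** i for i in range(1, 2 * num_remaining, 2)]
    return slopes
```

With 12 heads, the first 8 slopes match the 8-head case and the last 4 are drawn from the 16-head sequence, so the result always has exactly `num_heads` entries.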

Enhancing Code Readability and Maintainability with Simplified Activation... · 53cffeb3

Chi authored Jan 08, 2024


Enhancing Code Readability and Maintainability with Simplified Activation Function Selection. (#28349)

* Little bit change code in get_activation()

* proper area to define gelu_activation() in these two files

* Fix github issue

* Fix some typos

* My mistake: used self to call config

* Reformat my two files

* Update src/transformers/activations.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/electra/modeling_electra.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/convbert/modeling_convbert.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Rename gelu_act to activation

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

53cffeb3

07 Jan, 2024 1 commit
- [Phi2] Add support for phi2 models (#28211) · 3eddda11
  Susnato Dhar authored Jan 07, 2024
```
* modified script and added test for phi2

* changes
```
  3eddda11
05 Jan, 2024 6 commits

chore: Fix typo s/exclusivelly/exclusively/ (#28361) · 4ab5fb89
hugo-syn authored Jan 05, 2024

4ab5fb89

Update VITS modeling to enable ONNX export (#28141) · 7226f3d2

Ella Charlaix authored Jan 05, 2024

* Update vits modeling for onnx export compatibility

* fix style

* Update src/transformers/models/vits/modeling_vits.py

7226f3d2

fix FA2 when using quantization for remaining models (#28341) · cadf93a6

Susnato Dhar authored Jan 05, 2024



* fix fa2 autocasting when using quantization

* Update src/transformers/models/distilbert/modeling_distilbert.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/distilbert/modeling_distilbert.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

cadf93a6

[DETA] Improvement and Sync from DETA especially for training (#27990) · 899d8351

Sangbum Daniel Choi authored Jan 05, 2024



* [DETA] fix freeze/unfreeze function

* Update src/transformers/models/deta/modeling_deta.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/deta/modeling_deta.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* add freeze/unfreeze test case in DETA

* fix type

* fix typo 2

* fix : enable aux and enc loss in training pipeline

* Add unsynced variables from original DETA for training

* modification for passing CI test

* make style

* make fix

* manual make fix

* change deta_modeling_test of configuration 'two_stage' default to TRUE and minor change of dist checking

* remove print

* divide configuration in DetaModel and DetaForObjectDetection

* image smaller size than 224 will give topk error

* pred_boxes and logits should be equivalent to two_stage_num_proposals

* add missing part in DetaConfig

* Update src/transformers/models/deta/modeling_deta.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add docstring in configure and prettify TO DO part

* change distribute related code to accelerate

* Update src/transformers/models/deta/configuration_deta.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/deta/test_modeling_deta.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* protect importing accelerate

* change variable name to specific value

* wrong import

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

899d8351

Fix pos_mask application and update tests accordingly (#27892) · 57e9c832

Fernando Rodriguez Sanchez authored Jan 05, 2024



* Fix pos_mask application and update tests accordingly

* Fix style

* Adding comments

---------
Co-authored-by: Fernando Rodriguez <fernando.rodriguez@nielseniq.com>

57e9c832

Don't check the device when device_map=auto (#28351) · 03b98099

yuanwu2017 authored Jan 05, 2024

When running the case on a multi-card server with device_map=auto, the model will not always be allocated to device 0,
because other processes may be using these cards. It will select the devices that can accommodate this model.
Signed-off-by: yuanwu <yuan.wu@intel.com>

03b98099

04 Jan, 2024 3 commits

README: install transformers from conda-forge channel (#28313) · 5d36025c

Kevin Herro authored Jan 04, 2024

Switch to the conda-forge channel for transformers installation,
as the huggingface channel does not offer the latest version.

Fixes #28248

5d36025c

Fix error in M4T feature extractor (#28340) · 35e9d2b2

Yoach Lacombe authored Jan 04, 2024

* fix M4T FE error when no attention mask

* modify logic

* add test

* go back to initial test situation + add other tests

35e9d2b2

enable training mask2former and maskformer for transformers trainer (#28277) · 4a66c0d9
Sangbum Daniel Choi authored Jan 04, 2024
```
* fix get_num_masks output as [int] to int

* fix loss size from torch.Size([1]) to torch.Size([])
```
4a66c0d9

03 Jan, 2024 6 commits

[docs] Sort es/toctree.yml | Translate performance.md (#28262) · 6b8ec258

Aaron Jimenez authored Jan 03, 2024

* Sort es/_toctree.yml like en/_toctree.yml

* Run make style

* Add -Rendimiento y escalabilidad- section to es/_toctree.yml

* Run make style

* Add s to section

* Add translate of performance.md

* Add performance.md to es/_toctree.yml

* Run make style

* Fix docs links

* Run make style

6b8ec258

Translate contributing.md into Chinese (#28243) · 3ea88336
Mayfsz authored Jan 04, 2024
```
* Translate contributing.md into Chinese

* Update review comments
```
3ea88336

Remove token_type_ids from model_input_names (like #24788) (#28325) · 45b1dfa3

Apsod authored Jan 03, 2024

* remove token_type_ids from model_input_names (like #24788)

* removed test that assumed token_type_ids should be present and updated a model reference so that it points to an available model

45b1dfa3

Add FastSpeech2Conformer (#23439) · d83ff5ee

Connor Henderson authored Jan 03, 2024

* start - docs, SpeechT5 copy and rename

* add relevant code from FastSpeech2 draft, have tests pass

* make it an actual conformer, demo ex.

* matching inference with original repo, includes debug code

* refactor nn.Sequentials, start more desc. var names

* more renaming

* more renaming

* vocoder scratchwork

* matching vocoder outputs

* hifigan vocoder conversion script

* convert model script, rename some config vars

* replace postnet with speecht5's implementation

* passing common tests, file cleanup

* expand testing, add output hidden states and attention

* tokenizer + passing tokenizer tests

* variety of updates and tests

* g2p_en pckg setup

* import structure edits

* docstrings and cleanup

* repo consistency

* deps

* small cleanup

* forward signature param order

* address comments except for masks and labels

* address comments on attention_mask and labels

* address second round of comments

* remove old unneeded line

* address comments part 1

* address comments pt 2

* rename auto mapping

* fixes for failing tests

* address comments part 3 (bart-like, train loss)

* make style

* pass config where possible

* add forward method + tests to WithHifiGan model

* make style

* address arg passing and generate_speech comments

* address Arthur comments

* address Arthur comments pt2

* lint  changes

* Sanchit comment

* add g2p-en to doctest deps

* move up self.encoder

* onnx compatible tensor method

* fix is symbolic

* fix paper url

* move models to espnet org

* make style

* make fix-copies

* update docstring

* Arthur comments

* update docstring w/ new updates

* add model architecture images

* header size

* md wording update

* make style

d83ff5ee

fix documentation for zero_shot_object_detection (#28267) · 6eba901d
lain authored Jan 03, 2024
```
remove broken space
```
6eba901d

Bump tj-actions/changed-files from 22.2 to 41 in /.github/workflows (#28311) · c2d283a6

dependabot[bot] authored Jan 03, 2024

Bumps [tj-actions/changed-files](https://github.com/tj-actions/changed-files) from 22.2 to 41.
- [Release notes](https://github.com/tj-actions/changed-files/releases)
- [Changelog](https://github.com/tj-actions/changed-files/blob/main/HISTORY.md)
- [Commits](https://github.com/tj-actions/changed-files/compare/v22.2...v41)

---
updated-dependencies:
- dependency-name: tj-actions/changed-files
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

c2d283a6

02 Jan, 2024 5 commits

Remove fast tokenization warning in Data Collators (#28213) · aa4a0f8e
Daniel Bustamante Ospina authored Jan 02, 2024

aa4a0f8e

[Whisper] Fix errors with MPS backend introduced by new code on word-level... · 5be46dfc

Marco Carosi authored Jan 02, 2024


[Whisper] Fix errors with MPS backend introduced by new code on word-level timestamps computation (#28288)

* Update modeling_whisper.py to support MPS backend

Fixed some issues with the MPS backend.

First, torch.std_mean is not implemented on MPS and is not scheduled for implementation, while the separate torch.std and torch.mean are.
Second, the MPS backend does not support float64, so it cannot cast from float32 to float64. Swapping the order so that double() is called once the matrix is on the CPU fixes the issue and should not change the logic.

* Found another instruction in modeling_whisper.py not implemented by MPS

After a load test, where I transcribed a 2-hour audio file, I hit a branch that I had not fixed in the previous commit.
The fix is similar: torch.std_mean is replaced by torch.std and torch.mean.

* Update modeling_whisper.py removed trailing white spaces

Removed trailing white spaces

* Update modeling_whisper.py to use is_torch_mps_available()

Using is_torch_mps_available() instead of capturing the NotImplemented exception

* Update modeling_whisper.py sorting the import block

Sorting the utils import block

* Update src/transformers/models/whisper/modeling_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

5be46dfc

fix bug:divide by zero in _maybe_log_save_evaluate() (#28251) · 87ae2a46
frankenliu authored Jan 02, 2024
```
Co-authored-by: liujizhong1 <liujizhong1@xiaomi.com>
```
87ae2a46
Fix trainer saving safetensors: metadata is None (#28219) · 502a10a6
hoshi-hiyouga authored Jan 02, 2024
```
* Update trainer.py

* format
```
502a10a6
Update docs around mixing hf scheduler with deepspeed optimizer (#28223) · cad9f5c6
Dean Wyatte authored Jan 02, 2024
```
update docs around mixing hf scheduler with deepspeed optimizer
```
cad9f5c6

26 Dec, 2023 2 commits
- small typo (#28229) · 3cefac1d
  Stas Bekman authored Dec 26, 2023
```
Update modeling_utils.py
```
  3cefac1d
- fix FA2 when using quantization (#28203) · 3b7675b2
  Sourab Mangrulkar authored Dec 26, 2023
  
  3b7675b2
25 Dec, 2023 1 commit
- [`Awq`] Enable the possibility to skip quantization for some target modules (#27950) · fa21ead7
  Younes Belkada authored Dec 25, 2023
```
* v1

* add docstring

* add tests

* add awq 0.1.8

* oops

* fix test
```
  fa21ead7
22 Dec, 2023 12 commits

[`Llava`] Fix llava index errors (#28032) · 29e7a1e1

Younes Belkada authored Dec 22, 2023



* fix llava index errors

* forward contrib credits from original implementation and fix

* better fix

* final fixes and fix all tests

* fix

* fix nit

* fix tests

* add regression tests

---------
Co-authored-by: gullalc <gullalc@users.noreply.github.com>

29e7a1e1

update the logger message with accordant weights_file_name (#28181) · 68fa1e85
lin yudong authored Dec 22, 2023
```
Co-authored-by: yudong.lin <yudong.lin@funplus.com>
```
68fa1e85

Fixing visualization code for object detection to support both types of bounding box. (#27842) · 74d9d0ce

Anindyadeep authored Dec 22, 2023



* fix: minor enhancement and fix in bounding box visualization example

The example that was trying to visualize the bounding box was not considering an edge case,
where the bounding box can be un-normalized. So using the same set of code, we can not get
results with a different dataset with un-normalized bounding box. This commit fixes that.

* run make clean

* add an additional note on the scenarios where the box viz code works

---------
Co-authored-by: Anindyadeep <anindya@pop-os.localdomain>

74d9d0ce
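
The edge case described in the commit above (boxes that may or may not be normalized) can be sketched as follows. This is an illustration only, not the code from the example being fixed: the helper name `to_pixel_box`, the `(x1, y1, x2, y2)` corner format, and the "any coordinate above 1.0 means pixels" heuristic are all assumptions.

```python
from typing import Sequence, Tuple


def to_pixel_box(
    box: Sequence[float], width: int, height: int
) -> Tuple[float, float, float, float]:
    """Return a box in pixel coordinates, whatever form it arrived in.

    Hypothetical helper: assumes (x1, y1, x2, y2) corners and treats any
    coordinate above 1.0 as a sign that the box is already in pixels.
    """
    x1, y1, x2, y2 = box
    if max(box) > 1.0:
        # Un-normalized box: coordinates are already pixels, keep them.
        return (float(x1), float(y1), float(x2), float(y2))
    # Normalized box: scale fractions of the image size up to pixels.
    return (x1 * width, y1 * height, x2 * width, y2 * height)


print(to_pixel_box([0.25, 0.5, 0.75, 1.0], width=100, height=200))
print(to_pixel_box([10, 20, 50, 60], width=100, height=200))
```

The heuristic is deliberately simple; a pixel box that lies entirely within the top-left pixel would be misread as normalized, so a dataset-level flag is safer when one is available.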

[Whisper] Fix word-level timestamps with bs>1 or num_beams>1 (#28114) · 5da3db3f

Yoach Lacombe authored Dec 22, 2023



* fix frames

* use smaller chunk length

* correct beam search + tentative stride

* fix whisper word timestamp in batch

* add test batch generation with return token timestamps

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* clean a test

* make style + correct typo

* write clearer comments

* explain test in comment

---------
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

5da3db3f

Drop `feature_extractor_type` when loading an image processor file (#28195) · c4df7c16
Yih-Dar authored Dec 22, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
c4df7c16

Fix the check of models supporting FA/SDPA not run (#28202) · bb3bd447

Yih-Dar authored Dec 22, 2023



* add check_support_list.py

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

bb3bd447

Bug: `training_args.py` fix missing import with accelerate with version... · e37ab52d

Michael Feil authored Dec 22, 2023

Bug: `training_args.py` fix missing import with accelerate with version `accelerate==0.20.1` (#28171)

* fix-accelerate-version

* updated with exported ACCELERATE_MIN_VERSION,

* update string in ACCELERATE_MIN_VERSION

e37ab52d

Add Swinv2 backbone (#27742) · c9fb250a

NielsRogge authored Dec 22, 2023

* First draft

* More improvements

* More improvements

* Make all tests pass

* Remove script

* Update image processor

* Address comments

* Use new gradient checkpointing method

* Convert checkpoints, add integration test

* Do not keep aspect ratio for now

* Set keep_aspect_ratio=False for beit, add integration test

* Remove print statement

c9fb250a

Fix: [SeamlessM4T - S2TT] Bug in batch loading of audio in torch.Tensor format... · 1ef86c4f

Nicholas Neo authored Dec 22, 2023


Fix: [SeamlessM4T - S2TT] Bug in batch loading of audio in torch.Tensor format in the SeamlessM4TFeatureExtractor class (#27914)

* fixes: code fixes on is_batched condition to also check for batched audio data in torch.Tensor format instead of only just checking for batched audio data in np.ndarray format

* Update src/transformers/models/seamless_m4t/feature_extraction_seamless_m4t.py
Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>

* refactor: code refactoring to remove torch framework dependency

* docs: updated docstring to add torch tensor compatibility

* test: add test cases to incorporate torch tensor inputs

* test: ran make fix-copies for code conformity

* test: refactor test to separate the test_call into test_call_numpy and test_call_torch

---------
Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>

1ef86c4f
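
A batched-input check that accepts array-like inputs without importing any particular framework could look roughly like this. It is a sketch under assumptions, not the feature extractor's actual logic: the function name `is_batched` here is a stand-alone hypothetical helper, and relying on a `.shape` attribute (duck typing) is an assumed way to cover both `np.ndarray` and `torch.Tensor`.

```python
from typing import Any


def is_batched(raw_speech: Any) -> bool:
    """Return True when the input holds several audio clips rather than one.

    Hypothetical helper: uses duck typing on `.shape` so that no array
    framework has to be imported.
    """
    shape = getattr(raw_speech, "shape", None)
    if shape is not None:
        # Array-like input: more than one dimension is treated as a batch.
        return len(shape) > 1
    if isinstance(raw_speech, (list, tuple)) and len(raw_speech) > 0:
        # A sequence of clips: the first element is itself a sequence or array.
        first = raw_speech[0]
        return isinstance(first, (list, tuple)) or hasattr(first, "shape")
    return False
```

Checking the attribute instead of the type keeps the function usable when only one of the array libraries is installed.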

Fix ONNX export for causal LM sequence classifiers by removing reverse indexing (#28144) · 548a8f61

Dean Wyatte authored Dec 22, 2023

* normalize reverse indexing for causal lm sequence classifiers

* normalize reverse indexing for causal lm sequence classifiers

* normalize reverse indexing for causal lm sequence classifiers

* use modulo instead

* unify modulo-based sequence lengths

548a8f61
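
The "use modulo instead" step above can be illustrated in plain Python. This is a sketch of the general idea, not the code from the commit: the helper name `last_token_index` is hypothetical, and the assumption is that the classifier wants the position just before the first padding token, with a sequence that has no padding mapping to its final position.

```python
from typing import List


def last_token_index(input_ids: List[int], pad_token_id: int) -> int:
    """Index of the last non-padding token, without negative indexing.

    Hypothetical helper for illustration; assumes right-padded input.
    """
    seq_len = len(input_ids)
    # Position of the first pad token, or 0 when there is none (this mirrors
    # an argmax over a boolean mask that is False everywhere).
    first_pad = next(
        (i for i, token in enumerate(input_ids) if token == pad_token_id), 0
    )
    # first_pad - 1 is -1 when there is no padding; the modulo maps it to
    # seq_len - 1, so the result is always a non-negative index.
    return (first_pad - 1) % seq_len


print(last_token_index([5, 6, 7, 0, 0], pad_token_id=0))
print(last_token_index([5, 6, 7, 8, 9], pad_token_id=0))
```

Keeping the index non-negative avoids relying on reverse (`-1`) indexing, which is what the commit title identifies as the obstacle to ONNX export.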

Update `docs/source/en/perf_infer_gpu_one.md` (#28198) · 71f46057
Yih-Dar authored Dec 22, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
71f46057
[`Docs`] Add 4-bit serialization docs (#28182) · 3a8769f6
Younes Belkada authored Dec 22, 2023
```
* add 4-bit serialization docs

* up

* up
```
3a8769f6