"examples/research_projects/distillation/train.py" did not exist on "c51e533a5febe3fae2bb33b060f6b1f36a92e003"
- 08 Apr, 2024 7 commits
-
JINO ROHIT authored
-
Fanli Lin authored
* add bnb flag
* move maker
* add accelerator maker
-
Haz Sameen Shahgir authored
updated examples/pytorch/language-modeling scripts and requirements.txt to require datasets>=2.14.0 (#30120)

updated requirements.txt and require_version() calls in examples/pytorch/language-modeling to require datasets>=2.14.0
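The same gating pattern appears in each affected script; a minimal sketch of the call (the hint string is illustrative):

```python
# Sketch of the version gate used by the example scripts; the hint text is an assumption.
from transformers.utils.versions import require_version

# Raises ImportError with the hint below if datasets < 2.14.0 is installed.
require_version(
    "datasets>=2.14.0",
    "To fix: pip install -r examples/pytorch/language-modeling/requirements.txt",
)
```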
-
Howard Liberty authored
* Make MLFlow version detection more robust and handle mlflow-skinny
* Make function name clearer and refactor the logic
* Further refactor
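mlflow-skinny ships the same `mlflow` module under a different distribution name, so module detection alone succeeds while metadata lookup fails. A minimal sketch of distribution-aware detection (an illustration, not the exact transformers code):

```python
import importlib.metadata
from typing import Optional


def get_mlflow_version() -> Optional[str]:
    # Query both distribution names; mlflow-skinny installs the "mlflow"
    # module but registers its metadata under "mlflow-skinny".
    for dist_name in ("mlflow", "mlflow-skinny"):
        try:
            return importlib.metadata.version(dist_name)
        except importlib.metadata.PackageNotFoundError:
            continue
    return None  # MLflow is not installed at all
```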
-
Xu Song authored
-
vaibhavagg303 authored
* add _torch_extract_fbank_features_batch function in feature_extractor_whisper
* reformat feature_extraction_whisper.py file
* handle batching in single function
* add gpu test & doc
* add batch test & device in each __call__
* add device arg in doc string

Co-authored-by: vaibhav.aggarwal <vaibhav.aggarwal@sprinklr.com>
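A usage sketch of the batched, device-aware path added here (checkpoint and dummy audio are illustrative):

```python
import numpy as np
from transformers import WhisperFeatureExtractor

feature_extractor = WhisperFeatureExtractor.from_pretrained("openai/whisper-tiny")

# Two dummy 1-second clips at 16 kHz stand in for real audio.
audio_batch = [np.zeros(16000, dtype=np.float32), 0.1 * np.ones(16000, dtype=np.float32)]

inputs = feature_extractor(
    audio_batch,
    sampling_rate=16000,
    return_tensors="pt",
    device="cuda",  # the new device argument; needs the torch backend
)
print(inputs.input_features.shape)  # (batch, n_mels, frames)
```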
-
Cylis authored
-
- 05 Apr, 2024 13 commits
-
Raushan Turganbay authored
* clean-up whisper kwargs
* failing test
-
Yih-Dar authored
* fix
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Kola authored
* Add docstrings and types for MambaCache
* Update src/transformers/models/mamba/modeling_mamba.py
* Update src/transformers/models/mamba/modeling_mamba.py
* Update src/transformers/models/mamba/modeling_mamba.py
* make fixup
* import copy in generation_whisper
* ruff
* Revert "make fixup"

This reverts commit c4fedd6f60e3b0f11974a11433bc130478829a5c.
-
Yih-Dar authored
* separate jobs
* separate jobs
* use channel name directly instead of ID
* use channel name directly instead of ID
* use channel name directly instead of ID

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Michael Benayoun authored
* [WIP] fix fx
* [WIP] fix fx
* [WIP] fix fx
* [WIP] fix fx
* [WIP] fix fx
* Apply changes to other models
-
Yih-Dar authored
* fix
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
miRx923 authored
Update quantizer_bnb_4bit.py: the ValueError string should say "....you need to set `llm_int8_enable_fp32_cpu_offload=True`...." instead of "`load_in_8bit_fp32_cpu_offload=True`" (#30013)

* Update quantizer_bnb_4bit.py

There is a mistake in the ValueError on line 86 of quantizer_bnb_4bit.py: the error string should say "....you need to set `llm_int8_enable_fp32_cpu_offload=True`...." instead of "load_in_8bit_fp32_cpu_offload=True". The BitsAndBytesConfig() arguments were updated, but the ValueError in quantizer_bnb_4bit.py was not.

* Update quantizer_bnb_4bit.py

Changed the ValueError string "...you need to set load_in_8bit_fp32_cpu_offload=True..." to "....you need to set llm_int8_enable_fp32_cpu_offload=True....".
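For reference, the flag the corrected message points to lives on `BitsAndBytesConfig`; a sketch (the model name is an arbitrary example):

```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    llm_int8_enable_fp32_cpu_offload=True,  # the flag named in the fixed ValueError
)

# With device_map="auto", modules that spill to CPU are kept in fp32.
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-1.3b",  # arbitrary example checkpoint
    quantization_config=quantization_config,
    device_map="auto",
)
```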
-
Marc Sun authored
fix bnb test
-
NielsRogge authored
* Add image processor to trainer
* Replace tokenizer=image_processor everywhere
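Assuming the new argument landed as `image_processor`, the wiring changes roughly as in this sketch (`model` and `train_dataset` are placeholders):

```python
from transformers import AutoImageProcessor, Trainer, TrainingArguments

image_processor = AutoImageProcessor.from_pretrained("google/vit-base-patch16-224")

trainer = Trainer(
    model=model,                      # placeholder: your vision model
    args=TrainingArguments(output_dir="out"),
    train_dataset=train_dataset,      # placeholder: your dataset
    image_processor=image_processor,  # previously: tokenizer=image_processor
)
```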
-
Adam Louly authored
* fix mixtral onnx export
* fix qwen model
-
Wang, Yi authored
* if output is tuple like facebook/hf-seamless-m4t-medium, waveform is the first element

Signed-off-by: Wang, Yi <yi.a.wang@intel.com>

* add test and fix batch issue

Signed-off-by: Wang, Yi <yi.a.wang@intel.com>

* add dict output support for seamless_m4t

Signed-off-by: Wang, Yi <yi.a.wang@intel.com>
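A sketch of the output handling this describes (`model` and `inputs` are placeholders; the dict key is an assumption):

```python
output = model.generate(**inputs)  # placeholder model/inputs

if isinstance(output, tuple):
    # e.g. facebook/hf-seamless-m4t-medium: waveform is the first element
    waveform = output[0]
else:
    waveform = output["waveform"]  # dict-style output; key name is an assumption
```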
-
Yih-Dar authored
skip test_encode_decode_fast_slow_all_tokens for now

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Add whisper

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 04 Apr, 2024 3 commits
-
Saurabh Dash authored
* changes
* addressing comments
* smol fix
-
byi8220 authored
* Defaulted IdeficsProcessor padding to 'longest', removed manual padding
* make fixup
* Defaulted processor call to padding=False
* Add padding to processor call in IdeficsModelIntegrationTest as well
* Defaulted IdeficsProcessor padding to 'longest', removed manual padding
* make fixup
* Defaulted processor call to padding=False
* Add padding to processor call in IdeficsModelIntegrationTest as well
* redefaulted padding=longest again
* fixup/doc
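Usage sketch of the final default (`prompts` is a placeholder for interleaved text/image prompts):

```python
from transformers import IdeficsProcessor

processor = IdeficsProcessor.from_pretrained("HuggingFaceM4/idefics-9b")

# padding="longest" is now the default; shown explicitly for clarity.
inputs = processor(prompts, padding="longest", return_tensors="pt")
```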
-
byi8220 authored
* implement convert_mamba_ssm_checkpoint_to_pytorch
* Add test test_model_from_mamba_ssm_conversion
* moved convert_ssm_config_to_hf_config to inside mamba_ssm_available check
* fix skipif clause
* moved skips to inside test since skipif decorator isn't working for some reason
* Added validation
* removed test
* fixup
* only compare logits
* remove weight rename
* Update src/transformers/models/mamba/convert_mamba_ssm_checkpoint_to_pytorch.py
* nits

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
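A hedged sketch of the conversion flow the bullets describe; the config key names and the no-rename assumption come from the bullets above, not from the script itself:

```python
from transformers import MambaConfig, MambaForCausalLM


def convert_checkpoint(original_config: dict, original_state_dict: dict) -> MambaForCausalLM:
    # Map the mamba_ssm config onto MambaConfig, as
    # convert_ssm_config_to_hf_config does (key names are assumptions).
    hf_config = MambaConfig(
        vocab_size=original_config["vocab_size"],
        hidden_size=original_config["d_model"],
        num_hidden_layers=original_config["n_layer"],
    )
    model = MambaForCausalLM(hf_config)
    # Per the "remove weight rename" bullet, keys are assumed to match 1:1.
    model.load_state_dict(original_state_dict)
    return model
```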
-
- 03 Apr, 2024 12 commits
-
Jacky Lee authored
feat: enable multi-device for efficientnet
-
Zach Mueller authored
* Docstring to note about zero init
* Check for accelerate
* Change conditional return
* Tweak
* Add new accelerate-specific zero3 check
* Fix import
* Revert to RTFM
* Update src/transformers/modeling_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Arthur authored
* fix
* sort imports
-
Raushan Turganbay authored
quick fix
-
Steven Liu authored
new audio file
-
Raushan Turganbay authored
* fix vipllava generation * consistent llava code * revert llava tests changes
-
Ondřej Cífka authored
* Fix is_scores_logprobs in WhisperNoSpeechDetection
* Add test_whisper_longform_no_speech_detection
* Fix typo
-
Ondřej Cífka authored
* Fix generate_with_fallback **kwargs
* Change pop to get
* Delete keys from kwargs to prevent overriding generation_config
* Revert to passing kwargs by reference, but make a (shallow) copy
* dict -> copy.copy
* Add test_whisper_longform_multi_batch_beam
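The shallow-copy point generalizes; a minimal illustration of the bug class (names are illustrative, not Whisper's actual code):

```python
import copy


def one_attempt(kwargs: dict) -> None:
    # Copy before mutating so retries see the caller's original kwargs.
    local = copy.copy(kwargs)
    local.pop("num_beams", None)  # safe: only the copy is modified


options = {"num_beams": 5, "temperature": 0.0}
one_attempt(options)
assert "num_beams" in options  # caller's dict survives each fallback attempt
```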
-
Ren Xuancheng authored
qwen2: fixed tokens starting with # in slow tokenizer; add tests

Co-authored-by: jklj077 <17811943+jklj077@users.noreply.github.com>
-
Miguel Almeida authored
To address NaN logit outputs for certain combinations of the `image_size`, `patch_size` and `depths` configuration parameters, an assertion now ensures that the resulting `window_size` in the model's Self Attention class is greater than 1, preventing division by zero when normalizing `relative_coords_table`. Fix: #28675
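A toy illustration of the failure mode and the guard (a sketch, not the model's exact code):

```python
def normalized_coords(window_size: int) -> list:
    # The guard this fix adds, in spirit: window_size == 1 would make the
    # (window_size - 1) denominator zero and propagate NaNs into the logits.
    if window_size <= 1:
        raise ValueError("window_size must be greater than 1")
    coords = range(-(window_size - 1), window_size)
    return [c / (window_size - 1) for c in coords]
```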
-
fxmarty authored
* fix encodec onnx export for musicgen
* simplification
* fix quality
* better style
-
Yih-Dar authored
update Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 02 Apr, 2024 5 commits
-
Mario Šaško authored
-
Joao Gante authored
* fix norm
* fix logits processors doctests
-
Nicolas Patry authored
* Hard error when ignoring tensors. (#27484)
* [WIP] Hard error when ignoring tensors.
* Better selection/error when saving a checkpoint.
  - Find all names we should normally drop (those are in the transformers config)
  - Find all disjoint tensors (for those we can safely trigger a copy to get rid of the sharing before saving)
  - Clone those disjoint tensors getting rid of the issue
  - Find all identical names (those should be declared in the config but we try to find them all anyway.)
  - For all identical names:
    - If they are in the config, just ignore them; everything is fine
    - If they are not, warn about them.
  - For all remainder tensors which are shared yet neither identical NOR disjoint, raise a hard error.
* Adding a failing test on `main` that passes here.
* We don't need to keep the subfolder logic in this test.
* Apply suggestions from code review
* Add small tests.
* Dead variable.
* Fixup.
* Fixing tied_Weights_keys on generic models.
* Fixup + T5 encoder/decoder tying (with different layers)
* Code quality.
* Dynamic member.
* trigger
* Fixing encoder name for other types of encoder/decoder combos.
* Fix scoping.
* Update .github/workflows/self-scheduled.yml
* Fixing the tied_weights after the call.

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
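The disjoint/identical classification above hinges on spotting parameters that share storage. A minimal sketch of that detection, assuming a plain PyTorch state dict (transformers itself relies on safetensors helpers for this):

```python
from collections import defaultdict

import torch


def find_shared_tensors(state_dict: dict) -> list:
    # Group parameter names by the data pointer of their underlying
    # storage; groups larger than one share memory.
    groups = defaultdict(set)
    for name, tensor in state_dict.items():
        if isinstance(tensor, torch.Tensor) and tensor.device.type != "meta":
            groups[tensor.untyped_storage().data_ptr()].add(name)
    return [names for names in groups.values() if len(names) > 1]
```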
-
Minsub Lee (Matt) authored
* Fix skip_special_tokens process for Wav2Vec2CTCTokenizer._decode
* Fix skip_special_tokens for Wav2Vec2CTCTokenizer._decode
* Exclude pad_token filtering since it is used as CTC-blank token
* Add small test for skip_special_tokens
* Update decoding test for added new token
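Usage sketch (`predicted_ids` is a placeholder for argmax-decoded CTC ids):

```python
from transformers import Wav2Vec2CTCTokenizer

tokenizer = Wav2Vec2CTCTokenizer.from_pretrained("facebook/wav2vec2-base-960h")

# With the fix, skip_special_tokens=True no longer strips the pad token
# before CTC grouping; pad doubles as the CTC blank between repeats.
text = tokenizer.decode(predicted_ids, skip_special_tokens=True)
```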
-
Michael authored
-