1. 18 Apr, 2024 2 commits
  2. 17 Apr, 2024 1 commit
  3. 15 Apr, 2024 1 commit
  4. 10 Apr, 2024 1 commit
    • Fix and simplify semantic-segmentation example (#30145) · 56d001b2
      Pavel Iakubovskii authored
      * Remove unused augmentation
      
      * Fix pad_if_smaller() and remove unused augmentation
      
      * Add indentation
      
      * Fix requirements
      
      * Update dataset use instructions
      
      * Replace transforms with albumentations
      
      * Replace identity transform with None
      
      * Fixing formatting
      
      * Fixed comment placement
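      A minimal sketch of the albumentations-style pipeline this commit switches to; the transform choices and target size below are illustrative assumptions, not the example script's exact settings.

      ```python
      import albumentations as A

      # Train-time pipeline (transforms and sizes are assumptions for the sketch).
      train_transform = A.Compose(
          [
              A.Resize(height=512, width=512),
              A.HorizontalFlip(p=0.5),
          ]
      )

      # The previous identity transform is simply replaced by None.
      eval_transform = None

      # albumentations applies the same spatial transform to image and mask:
      # augmented = train_transform(image=image, mask=mask)
      ```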
  5. 09 Apr, 2024 1 commit
  6. 08 Apr, 2024 2 commits
  7. 05 Apr, 2024 1 commit
  8. 02 Apr, 2024 1 commit
  9. 30 Mar, 2024 1 commit
  10. 21 Mar, 2024 1 commit
  11. 20 Mar, 2024 1 commit
  12. 15 Mar, 2024 1 commit
  13. 12 Mar, 2024 2 commits
  14. 11 Mar, 2024 2 commits
    • Make torch xla available on GPU (#29334) · 873d9bb3
      Yitong Huang authored
      
      
      * add USE_TORCH_XLA env
      
      * rename torch_tpu to torch_xla
      
      * better is_torch_xla_available; fix some fsdp and performance issues
      
      * fix format
      
      * fix bug when pjrt_device is cpu
      
      * fix bug
      
      * fix the deprecation handling
      
      ---------
      Co-authored-by: anw90 <ang868@gmail.com>
      Co-authored-by: wangang.wa <wangang.wa@alibaba-inc.com>
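      A hedged sketch of how the renamed `is_torch_xla_available` helper introduced here is typically used; the import path and exact behavior may differ by transformers version.

      ```python
      import torch
      from transformers import is_torch_xla_available

      # Per the commit description, USE_TORCH_XLA=0 in the environment disables
      # the XLA path even when torch_xla is installed.
      if is_torch_xla_available():
          import torch_xla.core.xla_model as xm

          device = xm.xla_device()
      else:
          device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
      ```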
    • Add Fill-in-the-middle training objective example - PyTorch (#27464) · 6d67837f
      Tanay Mehta authored
      * add: initial script to train clm fim
      
      * fix: if training model from scratch, new tokens will be added and embeddings resized
      
      * fix: fixed attention_mask errors when generating FIM data
      
      * fix: file formatted using black
      
      * add: run_fim_no_trainer.py and fixed some comments in run_fim.py
      
      * add: added fim examples to the README.md and ran code fixup
      
      * fix: little bug in both fim training scripts
      
      * fix: remove comment from notebook and add a note on fim-related params
      
      * fix: minor typo in README
      
      * add: suggested minor changes to README and run_fim.py
      
      * add: gradient_accumulation_steps and gradient_checkpointing args
      
      * add: improved model embedding resizing
      
      * add: pad_to_multiple_of and attn_implementation params
      
      * add: requested minor changes
      
      * add: deepspeed zero compatibility
      
      * add: resize embeddings layer with zero3 support for fim model initialization
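      A hedged sketch of the embedding-resizing step several of the bullets above refer to; the sentinel token strings and the base checkpoint are placeholders, not the scripts' exact values.

      ```python
      from transformers import AutoModelForCausalLM, AutoTokenizer

      tokenizer = AutoTokenizer.from_pretrained("gpt2")
      model = AutoModelForCausalLM.from_pretrained("gpt2")

      # Add FIM sentinel tokens (token strings here are illustrative).
      tokenizer.add_special_tokens(
          {"additional_special_tokens": ["<fim_prefix>", "<fim_middle>", "<fim_suffix>", "<fim_pad>"]}
      )

      # pad_to_multiple_of keeps the resized vocabulary hardware friendly,
      # matching the parameter the commit adds.
      model.resize_token_embeddings(len(tokenizer), pad_to_multiple_of=8)
      ```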
  15. 21 Feb, 2024 1 commit
  16. 19 Feb, 2024 2 commits
  17. 16 Feb, 2024 1 commit
  18. 12 Feb, 2024 2 commits
  19. 07 Feb, 2024 1 commit
  20. 02 Feb, 2024 1 commit
    • [Docs] Fix spelling and grammar mistakes (#28825) · 721ee783
      Klaus Hipp authored
      * Fix typos and grammar mistakes in docs and examples
      
      * Fix typos in docstrings and comments
      
      * Fix spelling of `tokenizer` in model tests
      
      * Remove erroneous spaces in decorators
      
      * Remove extra spaces in Markdown link texts
  21. 01 Feb, 2024 1 commit
  22. 30 Jan, 2024 1 commit
    • Pin Torch to <2.2.0 (#28785) · 74c9cfea
      Matt authored
      
      
      * Pin torch to <2.2.0
      
      * Pin torchvision and torchaudio as well
      
      * Playing around with versions to see if this helps
      
      * twiddle something to restart the CI
      
      * twiddle it back
      
      * Try changing the natten version
      
      * make fixup
      
      * Revert "Try changing the natten version"
      
      This reverts commit de0d6592c35dc39ae8b5a616c27285db28262d06.
      
      * make fixup
      
      * fix fix fix
      
      * fix fix fix
      
      ---------
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
  23. 29 Jan, 2024 1 commit
  24. 26 Jan, 2024 1 commit
  25. 22 Jan, 2024 2 commits
  26. 19 Jan, 2024 1 commit
  27. 18 Jan, 2024 2 commits
    • Making CTC training example more general (#28582) · 772307be
      Yoach Lacombe authored
      
      
      * add w2v2bert compatibility
      
      * Update examples/pytorch/speech-recognition/run_speech_recognition_ctc.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      ---------
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
    • Add new meta w2v2-conformer BERT-like model (#28165) · d2cdefb9
      Yoach Lacombe authored
      
      
      * first commit
      
      * correct default value for non-causal
      
      * update config and modeling code
      
      * update converting checkpoint
      
      * clean modeling and fix tests
      
      * make style
      
      * add new config parameters to docstring
      
      * fix copied from statements
      
      * Apply suggestions from code review
      Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * make position_embeddings_type docstrings clearer
      
      * clean converting script
      
      * remove function not used
      
      * clean modeling file
      
      * apply suggestion for test file + add convert script to not_doctested
      
      * modify tests according to review - cleaner logic and more tests
      
      * Apply nit suggestions from code review
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * add checker of valid position embeddings type
      
      * instantiate new layer norm layer with the right eps
      
      * fix freeze_feature_encoder since it can be None in some cases
      
      * add test same output in convert script
      
      * restore wav2vec2conformer and add new model
      
      * create processor and FE + clean
      
      * add new model code
      
      * fix convert script and set default config parameters
      
      * correct model id paths
      
      * make style
      
      * make fix-copies and cleaning files
      
      * fix copied from statements
      
      * complete .md and fix copies
      
      * clean convert script argument defaults
      
      * fix config parameters docstrings
      
      * fix config docstring
      
      * add copied from and enrich FE tests
      
      * fix copied from and repo-consistency
      
      * add autotokenizer
      
      * make test input length shorter and change docstring code
      
      * fix docstrings and copied from
      
      * add add_adapter to ASR training example
      
      * make testing of adapters more robust
      
      * adapt to multi adapter layers
      
      * refactor input_values->input_features and remove w2v2-bert feature extractor
      
      * remove pretraining model
      
      * remove deprecated features and useless lines
      
      * add copied from and ignore statements to modeling tests
      
      * remove pretraining model #2
      
      * change import in convert script
      
      * change default in convert script
      
      * update readme and remove useless line
      
      * Update tests/models/wav2vec2_bert/test_processor_wav2vec2_bert.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * refactor BERT to Bert for consistency
      
      * remove useless ignore copy statement
      
      * add persistent to buffer in rotary
      
      * add eps in LayerNorm init and remove copied from
      
      * add adapter activation parameters and add copied from statements
      
      * Fix copied statements and add unitest.skip reasons
      
      * add copied statement in test_processor
      
      * refactor processor
      
      * make style
      
      * replace numpy random by torch rand
      
      * remove expected output CTC
      
      * improve converting script with processor class
      
      * Apply suggestions from code review
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * remove gumbel class
      
      * remove tests related to previously deleted class
      
      * Update src/transformers/models/wav2vec2_bert/configuration_wav2vec2_bert.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * correct typos
      
      * remove unused parameters
      
      * update processor to take both text and audio
      
      * update checkpoints
      
      * update expected output and add ctc expected output
      
      * add label_attention_mask
      
      * replace pt with np in processor tests
      
      * fix typo
      
      * revert to behaviour with labels_attention_mask
      
      ---------
      Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
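      A hedged sketch of the `input_values` → `input_features` change mentioned above: the new model consumes `input_features` produced by its feature extractor. The checkpoint id is an assumption for illustration.

      ```python
      import torch
      from transformers import AutoFeatureExtractor, Wav2Vec2BertModel

      # Checkpoint id assumed; see the model docs for released checkpoints.
      feature_extractor = AutoFeatureExtractor.from_pretrained("facebook/w2v-bert-2.0")
      model = Wav2Vec2BertModel.from_pretrained("facebook/w2v-bert-2.0")

      audio = torch.randn(16_000).numpy()  # one second of dummy 16 kHz audio
      inputs = feature_extractor(audio, sampling_rate=16_000, return_tensors="pt")

      with torch.no_grad():
          outputs = model(**inputs)  # the batch carries input_features, not input_values
      ```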
  28. 16 Jan, 2024 1 commit
  29. 15 Jan, 2024 2 commits
  30. 12 Jan, 2024 1 commit
  31. 11 Jan, 2024 1 commit
    • Set `cache_dir` for `evaluate.load()` in example scripts (#28422) · 95091e15
      Alex Hedges authored
      While using `run_clm.py`,[^1] I noticed that some files were being added
      to my global cache, not the local cache. I set the `cache_dir` parameter
      for the one call to `evaluate.load()`, which partially solved the
      problem. I figured that while I was fixing the one script upstream, I
      might as well fix the problem in all other example scripts that I could.
      
      There are still some files being added to my global cache, but this
      appears to be a bug in `evaluate` itself. This commit at least moves
      some of the files into the local cache, which is better than before.
      
      To create this PR, I made the following regex-based transformation:
      `evaluate\.load\((.*?)\)` -> `evaluate\.load\($1,
      cache_dir=model_args.cache_dir\)`. After using that, I manually fixed
      all modified files with `ruff` serving as useful guidance. During the
      process, I removed one existing usage of the `cache_dir` parameter in a
      script that did not have a corresponding `--cache-dir` argument
      declared.
      
      [^1]: I specifically used `pytorch/language-modeling/run_clm.py` from
      v4.34.1 of the library. For the original code, see the following URL:
      https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.
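      A minimal sketch of the change the message describes, with a plain variable standing in for the example scripts' `model_args.cache_dir`:

      ```python
      import evaluate

      cache_dir = "./cache"  # stands in for model_args.cache_dir in the example scripts

      # Before: metric files end up in the global evaluate cache.
      metric = evaluate.load("accuracy")

      # After the change: the script's cache directory is forwarded explicitly.
      metric = evaluate.load("accuracy", cache_dir=cache_dir)
      ```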