Commits · 57c965a8f1000b4016ad219e616f509d8af3f5b5 · chenpangpang / transformers

17 May, 2024 1 commit

Remove deprecated logic and warnings (#30743) · 57c965a8

amyeroberts authored May 17, 2024

* Remove deprecated logic and warnings

* Add back some code that seems to be important...

* Let's just add all the nllb stuff back; removing it is a bit more involved

* Remove kwargs

* Remove more kwargs

57c965a8

10 May, 2024 1 commit
- [docs] Update link in es/pipeline_webserver.md (#30745) · 8ce4fefc
  Aaron Jimenez authored May 10, 2024
```
* update link

* run make style
```
  8ce4fefc
08 May, 2024 1 commit

Add examples for detection models finetuning (#30422) · 998dbe06

Pavel Iakubovskii authored May 08, 2024

* Training script for object detection

* Evaluation script for object detection

* Training script for object detection with eval loop outside trainer

* Trainer DETR finetuning

* No trainer DETR finetuning

* Eval script

* Refine object detection example with trainer

* Remove commented code and enable telemetry

* No trainer example

* Add requirements for object detection examples

* Add test for trainer example

* Readme draft

* Fix uploading to HUB

* Readme improvements

* Update eval script

* Adding tests for object-detection examples

* Add object-detection example

* Add object-detection resources to docs

* Update README with custom dataset instructions

* Update year

* Replace valid with validation

* Update instructions for custom dataset

* Remove eval script

* Remove use_auth_token

* Add copied from and telemetry

* Fixup

* Update readme

* Fix id2label

* Fix links in docs

* Update examples/pytorch/object-detection/run_object_detection.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update examples/pytorch/object-detection/run_object_detection.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Move description to the top

* Fix Trainer example

* Update no trainer example

* Update albumentations version

---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

998dbe06

02 May, 2024 1 commit
- Fix memory leak with CTC training script on Chinese languages (#30358) · 12c5544d
  Bai Li authored May 02, 2024
```
* Fix memory leak with CTC training script on Chinese languages

* Fix lint
```
  12c5544d
01 May, 2024 2 commits
- Fix canonical model --model_type in examples (#30480) · bbaa8cef
  amyeroberts authored May 01, 2024
```
Fix --model_type in examples
```
  bbaa8cef
- Fix QA example (#30580) · 1e05671d
  Matt authored May 01, 2024
```
* Handle cases when CLS token is absent

* Use BOS token as a fallback
```
  1e05671d
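
The two bullets above describe a fallback for locating the "no answer" position when a tokenizer has no CLS token. The following is a minimal sketch of that idea only; the helper name `find_null_answer_index`, its arguments, and the final default of position 0 are assumptions for illustration, and the actual change in #30580 is not shown in this log and may differ.

```python
from typing import Optional, Sequence


def find_null_answer_index(
    input_ids: Sequence[int],
    cls_token_id: Optional[int],
    bos_token_id: Optional[int],
) -> int:
    """Hypothetical helper: choose the index used to label unanswerable examples."""
    ids = list(input_ids)
    # Prefer the CLS token when the tokenizer defines one and it is present.
    if cls_token_id is not None and cls_token_id in ids:
        return ids.index(cls_token_id)
    # Otherwise use the BOS token as a fallback, as the commit message describes.
    if bos_token_id is not None and bos_token_id in ids:
        return ids.index(bos_token_id)
    # Assumption: with neither token available, default to the first position.
    return 0
```

For example, `find_null_answer_index([7, 1, 5], None, 1)` picks the BOS position because no CLS id is defined.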
30 Apr, 2024 1 commit

Fix seq2seq collator padding (#30556) · 9112520b

Anton Vlasjuk authored Apr 30, 2024

* fix seq2seq data collator to respect the given padding strategy

further added tests for the seq2seq data collator in the style of the `data_collator_for_token_classification` (pt, tf, np)

* formatting and change bool equals "==" to "is"

* add missed return types in tests

* update numpy test as it can handle unequal shapes, not like pt or tf

9112520b
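
The first bullet above says the seq2seq collator should respect the given padding strategy. As a rough illustration, the sketch below pads label sequences according to a `padding` argument. The function `pad_labels` is hypothetical and works on plain Python lists; it is not the library's collator, and the accepted strategy values are assumed from the commit message.

```python
from typing import List, Optional, Sequence, Union

# -100 is the default ignore_index of PyTorch's cross-entropy loss.
LABEL_PAD_TOKEN_ID = -100


def pad_labels(
    labels: Sequence[Sequence[int]],
    padding: Union[bool, str] = True,
    max_length: Optional[int] = None,
) -> List[List[int]]:
    """Hypothetical helper: pad label sequences according to `padding`."""
    # No padding requested: return the sequences unchanged.
    if padding is False or padding == "do_not_pad":
        return [list(seq) for seq in labels]
    if padding == "max_length":
        if max_length is None:
            raise ValueError("max_length is required when padding='max_length'")
        target = max_length
    else:
        # True or "longest": pad every sequence to the longest one in the batch.
        target = max(len(seq) for seq in labels)
    # Sequences already at or beyond the target length are left as they are.
    return [list(seq) + [LABEL_PAD_TOKEN_ID] * (target - len(seq)) for seq in labels]
```

With `padding="longest"` the batch is padded to its longest member, while `padding=False` leaves ragged sequences untouched, which is the distinction the fix is about.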

26 Apr, 2024 1 commit

[examples] update whisper fine-tuning (#29938) · 38b53da3

Sanchit Gandhi authored Apr 26, 2024

* [examples] update whisper fine-tuning

* deprecate forced/suppress tokens

* item assignment

* update readme

* final fix

38b53da3

18 Apr, 2024 2 commits
- 🚨🚨🚨Deprecate `evaluation_strategy` to `eval_strategy`🚨🚨🚨 (#30190) · 60d5f8f9
  Zach Mueller authored Apr 18, 2024
```
* Alias

* Note alias

* Tests and src

* Rest

* Clean

* Change typing?

* Fix tests

* Deprecation versions
```
  60d5f8f9
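
The entry above renames `evaluation_strategy` to `eval_strategy` while keeping the old name as an alias. One common way to handle such a rename is sketched below; the function `resolve_eval_strategy` and its warning text are hypothetical and are not taken from the library's source.

```python
import warnings
from typing import Optional


def resolve_eval_strategy(
    eval_strategy: str = "no",
    evaluation_strategy: Optional[str] = None,
) -> str:
    """Hypothetical helper: accept the old keyword but steer callers to the new one."""
    if evaluation_strategy is not None:
        # The old name still works, but emits a deprecation-style warning.
        warnings.warn(
            "`evaluation_strategy` is deprecated; use `eval_strategy` instead.",
            FutureWarning,
        )
        return evaluation_strategy
    return eval_strategy
```

Scripts that still pass the old keyword keep running under this pattern, and the warning gives users time to migrate before the alias is removed.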
- Dev version · ce8e64fb
  Lysandre authored Apr 18, 2024
  
  ce8e64fb
17 Apr, 2024 1 commit
- Fix test `ExamplesTests::test_run_translation` (#30281) · 05dab4e5
  Yih-Dar authored Apr 17, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  05dab4e5
15 Apr, 2024 1 commit
- Set pad_token in run_glue_no_trainer.py #28534 (#30234) · f0107862
  JINO ROHIT authored Apr 15, 2024
  
  f0107862
10 Apr, 2024 1 commit

Fix and simplify semantic-segmentation example (#30145) · 56d001b2

Pavel Iakubovskii authored Apr 10, 2024

* Remove unused augmentation

* Fix pad_if_smaller() and remove unused augmentation

* Add indentation

* Fix requirements

* Update dataset use instructions

* Replace transforms with albumentations

* Replace identity transform with None

* Fixing formatting

* Fixed comment place

56d001b2

09 Apr, 2024 1 commit
- [Trainer] Undo #29896 (#30129) · e9c23fa0
  NielsRogge authored Apr 09, 2024
```
* Undo

* Use tokenizer

* Undo data collator
```
  e9c23fa0
08 Apr, 2024 2 commits
- fixing issue 30034 - adding data format for run_ner.py (#30088) · f5658732
  JINO ROHIT authored Apr 08, 2024
  
  f5658732
- updated examples/pytorch/language-modeling scripts and requirements.txt to... · 5e673ed2
  Haz Sameen Shahgir authored Apr 08, 2024
```
updated examples/pytorch/language-modeling scripts and requirements.txt to require datasets>=2.14.0 (#30120)

updated requirements.txt and require_version() calls in examples/pytorch/language-modeling to require datasets>=2.14.0
```
  5e673ed2
05 Apr, 2024 1 commit
- [Trainer] Allow passing image processor (#29896) · 1ab71364
  NielsRogge authored Apr 05, 2024
```
* Add image processor to trainer

* Replace tokenizer=image_processor everywhere
```
  1ab71364
02 Apr, 2024 1 commit
- Fix `remove_columns` in `text-classification` example (#29351) · fce52cef
  Mario Šaško authored Apr 02, 2024
  
  fce52cef
30 Mar, 2024 1 commit
- Add warning message for `run_qa.py` (#29867) · 156d30da
  Jacky Lee authored Mar 30, 2024
```
* improve: error message for best model metric

* update: raise warning instead of error
```
  156d30da
21 Mar, 2024 1 commit
- Add support for `torch_dtype` in the run_mlm example (#29776) · ef6e371d
  Jacky Lee authored Mar 21, 2024
```
feat: add support for torch_dtype
Co-authored-by: Jacky Lee <jackylee328@gmail.com>
```
  ef6e371d
20 Mar, 2024 1 commit
- v4.40.0.dev.0 · 1248f092
  Arthur Zucker authored Mar 20, 2024
  
  1248f092
15 Mar, 2024 1 commit
- Rename `glue` to `nyu-mll/glue` (#29679) · f02aea27
  Quentin Lhoest authored Mar 15, 2024
```
* Update run_glue.py

* Update run_glue.py

* Update run_glue_no_trainer.py
```
  f02aea27
12 Mar, 2024 2 commits

Examples: check `max_position_embeddings` in the translation example (#29600) · d4796653

Joao Gante authored Mar 12, 2024

check max_position_embeddings

d4796653

Update legacy Repository usage in various example files (#29085) · b6404866

Hilco van der Wilk authored Mar 12, 2024

* Update legacy Repository usage in `examples/pytorch/text-classification/run_glue_no_trainer.py`

Marked for deprecation here https://huggingface.co/docs/huggingface_hub/guides/upload#legacy-upload-files-with-git-lfs

* Fix import order

* Replace all example usage of deprecated Repository

* Fix remaining repo call and rename args variable

* Revert removing creation of gitignore files and don't change research examples

b6404866

11 Mar, 2024 2 commits

Make torch xla available on GPU (#29334) · 873d9bb3

Yitong Huang authored Mar 11, 2024

* add USE_TORCH_XLA env

* rename torch_tpu to torch_xla

* better is_torch_xla_available; fix some fsdp and performance issues

* fix format

* fix bug when pjrt_device is cpu

* fix bug

* fix the deprecation handling

---------
Co-authored-by: anw90 <ang868@gmail.com>
Co-authored-by: wangang.wa <wangang.wa@alibaba-inc.com>

873d9bb3

Add Fill-in-the-middle training objective example - PyTorch (#27464) · 6d67837f

Tanay Mehta authored Mar 11, 2024

* add: initial script to train clm fim

* fix: if training model from scratch, new tokens will be added and embeddings resized

* fix: fixed attention_mask errors when generating FIM data

* fix: file formatted using black

* add: run_fim_no_trainer.py and fixed some comments in run_fim.py

* add: added fim examples to the README.md and ran code fixup

* fix: little bug in both fim training scripts

* fix: remove comment from notebook and added a note on fim related params

* fix: minor typo in README

* add: suggested minor changes to README and run_fim.py

* add: gradient_accumulation_steps and gradient_checkpointing args

* add: improved model embedding resizing

* add: pad_to_multiple_of and attn_implementation params

* add: requested minor changes

* add: deepspeed zero compatibility

* add: resize embeddings layer with zero3 support for fim model initialization

6d67837f
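
The entry above adds example scripts for the fill-in-the-middle (FIM) training objective. The core idea can be sketched at the character level: split a document into prefix, middle, and suffix, then reorder it so the middle comes last. The function `apply_fim`, the `fim_rate` parameter, and the sentinel strings below are assumptions for illustration; the scripts added in #27464 operate on token ids and may differ in detail.

```python
import random


def apply_fim(
    text: str,
    rng: random.Random,
    fim_rate: float = 0.5,
    prefix_tok: str = "<fim_prefix>",  # hypothetical sentinel strings
    suffix_tok: str = "<fim_suffix>",
    middle_tok: str = "<fim_middle>",
) -> str:
    """Hypothetical helper: rewrite `text` in prefix-suffix-middle order."""
    # Leave some examples untouched so the model still sees ordinary text.
    if len(text) < 2 or rng.random() >= fim_rate:
        return text
    # Pick two distinct cut points and split into prefix / middle / suffix.
    lo, hi = sorted(rng.sample(range(len(text) + 1), 2))
    prefix, middle, suffix = text[:lo], text[lo:hi], text[hi:]
    # The model is trained to generate the middle after seeing prefix and suffix.
    return f"{prefix_tok}{prefix}{suffix_tok}{suffix}{middle_tok}{middle}"
```

Calling `apply_fim(doc, random.Random(0), fim_rate=1.0)` always transforms the input; concatenating the recovered prefix, middle, and suffix gives back the original document.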

21 Feb, 2024 1 commit
- v4.39.dev.0 · 1a77f07f
  Arthur Zucker authored Feb 21, 2024
  
  1a77f07f
19 Feb, 2024 2 commits

change version (#29097) · b2724d7b

Arthur authored Feb 19, 2024

* change version

* nuke

* this doesn't make sense

* update some requirements.py

* revert + no main

* nits

* change cache number

* more pin

* revert

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

b2724d7b

Fix a typo in `examples/pytorch/text-classification/run_classification.py` (#29072) · 79132d4c
Jay Zhou authored Feb 19, 2024

79132d4c

16 Feb, 2024 1 commit
- Update all references to canonical models (#29001) · f497f564
  Lysandre Debut authored Feb 16, 2024
```
* Script & Manual edition

* Update
```
  f497f564
12 Feb, 2024 2 commits
- [Docs] Add language identifiers to fenced code blocks (#28955) · fe3df9d5
  Klaus Hipp authored Feb 12, 2024
```
Add language identifiers to code blocks
```
  fe3df9d5
- Updated requirements for image-classification samples: datasets>=2.14.0 (#28974) · 792819f6
  Alexey Fadeev authored Feb 12, 2024
```
Updated datasets requirements. Need a package version >= 2.14.0
```
  792819f6
07 Feb, 2024 1 commit

Update the cache number (#28905) · 308d2b90

Yih-Dar authored Feb 07, 2024

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

308d2b90

02 Feb, 2024 1 commit

[Docs] Fix spelling and grammar mistakes (#28825) · 721ee783

Klaus Hipp authored Feb 02, 2024

* Fix typos and grammar mistakes in docs and examples

* Fix typos in docstrings and comments

* Fix spelling of `tokenizer` in model tests

* Remove erroneous spaces in decorators

* Remove extra spaces in Markdown link texts

721ee783

01 Feb, 2024 1 commit
- [docs] fix some bugs about parameter description (#28806) · d98591a1
  zspo authored Feb 02, 2024
```
Co-authored-by: p_spozzhang <p_spozzhang@tencent.com>
```
  d98591a1
30 Jan, 2024 1 commit

Pin Torch to <2.2.0 (#28785) · 74c9cfea

Matt authored Jan 30, 2024

* Pin torch to <2.2.0

* Pin torchvision and torchaudio as well

* Playing around with versions to see if this helps

* twiddle something to restart the CI

* twiddle it back

* Try changing the natten version

* make fixup

* Revert "Try changing the natten version"

This reverts commit de0d6592c35dc39ae8b5a616c27285db28262d06.

* make fixup

* fix fix fix

* fix fix fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

74c9cfea

29 Jan, 2024 1 commit
- Fix input data file extension in examples (#28741) · 39fa4009
  Klaus Hipp authored Jan 29, 2024
  
  39fa4009
26 Jan, 2024 1 commit
- [docs] Fix datasets in guides (#28715) · abe0289e
  Steven Liu authored Jan 26, 2024
```
* change datasets

* fix
```
  abe0289e
22 Jan, 2024 2 commits
- Fix lr_scheduler in no_trainer training scripts (#27872) · deb2b590
  bofeng huang authored Jan 22, 2024
```
* Fix lr_scheduler

* Fix lr scheduler
```
  deb2b590
- Fix id2label assignment in run_classification.py (#28590) · f0acf7b6
  jheitmann authored Jan 22, 2024
  
  f0acf7b6