Commits · 243d0de9971d953b2a69636fba0420fb56cd42e4 · chenpangpang / transformers

"examples/pytorch/question-answering/run_qa.py" did not exist on "7809eb82ae0341b7a02b1ce7ab7d6d551e9855d9"

20 Mar, 2024 1 commit
- Larger runner on CircleCI (#29750) · 243d0de9
  Yih-Dar authored Mar 20, 2024
```
larger runner
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  243d0de9
04 Mar, 2024 1 commit

NielsRogge authored Mar 04, 2024



* First draft

* More improvements

* More improvements

* More fixes

* Fix copies

* More improvements

* More fixes

* More improvements

* Convert checkpoint

* More improvements, set up tests

* Fix more tests

* Add UdopModel

* More improvements

* Fix equivalence test

* More fixes

* Redesign model

* Extend conversion script

* Use real inputs for conversion script

* Add image processor

* Improve conversion script

* Add UdopTokenizer

* Add fast tokenizer

* Add converter

* Update README's

* Add processor

* Add fully fledged tokenizer

* Add fast tokenizer

* Use processor in conversion script

* Add tokenizer tests

* Fix one more test

* Fix more tests

* Fix tokenizer tests

* Enable fast tokenizer tests

* Fix more tests

* Fix additional_special_tokens of fast tokenizer

* Fix tokenizer tests

* Fix more tests

* Fix equivalence test

* Rename image to pixel_values

* Rename seg_data to bbox

* More renamings

* Remove vis_special_token

* More improvements

* Add docs

* Fix copied from

* Update slow tokenizer

* Update fast tokenizer design

* Make text input optional

* Add first draft of processor tests

* Fix more processor tests

* Fix decoder_start_token_id

* Fix test_initialization

* Add integration test

* More improvements

* Improve processor, add test

* Add more copied from

* Add more copied from

* Add more copied from

* Add more copied from

* Remove print statement

* Update README and auto mapping

* Delete files

* Delete another file

* Remove code

* Fix test

* Fix docs

* Remove asserts

* Add doc tests

* Include UDOP in exotic model tests

* Add expected tesseract decodings

* Add sentencepiece

* Use same design as T5

* Add UdopEncoderModel

* Add UdopEncoderModel to tests

* More fixes

* Fix fast tokenizer

* Fix one more test

* Remove parallelisable attribute

* Fix copies

* Remove legacy file

* Copy from T5Tokenizer

* Fix rebase

* More fixes, copy from T5

* More fixes

* Fix init

* Use ArthurZ/udop for tests

* Make all model tests pass

* Remove UdopForConditionalGeneration from auto mapping

* Fix more tests

* fixups

* more fixups

* fix the tokenizers

* remove un-necessary changes

* nits

* nits

* replace truncate_sequences_boxes with truncate_sequences for fix-copies

* nit current path

* add a test for input ids

* ids that we should get taken from c9f7a32f57440d90ff79890270d376a1cc0acb68

* nits converting

* nits

* apply ruff

* nits

* nits

* style

* fix slow order of addition

* fix udop fast range as well

* fixup

* nits

* Add docstrings

* Fix gradient checkpointing

* Update code examples

* Skip tests

* Update integration test

* Address comment

* Make fixup

* Remove extra ids from tokenizer

* Skip test

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update year

* Address comment

* Address more comments

* Address comments

* Add copied from

* Update CI

* Rename script

* Update model id

* Add AddedToken, skip tests

* Update CI

* Fix doc tests

* Do not use Tesseract for the doc tests

* Remove kwargs

* Add original inputs

* Update casting

* Fix doc test

* Update question

* Update question

* Use LayoutLMv3ImageProcessor

* Update organization

* Improve docs

* Update forward signature

* Make images optional

* Remove deprecated device argument

* Add comment, add add_prefix_space

* More improvements

* Remove kwargs

---------
Co-authored-by: ArthurZucker <arthur.zucker@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

836921fd

20 Feb, 2024 1 commit
- Save (circleci) cache at the end of a job (#29141) · 7688d8df
  Yih-Dar authored Feb 20, 2024
```
nice job
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  7688d8df
19 Feb, 2024 1 commit

change version (#29097) · b2724d7b

Arthur authored Feb 19, 2024



* change version

* nuke

* this doesn't make sense

* update some requirements.py

* revert + no main

* nits

* change cache number

* more pin

* revert

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

b2724d7b

07 Feb, 2024 1 commit

Update the cache number (#28905) · 308d2b90

Yih-Dar authored Feb 07, 2024



* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

308d2b90

06 Feb, 2024 2 commits

Hotfix - make `torchaudio` get the correct version in `torch_and_flax_job` (#28899) · 40658be4
Yih-Dar authored Feb 06, 2024
```
* check

* check

* check

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
40658be4

unpin torch (#28892) · 89439fea

Yih-Dar authored Feb 06, 2024



* unpin torch

* check

* check

* check

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

89439fea

02 Feb, 2024 2 commits
- Use `-v` for `pytest` on CircleCI (#28840) · f4977959
  Yih-Dar authored Feb 02, 2024
```
use -v in pytest
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  f4977959
- Fix issues caused by natten (#28834) · 0e75aeef
  Yih-Dar authored Feb 02, 2024
```
try
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  0e75aeef
30 Jan, 2024 2 commits

Pin Torch to <2.2.0 (#28785) · 74c9cfea

Matt authored Jan 30, 2024



* Pin torch to <2.2.0

* Pin torchvision and torchaudio as well

* Playing around with versions to see if this helps

* twiddle something to restart the CI

* twiddle it back

* Try changing the natten version

* make fixup

* Revert "Try changing the natten version"

This reverts commit de0d6592c35dc39ae8b5a616c27285db28262d06.

* make fixup

* fix fix fix

* fix fix fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

74c9cfea

Further pin pytest version (in a temporary way) (#28780) · c24c5245
Yih-Dar authored Jan 30, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
c24c5245

10 Jan, 2024 1 commit
- CI: limit natten version (#28432) · ee2482b6
  Joao Gante authored Jan 10, 2024
  
  ee2482b6
03 Jan, 2024 1 commit

Add FastSpeech2Conformer (#23439) · d83ff5ee

Connor Henderson authored Jan 03, 2024

* start - docs, SpeechT5 copy and rename

* add relevant code from FastSpeech2 draft, have tests pass

* make it an actual conformer, demo ex.

* matching inference with original repo, includes debug code

* refactor nn.Sequentials, start more desc. var names

* more renaming

* more renaming

* vocoder scratchwork

* matching vocoder outputs

* hifigan vocoder conversion script

* convert model script, rename some config vars

* replace postnet with speecht5's implementation

* passing common tests, file cleanup

* expand testing, add output hidden states and attention

* tokenizer + passing tokenizer tests

* variety of updates and tests

* g2p_en pckg setup

* import structure edits

* docstrings and cleanup

* repo consistency

* deps

* small cleanup

* forward signature param order

* address comments except for masks and labels

* address comments on attention_mask and labels

* address second round of comments

* remove old unneeded line

* address comments part 1

* address comments pt 2

* rename auto mapping

* fixes for failing tests

* address comments part 3 (bart-like, train loss)

* make style

* pass config where possible

* add forward method + tests to WithHifiGan model

* make style

* address arg passing and generate_speech comments

* address Arthur comments

* address Arthur comments pt2

* lint  changes

* Sanchit comment

* add g2p-en to doctest deps

* move up self.encoder

* onnx compatible tensor method

* fix is symbolic

* fix paper url

* move models to espnet org

* make style

* make fix-copies

* update docstring

* Arthur comments

* update docstring w/ new updates

* add model architecture images

* header size

* md wording update

* make style

d83ff5ee

16 Nov, 2023 1 commit

[`Styling`] stylify using ruff (#27144) · 651408a0

Arthur authored Nov 16, 2023



* try to stylify using ruff

* might need to remove these changes?

* use ruf format andruff check

* use isinstance instead of type comparision

* use # fmt: skip

* use # fmt: skip

* nits

* soem styling changes

* update ci job

* nits isinstance

* more files update

* nits

* more nits

* small nits

* check and format

* revert wrong changes

* actually use formatter instead of checker

* nits

* well docbuilder is overwriting this commit

* revert notebook changes

* try to nuke docbuilder

* style

* fix feature exrtaction test

* remve `indent-width = 4`

* fixup

* more nits

* update the ruff version that we use

* style

* nuke docbuilder styling

* leve the print for detected changes

* nits

* Remove file I/O
Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>

* style

* nits

* revert notebook changes

* Add # fmt skip when possible

* Add # fmt skip when possible

* Fix

* More `  # fmt: skip` usage

* More `  # fmt: skip` usage

* More `  # fmt: skip` usage

* NIts

* more fixes

* fix tapas

* Another way to skip

* Recommended way

* Fix two more fiels

* Remove asynch
Remove asynch

---------
Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>

651408a0

10 Nov, 2023 1 commit
- Make `examples_torch_job` faster (#27437) · 7ee995fd
  Yih-Dar authored Nov 10, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  7ee995fd
09 Nov, 2023 2 commits

Final fix of the accelerate installation issue (#27408) · c8b6052f

Yih-Dar authored Nov 09, 2023



* fix

* [test-all] commit

* fix

* [test-all] commit

* [test-all] commit

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

c8b6052f

Use editable install for git deps (#27404) · c5037b45
Zach Mueller authored Nov 09, 2023
```
* Use editable install

* Full command
```
c5037b45

23 Oct, 2023 1 commit
- Limit to inferior fsspec version (#27010) · 70032949
  Lysandre Debut authored Oct 23, 2023
```
Pin fsspec
```
  70032949
18 Oct, 2023 1 commit

[`Tokenizer`] Fix slow and fast serialization (#26570) · ef7e9369

Arthur authored Oct 18, 2023

* fix

* last attempt

* current work

* fix forward compatibility

* save all special tokens

* current state

* revert additional changes

* updates

* remove tokenizer.model

* add a test and the fix

* nit

* revert one more break

* fix typefield issue

* quality

* more tests

* fix fields for FC

* more nits?

* new additional changes

* how

* some updates

* simplify all

* more nits

* revert some things to original

* nice

* nits

* a small hack

* more nits

* ahhaha

* fixup

* update

* make test run on ci

* use subtesting

* update

* Update .circleci/create_circleci_config.py

* updates

* fixup

* nits

* replace typo

* fix the test

* nits

* update

* None max dif pls

* a partial fix

* had to revert one thing

* test the fast

* updates

* fixup

* and more nits

* more fixes

* update

* Oupsy 👁



* nits

* fix marian

* on our way to heaven

* Update src/transformers/models/t5/tokenization_t5.py
Co-authored-by: Lysandre Debut <hi@lysand.re>

* fixup

* Update src/transformers/tokenization_utils_fast.py
Co-authored-by: Leo Tronchon <leo.tronchon@gmail.com>

* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Leo Tronchon <leo.tronchon@gmail.com>

* fix phobert

* skip some things, test more

* nits

* fixup

* fix deberta

* update

* update

* more updates

* skip one test

* more updates

* fix camembert

* can't test this one

* more good fixes

* kind of a major update

- seperate what is only done in fast in fast init and refactor
- add_token(AddedToken(..., speicla = True)) ignores it in fast
- better loading

* fixup

* more fixups

* fix pegasus and mpnet

* remove skipped tests

* fix phoneme tokenizer if self.verbose

* fix individual models

* update common tests

* update testing files

* all over again

* nits

* skip test for markup lm

* fixups

* fix order of addition in fast by sorting the added tokens decoder

* proper defaults for deberta

* correct default for fnet

* nits on add tokens, string initialized to special if special

* skip irrelevant herbert tests

* main fixes

* update test added_tokens_serialization

* the fix for bart like models and class instanciating

* update bart

* nit!

* update idefix test

* fix whisper!

* some fixup

* fixups

* revert some of the wrong chanegs

* fixup

* fixup

* skip marian

* skip the correct tests

* skip for tf and flax as well

---------
Co-authored-by: Lysandre Debut <hi@lysand.re>
Co-authored-by: Leo Tronchon <leo.tronchon@gmail.com>

ef7e9369

09 Oct, 2023 1 commit

Avoid CI OOM (#26639) · 740fc6a1

Yih-Dar authored Oct 09, 2023



fix avoid oom
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

740fc6a1

26 Sep, 2023 1 commit

Add Nougat (#25942) · ace74d16

NielsRogge authored Sep 26, 2023



* Add conversion script

* Add NougatImageProcessor

* Add crop margin

* More improvements

* Add docs, READMEs

* Remove print statements

* Include model_max_length

* Add NougatTokenizerFast

* Fix imports

* Improve postprocessing

* Improve image processor

* Fix image processor

* Improve normalize method

* More improvements

* More improvements

* Add processor, improve docs

* Simplify fast tokenizer

* Remove test file

* Fix docstrings

* Use NougatProcessor in conversion script

* Add is_levensthein_available

* Add tokenizer tests

* More improvements

* Use numpy instead of opencv

* Add is_cv2_available

* Fix cv2_available

* Add is_nltk_available

* Add image processor tests, improve crop_margin

* Add integration tests

* Improve integration test

* Use do_rescale instead of hacks, thanks Amy

* Remove random_padding

* Address comments

* Address more comments

* Add import

* Address more comments

* Address more comments

* Address comment

* Address comment

* Set max_model_input_sizes

* Add tests

* Add requires_backends

* Add Nougat to exotic tests

* Use to_pil_image

* Address comment regarding nltk

* Add NLTK

* Improve variable names, integration test

* Add test

* refactor, document, and test regexes

* remove named capture groups, add comments

* format

* add non-markdown fixed tokenization

* format

* correct flakyness of args parse

* add regex comments

* test functionalities for crop_image, align long axis and expected output

* add regex tests

* remove cv2 dependency

* test crop_margin equality between cv2 and python

* refactor table regexes to markdown

add newline

* change print to log, improve doc

* fix high count tables correction

* address PR comments: naming, linting, asserts

* Address comments

* Add copied from

* Update conversion script

* Update conversion script to convert both small and base versions

* Add inference example

* Add more info

* Fix style

* Add require annotators to test

* Define all keyword arguments explicitly

* Move cv2 annotator

* Add tokenizer init method

* Transfer checkpoints

* Add reference to Donut

* Address comments

* Skip test

* Remove cv2 method

* Add copied from statements

* Use cached_property

* Fix docstring

* Add file to not doctested

---------
Co-authored-by: Pablo Montalvo <pablo.montalvo.leroux@gmail.com>

ace74d16

22 Sep, 2023 1 commit
- Use CircleCI `store_test_results` (#26223) · 06ee91ae
  Yih-Dar authored Sep 22, 2023
```
store_test_results
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  06ee91ae
19 Sep, 2023 1 commit

Fix `Error` not captured in PR doctesting (#26215) · 39df4eca

Yih-Dar authored Sep 19, 2023



* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

39df4eca

07 Sep, 2023 1 commit
- Fix CircleCI config (#26023) · 0188739a
  Yih-Dar authored Sep 07, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  0188739a
05 Sep, 2023 2 commits

[`CI`] Fix red CI and ERROR failed should show (#25995) · d0354e5e

Arthur authored Sep 05, 2023

* start with error too

* fix ?

* start with nit

* one more path

* use `job_name`

* mark pipeline test as slow

d0354e5e

Show failed tests on CircleCI layout in a better way (#25895) · aa5c94d3
Yih-Dar authored Sep 05, 2023
```
* update

* update

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
aa5c94d3

30 Aug, 2023 1 commit
- Reduce CI output (#25876) · 1c6f072d
  Yih-Dar authored Aug 30, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  1c6f072d
11 Aug, 2023 2 commits
- Revert "Reuse the cache created for latest `main` on PRs/branches" (#25466) · fe3c8ab1
  Yih-Dar authored Aug 11, 2023
```
Revert "Reuse the cache created for latest `main` on PRs/branches if `setup.py` is not modified (#25445)"

This reverts commit 1d757686.
```
  fe3c8ab1
- Reuse the cache created for latest `main` on PRs/branches if `setup.py` is not modified (#25445) · 1d757686
  Yih-Dar authored Aug 11, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  1d757686
08 Aug, 2023 2 commits
- Use small config for `OneFormerModelTest.test_model_with_labels` (#25383) · 5b517e17
  Yih-Dar authored Aug 08, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  5b517e17
- Fix `torch_job` worker(s) crashing (#25374) · 9e57e0c0
  Yih-Dar authored Aug 08, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  9e57e0c0
02 Aug, 2023 2 commits
- CI with `pytest_num_workers=8` for torch/tf jobs (#25274) · 2bd7a27a
  Yih-Dar authored Aug 02, 2023
```
n8
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  2bd7a27a
- Remove `pytest_options={"rA": None}` in CI (#25263) · 8edd0da9
  Yih-Dar authored Aug 02, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  8edd0da9
18 Jul, 2023 2 commits
- Separate CircleCI cache between `main` and `pull` (or other branches) (#24886) · 30c172fc
  Yih-Dar authored Jul 18, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  30c172fc
- Fix CircleCI cache (#24880) · f14c7f99
  Yih-Dar authored Jul 18, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  f14c7f99
17 Jul, 2023 1 commit
- Fix the fetch of all example tests (#24864) · 12b908c6
  Sylvain Gugger authored Jul 17, 2023
  
  12b908c6
13 Jul, 2023 1 commit

Run hub tests (#24807) · f32303d5

Sylvain Gugger authored Jul 13, 2023

* Run hub tests

* [all-test] Run tests please!

* [all-test] Add vision dep for hub tests

* Fix tests

f32303d5

05 Jul, 2023 1 commit

Unpin `huggingface_hub` (#24667) · 050ef145

Yih-Dar authored Jul 05, 2023



* fix

* fix

* fix

* [test all] commit

* [test all] commit

* [test all] commit

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

050ef145

27 Jun, 2023 1 commit
- Update `huggingface_hub` commit sha (#24527) · 7d150d68
  Yih-Dar authored Jun 27, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  7d150d68
22 Jun, 2023 1 commit

Save `site-packages` as cache in CircleCI job (#24424) · 2c977e4a

Yih-Dar authored Jun 22, 2023



* fix

* fix

* Upgrade complete!

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2c977e4a