Commits · 5603fad2479ad22ca4689f6a4dbf56ef2f1f0973 · chenpangpang / transformers

16 Nov, 2023 1 commit
- Update the TF pin for 2.15 (#27375) · 4989e73e
  Matt authored Nov 16, 2023
```
* Move the TF pin for 2.15

* make fixup
```
  4989e73e
15 Nov, 2023 1 commit

[`tokenizers`] update `tokenizers` version pin (#27494) · 3d1a7bf4

Arthur authored Nov 15, 2023



* update `tokenizers` version pin

* force tokenizers>=0.15

* use 0.14
Co-authored-by: Lysandre <lysandre@huggingface.co>

---------
Co-authored-by: Lysandre <lysandre@huggingface.co>

3d1a7bf4

26 Oct, 2023 1 commit

Save TB logs as part of push_to_hub (#27022) · 34a64064

Zach Mueller authored Oct 26, 2023

* Support runs/

* Upload runs folder as part of push to hub

* Add a test

* Add to test deps

* Update with proposed solution from Slack

* Ensure that repo gets deleted in tests

34a64064

23 Oct, 2023 1 commit
- Limit to inferior fsspec version (#27010) · 70032949
  Lysandre Debut authored Oct 23, 2023
```
Pin fsspec
```
  70032949
19 Oct, 2023 1 commit

Pin Keras for now (#26904) · cbd278f0

Matt authored Oct 19, 2023

* Pin Keras for now out of paranoia

* Add the keras pin to _tests_requirements.txt too

* Make sure the Keras version matches the TF one

* make fixup

cbd278f0

06 Oct, 2023 1 commit

remove SharedDDP as it is deprecated (#25702) · 27597fea

statelesshz authored Oct 06, 2023



* remove SharedDDP as it was deprecated

* apply review suggestion

* make style

* Oops, forgot to remove the compute_loss context manager in Seq2SeqTrainer.

* remove the unnecessary conditional statement

* keep the logic of IPEX

* clean code

* mix precision setup & make fixup

---------
Co-authored-by: statelesshz <jihuazhong1@huawei.com>

27597fea

21 Sep, 2023 1 commit
- update hf hub dependency to be compatible with the new tokenizers (#26301) · b132c170
  Arthur authored Sep 21, 2023
  
  b132c170
18 Sep, 2023 1 commit

🚨 [`Tokenizer`] attempt to fix add_token issues 🚨 (#23909) · 2da88537

Arthur authored Sep 18, 2023



* fix test for bart. Order is correct now let's skip BPEs

* ouf

* styling

* fix bert....

* slow refactoring

* current updates

* massive refactoring

* update

* NICE!

* update to see where I am at

* updates

* update

* update

* revert

* updates

* updates

* start supporting legacy_save

* styling

* big update

* revert some changes

* nits

* nniiiiiice

* small fixes

* kinda fix t5 with new behaviour

* major update

* fixup

* fix copies

* today's updates

* fix byt5

* update

* update

* update

* updates

* update vocab size test

* Barthez does not need the fairseq offset ids

* super call must be after

* call super

* move all super init

* move other super init

* fixup

* nits

* more fixes

* nits

* more fixes

* nits

* more fix

* remove useless files

* ouch all of them are affected

* and more!

* small improvements

* no more sanitize token

* more changes around unique no split tokens

* partially fix more things

* keep legacy save but add warning

* so... more fixes

* updates

* guess deberta tokenizer could be nuked

* fixup

* fixup did some bad things

* nuke it if it breaks

* remove prints and pretrain fast from slow with new format.

* fixups

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fiou

* nit

* by default specials should not be normalized?

* update

* remove breakpoint

* updates

* a lot of updates

* fixup

* fixes revert some changes to match fast

* small nits

* that makes it cleaner

* fix camembert accordingly

* update

* some less breaking changes

* update

* fixup

* fix byt5 and whisper mostly

* some more fixes, canine's byte vocab

* fix gpt2

* fix most of the perceiver tests (4 left)

* fix layoutlmv3

* fixup

* fix copies for gpt2 style

* make sure to only warn once

* fix perceiver and gpt2 tests

* some more backward compatibility: also read special tokens map because some people use it

* fixup

* add else when reading

* nits

* fresh updates

* fix copies

* will this make everything faster?

* fixes

* more fixes

* update

* more fixes

* fixup

* is the source of truth right?

* sorry camembert for the troubles

* current updates

* fixup

* update led

* update

* fix regression

* fix single word

* more model specific fixes

* fix t5 tests

* fixup

* more comments

* update

* fix nllb

* rstrip removed

* small fixes

* better handle additional_special_tokens and vocab sizes

* fixing

* styling

* fix 4 / 21

* fixup

* fix nllb's tests

* some fixes

* fix t5

* fixes

* style

* fix canine tests

* damn this is nice

* nits

* m2m100 nit

* fixups

* fixes!

* fixup

* stash

* fix merge

* revert bad change

* fixup

* correct order for code Llama

* fix speecht5 post merge

* styling

* revert source of 11 fails

* small nits

* all changes in one go

* fnet hack

* fix 2 more tests

* update based on main branch of tokenizers

* fixup

* fix VITS issues

* more fixes

* fix mgp test

* fix camembert issues

* oups camembert still has 2 failing tests

* mluke fixes

* decode fixes

* small nits

* nits

* fix llama and vits

* fix camembert

* small nits

* more fixes when initialising a fast from a slow and etc

* fix one of the last test

* fix CPM tokenizer test

* fixups

* fix pop2piano

* fixup

* ⚠️ Change tokenizers required version ⚠️

* ⚠️ Change tokenizers required version ⚠️

* "tokenizers>=0.14,<0.15", don't forget smaller than

* fix musicgen tests and PreTrainedTokenizerFast

* fix owlvit and all

* update t5

* fix 800 red

* fix tests

* fix the fix of the fix of t5

* styling

* documentation nits

* cache _added_tokens_encoder

* fixups

* Nit

* fix red tests

* one last nit!

* make everything a lot simpler

* Now it's over 😉



* few small nits

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* updates that work for now

* tests that should not be skipped / changed and fixed next

* fixup

* i am ashamed

* push the fix

* update

* fixups

* nits

* fix added_tokens_encoder

* fix canine test

* fix pegasus vocab

* fix transfoXL

* fixup

* whisper needs to be fixed for train new

* pegasus nits

* more pegasus fixes

* minor update

* better error message in failed test

* fix whisper failing test

* fix whisper failing test

* fix pegasus

* fixup

* fix **** pegasus

* reset things

* remove another file

* attempts to fix the strange custom encoder and offset

* nits here and there

* update

* fixup

* nit

* fix the whisper test

* nits nits

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* updates based on review

* some small update to potentially remove

* nits

* import lru cache

* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Lysandre Debut <hi@lysand.re>

* move warning to `from_pretrained`

* update tests results now that the special tokens are always added

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Lysandre Debut <hi@lysand.re>

2da88537
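The `"tokenizers>=0.14,<0.15"` item in the commit above is a reminder that a lower bound alone would also admit the next minor release. A minimal sketch of how such a bounded range behaves, assuming purely numeric dotted versions: the `satisfies` helper is hypothetical and written for illustration only. It is not the resolver pip uses and ignores pre-releases and other PEP 440 details.

```python
import re

# Hypothetical helper for illustration: handles only numeric dotted versions.
_OPS = {
    ">=": lambda a, b: a >= b,
    "<=": lambda a, b: a <= b,
    "==": lambda a, b: a == b,
    "!=": lambda a, b: a != b,
    ">": lambda a, b: a > b,
    "<": lambda a, b: a < b,
}
_CLAUSE = re.compile(r"^(>=|<=|==|!=|>|<)\s*([0-9]+(?:\.[0-9]+)*)$")


def _parse(version):
    """Turn a version such as '0.14.1' into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def _pad(a, b):
    """Right-pad the shorter tuple with zeros so that 0.15 compares equal to 0.15.0."""
    width = max(len(a), len(b))
    return a + (0,) * (width - len(a)), b + (0,) * (width - len(b))


def satisfies(version, spec):
    """Return True if `version` meets every comma-separated clause in `spec`."""
    installed = _parse(version)
    for clause in spec.split(","):
        match = _CLAUSE.match(clause.strip())
        if match is None:
            raise ValueError(f"unsupported clause: {clause!r}")
        op, bound = match.groups()
        left, right = _pad(installed, _parse(bound))
        if not _OPS[op](left, right):
            return False
    return True


SPEC = ">=0.14,<0.15"
print(satisfies("0.14.1", SPEC))      # True: inside the pinned range
print(satisfies("0.15.0", SPEC))      # False: excluded by the upper bound
print(satisfies("0.15.0", ">=0.14"))  # True: without the upper bound, 0.15 slips through
```

For real projects, a PEP 440 aware library is the safer choice; the hand-rolled comparison here only shows the effect of the upper bound.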

31 Aug, 2023 1 commit
- Update `setup.py` (#25893) · 3fb1535b
  Yih-Dar authored Aug 31, 2023
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  3fb1535b
22 Aug, 2023 1 commit

TF 2.14 compatibility (#25630) · 62396cff

Matt authored Aug 22, 2023

* Update the TF pin and see if anything breaks

* make fixup

* make fixup

* make fixup

62396cff

07 Aug, 2023 1 commit

Migrate Trainer from `Repository` to `upload_folder` (#25095) · baf1daa5

Sylvain Gugger authored Aug 07, 2023



* First draft

* Deal with progress bars

* Update src/transformers/utils/hub.py
Co-authored-by: Lucain <lucainp@gmail.com>

* Address review comments

* Forgot one

* Pin hf_hub

* Add argument for push all and fix tests

* Fix tests

* Address review comments

---------
Co-authored-by: Lucain <lucainp@gmail.com>

baf1daa5

03 Aug, 2023 1 commit
- [JAX] Bump min version (#25286) · 66c240f3
  Sanchit Gandhi authored Aug 03, 2023
```
* [JAX] Bump min version

* make fixup
```
  66c240f3
13 Jul, 2023 1 commit
- Upgrade jax/jaxlib/flax pin versions (#24791) · e5381899
  Yih-Dar authored Jul 13, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  e5381899
03 Jul, 2023 1 commit
- Pin `Pillow` for now (#24633) · 6eedfa6d
  Yih-Dar authored Jul 03, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  6eedfa6d
30 Jun, 2023 2 commits

Limit Pydantic to V1 in dependencies (#24596) · d51aa48a

Serge Matveenko authored Jul 01, 2023



* Limit Pydantic to V1 in dependencies

Pydantic is about to release V2, which will break a lot of things. This change prevents `transformers` from being used with Pydantic V2, to avoid that breakage.

* more

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d51aa48a

Use protobuf 4 (#24599) · 299aafe5

Yih-Dar authored Jun 30, 2023



* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

299aafe5

28 Jun, 2023 2 commits
- Unpin DeepSpeed and require DS >= 0.9.3 (#24541) · 11cb6e0f
  Yih-Dar authored Jun 28, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  11cb6e0f
- ⚠️ Time to say goodbye to py37 (#24091) · e84bf1f7
  Yih-Dar authored Jun 28, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  e84bf1f7
23 Jun, 2023 1 commit

Improved keras imports (#24448) · 8e164c54

Matt authored Jun 23, 2023

* An end to accursed version-specific imports

* No more K.is_keras_tensor() either

* Update dependency tables

* Use a cleaner call context function getter

* Add a cap to <2.14

* Add cap to examples requirements too

8e164c54

14 Jun, 2023 1 commit
- Clean up old Accelerate checks (#24279) · 26a2ec56
  Sylvain Gugger authored Jun 14, 2023
```
* Clean up old Accelerate checks

* Put back imports
```
  26a2ec56
08 Jun, 2023 1 commit
- Update the pin on Accelerate (#24110) · 8c5f3067
  Sylvain Gugger authored Jun 08, 2023
  
  8c5f3067
07 Jun, 2023 1 commit

Up pinned accelerate version (#24089) · 5eb3d3c7

Zachary Mueller authored Jun 07, 2023

* Min accelerate

* Also min version

* Min accelerate

* Also min version

* To different minor version

* Empty

5eb3d3c7

01 Jun, 2023 1 commit
- Pin rhoknp (#23937) · 91931882
  Sylvain Gugger authored Jun 01, 2023
  
  91931882
31 May, 2023 2 commits

Upgrade safetensors version (#23911) · 55451c66
Zachary Mueller authored May 31, 2023
```
* Upgrade safetensors

* Second table
```
55451c66

Unpin numba (#23162) · 8f915c45

Sanchit Gandhi authored May 31, 2023

* fix for ragged list

* unpin numba

* make style

* np.object -> object

* propagate changes to tokenizer as well

* np.long -> "long"

* revert tokenization changes

* check with tokenization changes

* list/tuple logic

* catch numpy

* catch else case

* clean up

* up

* better check

* trigger ci

* Empty commit to trigger CI

8f915c45

12 May, 2023 1 commit

Only add files with modification outside doc blocks (#23327) · a3975f94

Yih-Dar authored May 12, 2023



* min. version for pytest

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

a3975f94

11 May, 2023 1 commit
- Agents extras (#23301) · 71b19ee2
  Lysandre Debut authored May 11, 2023
```
* Agents extras

* Add to docs
```
  71b19ee2
10 May, 2023 1 commit

chore: allow protobuf 3.20.3 requirement (#22759) · 0c65fb7c

José Ángel Rey Liñares authored May 10, 2023



* chore: allow protobuf 3.20.3

Allow latest bugfix release for protobuf (3.20.3)

* chore: update auto-generated dependency table

update auto-generated dependency table

* run in subprocess

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Apply suggestions

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

0c65fb7c
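Several commits in this log mention updating an auto-generated dependency table after changing a pin. A minimal sketch of one way such a table could be derived from a flat list of requirement strings: the `build_dep_table` name and the pin list are hypothetical. The entries only echo pins named in this log and are not a copy of the project's actual list.

```python
import re

# Illustrative pins only; the real list lives in the project's setup.py.
PINS = [
    "tokenizers>=0.14,<0.15",
    "pydantic<2",
    "natten>=0.14.5",
    "fsspec",
]

# The package name is everything before the first specifier character or space.
_NAME = re.compile(r"^([^!=<>~ ]+)")


def build_dep_table(pins):
    """Map each package name to its full requirement string."""
    table = {}
    for pin in pins:
        match = _NAME.match(pin)
        if match is None:
            raise ValueError(f"cannot find a package name in {pin!r}")
        table[match.group(1)] = pin
    return table


table = build_dep_table(PINS)
for name in sorted(table):
    print(f"{name!r}: {table[name]!r}")
```

Keeping the pins in one list and generating the lookup table from it means a version bump touches a single line, and the table can be regenerated rather than edited by hand.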

08 May, 2023 1 commit
- New version of Accelerate for the Trainer (#23204) · 94056b57
  Sylvain Gugger authored May 08, 2023
  
  94056b57
04 May, 2023 1 commit
- Pin urllib3 · 3341bb41
  Sylvain Gugger authored May 04, 2023
  
  3341bb41
03 May, 2023 1 commit
- Pin numba for now (#23118) · 4b6aecb4
  Sylvain Gugger authored May 02, 2023
  
  4b6aecb4
20 Apr, 2023 1 commit
- Pin flax & optax version (#22895) · e5f34871
  amyeroberts authored Apr 20, 2023
```
* Pin optax version

* Pin flax too

* Fixup
```
  e5f34871
18 Apr, 2023 1 commit
- Update accelerate version + warning check fix (#22833) · aec10d16
  Zachary Mueller authored Apr 18, 2023
  
  aec10d16
29 Mar, 2023 1 commit
- Pin ruff (#22455) · 2194943a
  Sylvain Gugger authored Mar 29, 2023
  
  2194943a
24 Mar, 2023 2 commits
- TensorFlow: pin maximum version to 2.12 (#22364) · 88dae78f
  Joao Gante authored Mar 24, 2023
  
  88dae78f
- Pin tensorflow-text to go with tensorflow (#22362) · 6587125c
  Sylvain Gugger authored Mar 24, 2023
```
* Pin tensorflow-text to go with tensorflow

* Make it more convenient to pin TensorFlow

* setup doesn't like f-strings
```
  6587125c
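The commit above pins `tensorflow-text` alongside `tensorflow` and aims to make the TensorFlow pin easier to move. One way to keep the two in step, sketched under assumptions: the function name and the version numbers below are hypothetical, and `str.format` is used instead of an f-string only to echo the commit's note (the log does not say why f-strings were a problem).

```python
def tensorflow_family_pins(upper_bound, lower_bound="2.6"):
    """Build matching requirement strings from a single upper bound.

    Both values are illustrative defaults, not the project's actual pins.
    """
    return [
        "tensorflow>={},<{}".format(lower_bound, upper_bound),
        "tensorflow-text<{}".format(upper_bound),
    ]


# Moving the pin now means changing one argument.
print(tensorflow_family_pins("2.13"))
```

With this shape, a later change such as the TF 2.14 and 2.15 updates elsewhere in this log would only need the single bound to be edited.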
22 Mar, 2023 1 commit
- [deepspeed] offload + non-cpuadam optimizer exception doc (#22044) · 89a0a9ea
  Stas Bekman authored Mar 21, 2023
```
* [deepspeed] offload + non-cpuadam optimizer exception doc

* deps
```
  89a0a9ea
21 Mar, 2023 2 commits
- Correct NATTEN function signatures and force new version (#22298) · 5990743f
  Ali Hassani authored Mar 21, 2023
  
  5990743f
- Time to Say Goodbye, torch 1.7 and 1.8 (#22291) · 67c2dbdb
  Yih-Dar authored Mar 21, 2023
```
* time to say goodbye, torch 1.7 and 1.8

* clean up torch_int_div

* clean up is_torch_less_than_1_8-9

* update

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  67c2dbdb
17 Mar, 2023 1 commit

Fix natten (#22229) · 3028b20a

Ali Hassani authored Mar 17, 2023

* Add kernel size to NATTEN's QK arguments.

The new NATTEN 0.14.5 supports PyTorch 2.0, but also adds an additional
argument to the QK operation to allow optional RPBs.

This ends up failing NATTEN tests.

This commit adds NATTEN back to circleci and adds the arguments to get
it working again.

* Force NATTEN >= 0.14.5

3028b20a