Commits · d8e13b3e04da9e61c6f16df43815656f59688abd · chenpangpang / transformers

04 Sep, 2023 1 commit
- v4.34.dev.0 · d8e13b3e
  Lysandre authored Sep 04, 2023
  
  d8e13b3e
31 Aug, 2023 1 commit
- Update `setup.py` (#25893) · 3fb1535b
  Yih-Dar authored Aug 31, 2023
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  3fb1535b
22 Aug, 2023 1 commit

TF 2.14 compatibility (#25630) · 62396cff

Matt authored Aug 22, 2023

* Update the TF pin and see if anything breaks

* make fixup

* make fixup

* make fixup

62396cff

21 Aug, 2023 1 commit
- v4.33.0.dev0 · 5c67682b
  Sylvain Gugger authored Aug 21, 2023
  
  5c67682b
17 Aug, 2023 1 commit

More utils doc (#25457) · 2defb6b0

Sylvain Gugger authored Aug 17, 2023

* Document and clean more utils.

* More documentation and fixes

* Switch to Lysandre's token

* Address review comments

* Actually put else

2defb6b0

07 Aug, 2023 1 commit

Migrate Trainer from `Repository` to `upload_folder` (#25095) · baf1daa5

Sylvain Gugger authored Aug 07, 2023



* First draft

* Deal with progress bars

* Update src/transformers/utils/hub.py
Co-authored-by: Lucain <lucainp@gmail.com>

* Address review comments

* Forgot one

* Pin hf_hub

* Add argument for push all and fix tests

* Fix tests

* Address review comments

---------
Co-authored-by: Lucain <lucainp@gmail.com>

baf1daa5

03 Aug, 2023 1 commit
- [JAX] Bump min version (#25286) · 66c240f3
  Sanchit Gandhi authored Aug 03, 2023
```
* [JAX] Bump min version

* make fixup
```
  66c240f3
17 Jul, 2023 1 commit
- 4.32.0.dev0 · e9ad5130
  Sylvain Gugger authored Jul 17, 2023
  
  e9ad5130
13 Jul, 2023 2 commits
- Update setup.py to be compatible with pipenv (#24789) · 08667050
  Georgie Mathews authored Jul 13, 2023
  
  08667050
- Upgrade jax/jaxlib/flax pin versions (#24791) · e5381899
  Yih-Dar authored Jul 13, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  e5381899
03 Jul, 2023 1 commit
- Pin `Pillow` for now (#24633) · 6eedfa6d
  Yih-Dar authored Jul 03, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  6eedfa6d
30 Jun, 2023 2 commits

Limit Pydantic to V1 in dependencies (#24596) · d51aa48a

Serge Matveenko authored Jul 01, 2023



* Limit Pydantic to V1 in dependencies

Pydantic is about to release V2, which will break a lot of things. This change prevents `transformers` from being used with Pydantic V2 to avoid breakage.

* more

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d51aa48a

Use protobuf 4 (#24599) · 299aafe5

Yih-Dar authored Jun 30, 2023



* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

299aafe5

28 Jun, 2023 2 commits
- Unpin DeepSpeed and require DS >= 0.9.3 (#24541) · 11cb6e0f
  Yih-Dar authored Jun 28, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  11cb6e0f
- ⚠️ Time to say goodbye to py37 (#24091) · e84bf1f7
  Yih-Dar authored Jun 28, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  e84bf1f7
23 Jun, 2023 1 commit

Improved keras imports (#24448) · 8e164c54

Matt authored Jun 23, 2023

* An end to accursed version-specific imports

* No more K.is_keras_tensor() either

* Update dependency tables

* Use a cleaner call context function getter

* Add a cap to <2.14

* Add cap to examples requirements too

8e164c54

14 Jun, 2023 1 commit
- Clean up old Accelerate checks (#24279) · 26a2ec56
  Sylvain Gugger authored Jun 14, 2023
```
* Clean up old Accelerate checks

* Put back imports
```
  26a2ec56
08 Jun, 2023 1 commit
- Update the pin on Accelerate (#24110) · 8c5f3067
  Sylvain Gugger authored Jun 08, 2023
  
  8c5f3067
07 Jun, 2023 2 commits
- v4.31.0.dev0 · ba695c1e
  Sylvain Gugger authored Jun 07, 2023
  
  ba695c1e
- Up pinned accelerate version (#24089) · 5eb3d3c7
  Zachary Mueller authored Jun 07, 2023
```
* Min accelerate

* Also min version

* Min accelerate

* Also min version

* To different minor version

* Empty
```
  5eb3d3c7
01 Jun, 2023 1 commit
- Pin rhoknp (#23937) · 91931882
  Sylvain Gugger authored Jun 01, 2023
  
  91931882
31 May, 2023 2 commits

Upgrade safetensors version (#23911) · 55451c66
Zachary Mueller authored May 31, 2023
```
* Upgrade safetensors

* Second table
```
55451c66

Unpin numba (#23162) · 8f915c45

Sanchit Gandhi authored May 31, 2023

* fix for ragged list

* unpin numba

* make style

* np.object -> object

* propagate changes to tokenizer as well

* np.long -> "long"

* revert tokenization changes

* check with tokenization changes

* list/tuple logic

* catch numpy

* catch else case

* clean up

* up

* better check

* trigger ci

* Empty commit to trigger CI

8f915c45

23 May, 2023 1 commit

Making `safetensors` a core dependency. (#23254) · 9e8d7066

Nicolas Patry authored May 23, 2023

* Making `safetensors` a core dependency.

To be merged later; I'm creating the PR so we can try it out.

* Update setup.py

* Remove duplicates.

* Even more redundant.

9e8d7066

16 May, 2023 1 commit
- Build with non Python files (#23405) · 9cf4a8b4
  Sylvain Gugger authored May 16, 2023
```
* Add a test of the built release

* Polish everything

* Trigger CI
```
  9cf4a8b4
12 May, 2023 1 commit

Only add files with modification outside doc blocks (#23327) · a3975f94

Yih-Dar authored May 12, 2023



* min. version for pytest

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

a3975f94

11 May, 2023 2 commits
- Style · 786b9cf5
  Sylvain Gugger authored May 11, 2023
  
  786b9cf5
- Agents extras (#23301) · 71b19ee2
  Lysandre Debut authored May 11, 2023
```
* Agents extras

* Add to docs
```
  71b19ee2
10 May, 2023 1 commit

chore: allow protobuf 3.20.3 requirement (#22759) · 0c65fb7c

José Ángel Rey Liñares authored May 10, 2023



* chore: allow protobuf 3.20.3

Allow latest bugfix release for protobuf (3.20.3)

* chore: update auto-generated dependency table

update auto-generated dependency table

* run in subprocess

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Apply suggestions

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

0c65fb7c

09 May, 2023 1 commit
- v4.30.0.dev0 · a0c0a782
  Sylvain Gugger authored May 09, 2023
  
  a0c0a782
08 May, 2023 1 commit
- New version of Accelerate for the Trainer (#23204) · 94056b57
  Sylvain Gugger authored May 08, 2023
  
  94056b57
04 May, 2023 1 commit
- Pin urllib3 · 3341bb41
  Sylvain Gugger authored May 04, 2023
  
  3341bb41
03 May, 2023 1 commit
- Pin numba for now (#23118) · 4b6aecb4
  Sylvain Gugger authored May 02, 2023
  
  4b6aecb4
20 Apr, 2023 1 commit
- Pin flax & optax version (#22895) · e5f34871
  amyeroberts authored Apr 20, 2023
```
* Pin optax version

* Pin flax too

* Fixup
```
  e5f34871
18 Apr, 2023 1 commit
- Update accelerate version + warning check fix (#22833) · aec10d16
  Zachary Mueller authored Apr 18, 2023
  
  aec10d16
17 Apr, 2023 1 commit

Introduce `PartialState` as the device handler in the `Trainer` (#22752) · 03462875

Zachary Mueller authored Apr 17, 2023



* Use accelerate for device management

* Add accelerate to setup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

03462875

13 Apr, 2023 1 commit
- v4.29.0.dev0 · 888c4a2a
  Sylvain Gugger authored Apr 12, 2023
  
  888c4a2a
07 Apr, 2023 1 commit
- Revert migration of setup to pyproject.toml (#22658) · 6db23af5
  Sylvain Gugger authored Apr 07, 2023
  
  6db23af5
06 Apr, 2023 1 commit

Adding Llama FastTokenizer support. (#22264) · 1670be4b

Nicolas Patry authored Apr 06, 2023

* Adding Llama FastTokenizer support.

- Requires https://github.com/huggingface/tokenizers/pull/1183 version
- Only support byte_fallback for llama, raise otherwise (safety net).
- Lots of open questions around special tokens

How to test:

```python
from tokenizers import Tokenizer
from transformers import AutoTokenizer
from transformers.convert_slow_tokenizer import convert_slow_tokenizer

# Reference tokenizer to convert and compare against.
tokenizer = AutoTokenizer.from_pretrained("huggingface/llama-7b")

# Set to True to reuse a previously saved "tok.json" instead of converting again.
LOAD_FROM_FILE = False

if LOAD_FROM_FILE:
    new_tokenizer = Tokenizer.from_file("tok.json")
else:
    new_tokenizer = convert_slow_tokenizer(tokenizer)
    new_tokenizer.save("tok.json")

strings = [
    "This is a test",
    "生活的真谛是",
    "生活的真谛是[MASK]。",
    # XXX: This one is problematic because of special tokens
    # "<s> Something something",
]

for string in strings:
    # Both tokenizers must produce the same token ids.
    encoded = tokenizer(string)["input_ids"]
    encoded2 = new_tokenizer.encode(string).ids

    assert encoded == encoded2, f"{encoded} != {encoded2}"

    # Decoded text must match once the reference output is stripped.
    decoded = tokenizer.decode(encoded)
    decoded2 = new_tokenizer.decode(encoded2)

    assert decoded.strip() == decoded2, f"{decoded!r} != {decoded2!r}"
```

The converter + some test script.

The test script.

Tmp save.

Adding Fast tokenizer + tests.

Adding the tokenization tests.

Correct combination.

Small fix.

Fixing tests.

Fixing with latest update.

Rebased.

fix copies + normalized added tokens + copies.

Adding doc.

TMP.

Doc + split files.

Doc.

Versions + try import.

Fix Camembert + warnings -> Error.

Fix by ArthurZucker.

Not a decorator.

* Fixing comments.

* Adding more to docstring.

* Doc rewriting.

1670be4b

03 Apr, 2023 1 commit

[setup] migrate setup script to `pyproject.toml` (#22539) · 4169dc84

Xuehai Pan authored Apr 04, 2023

* [setup] migrate setup script to `pyproject.toml`

* [setup] cleanup configurations

* remove unused imports

4169dc84