1. 03 Jul, 2023 1 commit
  2. 30 Jun, 2023 2 commits
  3. 28 Jun, 2023 2 commits
  4. 23 Jun, 2023 1 commit
    • Improved keras imports (#24448) · 8e164c54
      Matt authored
      * An end to accursed version-specific imports
      
      * No more K.is_keras_tensor() either
      
      * Update dependency tables
      
      * Use a cleaner call context function getter
      
      * Add a cap to <2.14
      
      * Add cap to examples requirements too
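      The version-specific imports this PR removed can be replaced by probing for what is importable instead of comparing version strings. A minimal sketch of that pattern (illustrative only — the helper name and candidate list are not from the PR):

      ```python
      import importlib


      def import_first_available(candidates):
          """Return the first importable module from `candidates`,
          so callers need no version-string comparisons."""
          for name in candidates:
              try:
                  return importlib.import_module(name)
              except ImportError:
                  continue
          raise ImportError(f"None of {candidates} could be imported")


      # e.g. prefer a standalone Keras over the TensorFlow-bundled copy:
      # keras = import_first_available(["tf_keras", "tensorflow.keras", "keras"])
      ```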
  5. 14 Jun, 2023 1 commit
  6. 08 Jun, 2023 1 commit
  7. 07 Jun, 2023 2 commits
  8. 01 Jun, 2023 1 commit
  9. 31 May, 2023 2 commits
    • Upgrade safetensors version (#23911) · 55451c66
      Zachary Mueller authored
      * Upgrade safetensors
      
      * Second table
    • Unpin numba (#23162) · 8f915c45
      Sanchit Gandhi authored
      * fix for ragged list
      
      * unpin numba
      
      * make style
      
      * np.object -> object
      
      * propagate changes to tokenizer as well
      
      * np.long -> "long"
      
      * revert tokenization changes
      
      * check with tokenization changes
      
      * list/tuple logic
      
      * catch numpy
      
      * catch else case
      
      * clean up
      
      * up
      
      * better check
      
      * trigger ci
      
      * Empty commit to trigger CI
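      The `np.object` / `np.long` bullets above track NumPy's removal of its deprecated builtin aliases, which also changed how ragged (uneven-length) lists must be built. A minimal sketch of the replacements (illustrative, not the PR's code):

      ```python
      import numpy as np

      # np.object was removed in NumPy 1.24; the builtin `object` is the
      # replacement, and it is what ragged lists require:
      ragged = [[1, 2, 3], [4, 5]]
      arr = np.asarray(ragged, dtype=object)  # dtype=np.object now raises AttributeError
      print(arr.dtype)  # object

      # np.long is likewise gone; the dtype string "long" still names C long:
      print(np.dtype("long").kind)  # i (signed integer)
      ```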
  10. 23 May, 2023 1 commit
  11. 16 May, 2023 1 commit
  12. 12 May, 2023 1 commit
  13. 11 May, 2023 2 commits
  14. 10 May, 2023 1 commit
  15. 09 May, 2023 1 commit
  16. 08 May, 2023 1 commit
  17. 04 May, 2023 1 commit
  18. 03 May, 2023 1 commit
  19. 20 Apr, 2023 1 commit
  20. 18 Apr, 2023 1 commit
  21. 17 Apr, 2023 1 commit
  22. 13 Apr, 2023 1 commit
  23. 07 Apr, 2023 1 commit
  24. 06 Apr, 2023 1 commit
    • Adding Llama FastTokenizer support. (#22264) · 1670be4b
      Nicolas Patry authored
      * Adding Llama FastTokenizer support.
      
      - Requires the https://github.com/huggingface/tokenizers/pull/1183 version of tokenizers.
      - Only supports byte_fallback for llama; raises otherwise (safety net).
      - Lots of open questions around special tokens.
      
      How to test:
      
      ```python
      # Convert the slow (sentencepiece) tokenizer and check that the fast
      # version round-trips every string identically.
      from transformers.convert_slow_tokenizer import convert_slow_tokenizer
      from transformers import AutoTokenizer
      from tokenizers import Tokenizer

      tokenizer = AutoTokenizer.from_pretrained("huggingface/llama-7b")

      # Flip to True to reuse a previously saved conversion instead of
      # converting from scratch each run.
      if False:
          new_tokenizer = Tokenizer.from_file("tok.json")
      else:
          new_tokenizer = convert_slow_tokenizer(tokenizer)
          new_tokenizer.save("tok.json")

      strings = [
          "This is a test",
          "生活的真谛是",
          "生活的真谛是[MASK]。",
          # XXX: This one is problematic because of special tokens
          # "<s> Something something",
      ]

      for string in strings:
          encoded = tokenizer(string)["input_ids"]
          encoded2 = new_tokenizer.encode(string).ids

          assert encoded == encoded2, f"{encoded} != {encoded2}"

          decoded = tokenizer.decode(encoded)
          decoded2 = new_tokenizer.decode(encoded2)

          assert decoded.strip() == decoded2, f"{repr(decoded)} != {repr(decoded2)}"
      ```
      
      The converter + some test script.
      
      The test script.
      
      Tmp save.
      
      Adding Fast tokenizer + tests.
      
      Adding the tokenization tests.
      
      Correct combination.
      
      Small fix.
      
      Fixing tests.
      
      Fixing with latest update.
      
      Rebased.
      
      fix copies + normalized added tokens  + copies.
      
      Adding doc.
      
      TMP.
      
      Doc + split files.
      
      Doc.
      
      Versions + try import.
      
      Fix Camembert + warnings -> Error.
      
      Fix by ArthurZucker.
      
      Not a decorator.
      
      * Fixing comments.
      
      * Adding more to docstring.
      
      * Doc rewriting.
  25. 03 Apr, 2023 2 commits
  26. 29 Mar, 2023 2 commits
  27. 24 Mar, 2023 2 commits
  28. 22 Mar, 2023 1 commit
  29. 21 Mar, 2023 2 commits
  30. 17 Mar, 2023 1 commit
    • Fix natten (#22229) · 3028b20a
      Ali Hassani authored
      * Add kernel size to NATTEN's QK arguments.
      
      The new NATTEN 0.14.5 supports PyTorch 2.0, but also adds an argument
      to the QK operation to allow optional RPBs.

      This ended up failing the NATTEN tests.

      This commit adds NATTEN back to CircleCI and passes the new arguments
      to get it working again.
      
      * Force NATTEN >= 0.14.5
  31. 14 Mar, 2023 1 commit