  27 Jun, 2022 (1 commit)
    • Add a TF in-graph tokenizer for BERT (#17701) · ee0d001d
      Matt authored
      * Add a TF in-graph tokenizer for BERT
      
      * Add from_pretrained
      
      * Add proper truncation, option handling to match other tokenizers
      
      * Add proper imports and guards
      
      * Add test, fix all the bugs exposed by said test
      
      * Fix truncation of paired texts in graph mode, more test updates
      
      * Small fixes, add a (very careful) test for SavedModel
      
      * Add tensorflow-text dependency, make fixup
      
      * Update documentation
      
      * Update documentation
      
      * make fixup
      
      * Slight changes to tests
      
      * Add some docstring examples
      
      * Update tests
      
      * Update tests and add proper lowercasing/normalization
      
      * make fixup
      
      * Add docstring for padding!
      
      * Mark slow tests
      
      * make fixup
      
      * Fall back to BertTokenizerFast if BertTokenizer is unavailable
      
      * Fall back to BertTokenizerFast if BertTokenizer is unavailable
      
      * make fixup
      
      * Properly handle tensorflow-text dummies
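As a usage sketch of what this PR enables: tokenization compiled into the TF graph itself, so a model-plus-tokenizer pipeline can be exported as a single SavedModel that accepts raw strings. This is a minimal sketch, assuming the `bert-base-uncased` checkpoint name and an installed `tensorflow-text`; it is not verbatim code from the PR.

```python
import tensorflow as tf
from transformers import TFBertTokenizer  # the in-graph tokenizer from #17701

# Build the tokenizer from a pretrained checkpoint (checkpoint name assumed).
tokenizer = TFBertTokenizer.from_pretrained("bert-base-uncased")

@tf.function
def tokenize(texts):
    # Runs as TF graph ops (via tensorflow-text), so it can live inside a
    # SavedModel with no Python-side preprocessing step.
    return tokenizer(texts)

batch = tokenize(tf.constant(["hello world", "in-graph tokenization"]))
# `batch` is a dict of tensors: input_ids, attention_mask, token_type_ids
```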
  17 Jun, 2022 (1 commit)
    • Migrate HFDeepSpeedConfig from trfrs to accelerate (#17623) · 21a77242
      Sourab Mangrulkar authored
      * Migrate HFDeepSpeedConfig from trfrs to accelerate
      
      * add `accelerate` to testing dep
      
      * addressing comments
      
      * addressing comments
      
      Using `_shared_state` and avoiding object creation. This is necessary because `notebook_launcher` in `launchers.py` checks `len(AcceleratorState._shared_state) > 0` to throw an error.
      
      * resolving comments
      
      1. Use the simple API from accelerate to manage the DeepSpeed config integration
      2. Update the related documentation
      
      * reverting changes and addressing comments
      
      * docstring correction
      
      * addressing nits
      
      * addressing nits
      
      * addressing nits 3
      
      * bumping up the accelerate version to 0.10.0
      
      * resolving import
      
      * update setup.py to include deepspeed dependencies
      
      * Update dependency_versions_table.py
      
      * fixing imports
      
      * reverting changes to CI dependencies for "run_tests_pipelines_tf*" tests
      
      These changes didn't help with resolving the failures and I believe this needs to be addressed in another PR.
      
      * removing `accelerate` as hard dependency
      
      Resolves issues related to CI Tests
      
      * adding `accelerate` as dependency for building docs
      
      resolves failure in Build PR Documentation test
      
      * adding `accelerate` as dependency in "dev" to resolve doc build issue
      
      * resolving comments
      
      1. adding `accelerate` to extras["all"]
      2. also checking for accelerate before importing HFDeepSpeedConfig from there
      Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * resolving comments
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
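As a sketch of the soft-dependency pattern this PR lands on (accelerate is checked for before HFDeepSpeedConfig is imported from it): the helper name `is_accelerate_available` and the `accelerate.utils.deepspeed` module path are assumptions based on this commit message, not verbatim PR code.

```python
# Guarded import: only pull the DeepSpeed config helper from accelerate
# when accelerate is actually installed (it is no longer a hard dependency).
from transformers.utils import is_accelerate_available  # assumed helper name

if is_accelerate_available():
    from accelerate.utils.deepspeed import HfDeepSpeedConfig  # assumed path
else:
    HfDeepSpeedConfig = None  # DeepSpeed features simply stay disabled
```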
  10 May, 2022 (1 commit)
    • [Deepspeed] add many more models to the model zoo test (#12695) · f8615044
      Stas Bekman authored
      * model zoo take 2
      
      * add deberta
      
      * new param for zero2
      
      * doc update
      
      * doc update
      
      * add layoutlm
      
      * bump deepspeed
      
      * add deberta-v2, funnel, longformer
      
      * new models
      
      * style
      
      * add t5_v1
      
      * update TAPAS status
      
      * reorg problematic models
      
      * move doc to another PR
      
      * style
      
      * fix checkpoint check test
      
      * making progress on more models running
      
      * cleanup
      
      * new version
      
      * cleanup
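For context on the configs these zoo tests run against, here is a minimal sketch of a ZeRO stage-2 DeepSpeed config; the keys follow DeepSpeed's documented schema and the `"auto"` placeholders follow the transformers integration's convention, but the exact values are illustrative assumptions, not the test fixtures themselves.

```python
# Hedged sketch of a ZeRO-2 DeepSpeed config of the kind the model zoo
# tests exercise; "auto" lets the HF Trainer fill in matching values.
ds_config_zero2 = {
    "fp16": {"enabled": "auto"},
    "zero_optimization": {
        "stage": 2,
        "offload_optimizer": {"device": "cpu"},  # optional CPU offload
        "overlap_comm": True,
        "contiguous_gradients": True,
    },
    "train_micro_batch_size_per_gpu": "auto",
    "train_batch_size": "auto",
}
```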
  12 Mar, 2022 (1 commit)
    • [Deepspeed] add support for bf16 mode (#14569) · 580dd87c
      Stas Bekman authored
      * [WIP] add support for bf16 mode
      
      * prep for bf16
      
      * prep for bf16
      
      * fix; zero2/bf16 is ok
      
      * check bf16 is available
      
      * test fixes
      
      * enable zero3_bf16
      
      * config files
      
      * docs
      
      * split stage_dtype; merge back to non-dtype-specific config file
      
      * fix doc
      
      * cleanup
      
      * cleanup
      
      * bfloat16 => bf16 to match the PR changes
      
      * s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/
      
      * test fixes/skipping
      
      * move
      
      * fix
      
      * Update docs/source/main_classes/deepspeed.mdx
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * backticks
      
      * cleanup
      
      * cleanup
      
      * cleanup
      
      * new version
      
      * add note about grad accum in bf16
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
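To make the bf16 mode concrete, a minimal sketch of a DeepSpeed config selecting bf16 instead of fp16: the `bf16` block mirrors DeepSpeed's schema, the save option reflects this PR's dtype-neutral rename, and the surrounding keys are illustrative assumptions.

```python
# Hedged sketch: enable bf16 (replacing the usual "fp16" block) and use
# the dtype-neutral weight-gathering option renamed in this PR
# (s/fp16/16bit/, e.g. save_fp16_model -> save_16bit_model).
ds_config_bf16 = {
    "bf16": {"enabled": True},  # instead of {"fp16": {"enabled": True}}
    "zero_optimization": {
        "stage": 3,
        # formerly stage3_gather_fp16_weights_on_model_save
        "stage3_gather_16bit_weights_on_model_save": True,
    },
}
```

Per the commit's closing note, gradient accumulation under bf16 carries a precision caveat: accumulating in bfloat16 keeps fewer mantissa bits than fp32, so long accumulation runs can lose low-order contributions.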