Commits · e7da38f5dc84956c4459746d0fa2ea4aa153767c · chenpangpang / transformers

01 Sep, 2022 1 commit
- Unpin fsspec (#18846) · a26c7523
  Albert Villanova del Moral authored Sep 01, 2022
  
  a26c7523
31 Aug, 2022 2 commits
- Pin ffspec (#18837) · 74690b62
  Sylvain Gugger authored Aug 31, 2022
```
* Pin ffspec

* Typo
```
  74690b62
- Pin max tf version (#18818) · fea4636c
  Joao Gante authored Aug 31, 2022
  
  fea4636c
08 Aug, 2022 2 commits

unpin resampy (#18527) · ec8d2624
Yih-Dar authored Aug 08, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
ec8d2624

Fix compatibility with 1.12 (#17925) · 70b0d4e1

Sylvain Gugger authored Aug 08, 2022



* Fix compatibility with 1.12

* Remove pin from examples requirements

* Update torch scatter version

* fix torch.onnx.symbolic_opset12 import

* Reject bad version
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

70b0d4e1

05 Aug, 2022 1 commit
- Remove py.typed (#18485) · c7849d9e
  Sylvain Gugger authored Aug 05, 2022
  
  c7849d9e
03 Aug, 2022 1 commit
- Update pinned hhub version (#18448) · a507908c
  Omar Sanseviero authored Aug 03, 2022
```
* Update pinned hhub version

* Make style
```
  a507908c
01 Aug, 2022 2 commits
- Fix ROUGE add example check and update README (#18398) · 941d2331
  Sylvain Gugger authored Aug 01, 2022
```
* Fix ROUGE add example check and update README

* Stay consistent in values
```
  941d2331
- Add evaluate to test dependencies (#18396) · af1e6b4d
  Sylvain Gugger authored Aug 01, 2022
  
  af1e6b4d
27 Jul, 2022 1 commit
- Dev version · c89a592e
  Lysandre authored Jul 27, 2022
  
  c89a592e
08 Jul, 2022 1 commit
- Fix slow CI by pinning resampy (#18077) · 9bd39685
  Sylvain Gugger authored Jul 08, 2022
```
* Fix slow CI by pinning resampy

* Actually put it in the speech dependencies
```
  9bd39685
05 Jul, 2022 1 commit
- [Flax] Bump to v0.4.1 (#17966) · ec07eccc
  Sanchit Gandhi authored Jul 05, 2022
  
  ec07eccc
28 Jun, 2022 2 commits
- Pin PyTorch while we fix compatibility with 1.12 · 5a3d0cbd
  Sylvain Gugger authored Jun 28, 2022
  
  5a3d0cbd
- Pin black to 22.3.0 to benefit from a stable --preview flag (#17918) · 1dfa03f1
  Lysandre Debut authored Jun 28, 2022
  
  1dfa03f1
27 Jun, 2022 1 commit

Add a TF in-graph tokenizer for BERT (#17701) · ee0d001d

Matt authored Jun 27, 2022

* Add a TF in-graph tokenizer for BERT

* Add from_pretrained

* Add proper truncation, option handling to match other tokenizers

* Add proper imports and guards

* Add test, fix all the bugs exposed by said test

* Fix truncation of paired texts in graph mode, more test updates

* Small fixes, add a (very careful) test for savedmodel

* Add tensorflow-text dependency, make fixup

* Update documentation

* Update documentation

* make fixup

* Slight changes to tests

* Add some docstring examples

* Update tests

* Update tests and add proper lowercasing/normalization

* make fixup

* Add docstring for padding!

* Mark slow tests

* make fixup

* Fall back to BertTokenizerFast if BertTokenizer is unavailable

* Fall back to BertTokenizerFast if BertTokenizer is unavailable

* make fixup

* Properly handle tensorflow-text dummies

ee0d001d

17 Jun, 2022 1 commit

Migrate HFDeepSpeedConfig from trfrs to accelerate (#17623) · 21a77242

Sourab Mangrulkar authored Jun 17, 2022



* Migrate HFDeepSpeedConfig from trfrs to accelerate

* add `accelerate` to testing dep

* addressing comments

* addressing comments

Using `_shared_state` and avoiding object creation. This is necessary because `notebook_launcher` in `launchers.py` checks `len(AcceleratorState._shared_state) > 0` and throws an error when that is true.

* resolving comments

1. Use simple API from accelerate to manage the deepspeed config integration
2. Update the related documentation

* reverting changes and addressing comments

* docstring correction

* addressing nits

* addressing nits

* addressing nits 3

* bumping up the accelerate version to 0.10.0

* resolving import

* update setup.py to include deepspeed dependencies

* Update dependency_versions_table.py

* fixing imports

* reverting changes to CI dependencies for "run_tests_pipelines_tf*" tests

These changes didn't help with resolving the failures and I believe this needs to be addressed in another PR.

* removing `accelerate` as hard dependency

Resolves issues related to CI Tests

* adding `accelerate` as dependency for building docs

resolves failure in Build PR Documentation test

* adding `accelerate` as dependency in "dev" to resolve doc build issue

* resolving comments

1. adding `accelerate` to extras["all"]
2. Including check for accelerate too before import HFDeepSpeedConfig from there
Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* resolving comments
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

21a77242

16 Jun, 2022 1 commit
- v4.21.0.dev0 · 7c6ec195
  Sylvain Gugger authored Jun 16, 2022
  
  7c6ec195
02 Jun, 2022 2 commits
- [trainer/deepspeed] load_best_model (reimplement re-init) (#17151) · 2f59ad16
  Stas Bekman authored Jun 02, 2022
```
* [trainer/deepspeed] load_best_model

* to sync with DS PR #1947

* simplify

* rework load_best_model test

* cleanup

* bump deepspeed>=0.6.5
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
```
  2f59ad16
- Clean README in post release job as well. (#17519) · f128ccb9
  Sylvain Gugger authored Jun 02, 2022
  
  f128ccb9
26 May, 2022 1 commit
- Pin protobouf that breaks TensorBoard in PyTorch (#17440) · 7535d92e
  Sylvain Gugger authored May 26, 2022
  
  7535d92e
23 May, 2022 1 commit

Use Accelerate in `from_pretrained` for big model inference (#17341) · 56f50590

Sylvain Gugger authored May 23, 2022



* Initial work

* More or less finished with first draft

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Fix randomly initialized weights

* Update src/transformers/modeling_utils.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Address review comments

* Rename DeepSpeed folder to temporarily fix the test issue?

* Revert to try if Accelerate fix works

* Use latest Accelerate release

* Quality and fixes

* Style

* Quality

* Add doc

* Test + fix

* More blocks
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

56f50590

20 May, 2022 1 commit

Pin dill to fix examples (#17368) · 3fd7de49

Sylvain Gugger authored May 20, 2022

* Pin dill for now

* Try this version?

* force install

* Actually use dep in testing

* Try a larger pin

3fd7de49

12 May, 2022 2 commits
- Black preview (#17217) · afe5d42d
  Sylvain Gugger authored May 12, 2022
```
* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black
```
  afe5d42d
- Dev version · 5294fa12
  Lysandre Debut authored May 12, 2022
  
  5294fa12
10 May, 2022 1 commit

[Deepspeed] add many more models to the model zoo test (#12695) · f8615044

Stas Bekman authored May 10, 2022

* model zoo take 2

* add deberta

* new param for zero2

* doc update

* doc update

* add layoutlm

* bump deepspeed

* add deberta-v2, funnel, longformer

* new models

* style

* add t5_v1

* update TAPAS status

* reorg problematic models

* move doc to another PR

* style

* fix checkpoint check test

* making progress on more models running

* cleanup

* new version

* cleanup

f8615044

09 May, 2022 1 commit

Add the auto_find_batch_size capability from Accelerate into Trainer (#17068) · 2fbb2379

Zachary Mueller authored May 09, 2022


Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

- Adds auto_batch_size finder 
- Moves training loop to an inner training loop

2fbb2379

04 May, 2022 1 commit

Skip RoFormer ONNX test if rjieba not installed (#16981) · 4bb1d0ec

lewtun authored May 04, 2022

* Skip RoFormer ONNX test if rjieba not installed

* Update deps table

* Skip RoFormer serialization test

* Fix RoFormer vocab

* Add rjieba to CircleCI

4bb1d0ec

02 May, 2022 2 commits
- Clean up setup.py (#17045) · 1073f00d
  Sylvain Gugger authored May 02, 2022
```
* Clean up setup.py

* Trigger CI

* Upgrade Python used
```
  1073f00d
- Make the sacremoses dependency optional (#17049) · 30ca5299
  Lysandre Debut authored May 02, 2022
```
* Make sacremoses optional

* Pickle
```
  30ca5299
29 Apr, 2022 1 commit
- Result of new doc style with fixes (#17015) · 7152ed2b
  Sylvain Gugger authored Apr 29, 2022
```
* Result of new doc style with fixes

* Add last two files

* Bump hf-doc-builder
```
  7152ed2b
28 Apr, 2022 1 commit
- Update README to latest release (#16997) · e6f00a11
  Sylvain Gugger authored Apr 28, 2022
  
  e6f00a11
17 Apr, 2022 1 commit
- Pin Jax to last working release (#16808) · dee6f016
  Sylvain Gugger authored Apr 16, 2022
```
* Pin Jax to last working release

* Try lower

* Try lower
```
  dee6f016
15 Apr, 2022 1 commit

[trainer / deepspeed] fix hyperparameter_search (#16740) · ce2fef2a

Stas Bekman authored Apr 14, 2022

* [trainer / deepspeed] fix hyperparameter_search

* require optuna

* style

* oops

* add dep in the right place

* create deepspeed-testing dep group

* Trigger CI

ce2fef2a

06 Apr, 2022 1 commit
- Dev version · a180efe7
  Lysandre Debut authored Apr 06, 2022
  
  a180efe7
01 Apr, 2022 1 commit
- Pin tokenizers version <0.13 (#16539) · 53a4d6b1
  Lysandre Debut authored Apr 01, 2022
```
* Pin tokenizers version <0.13

* Style
```
  53a4d6b1
28 Mar, 2022 1 commit

Use doc builder styler (#16412) · 473709fc

Sylvain Gugger authored Mar 28, 2022

* Config update

* Use doc-builder styler

* Cleanup

* Adapt import

* We need it there too!

473709fc

24 Mar, 2022 1 commit
- bump cookiecutter version (#16387) · 9d88be57
  Yih-Dar authored Mar 24, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  9d88be57
23 Mar, 2022 1 commit

Updates the default branch from master to main (#16326) · eca77f47

Lysandre Debut authored Mar 23, 2022



* Updates the default branch from master to main

* Links from `master` to `main`

* Typo

* Update examples/flax/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

eca77f47

18 Mar, 2022 1 commit
- update jax version and re-enable some tests (#16254) · b25b92ac
  Suraj Patil authored Mar 18, 2022
  
  b25b92ac
12 Mar, 2022 1 commit

[Deepspeed] add support for bf16 mode (#14569) · 580dd87c

Stas Bekman authored Mar 11, 2022



* [WIP] add support for bf16 mode

* prep for bf16

* prep for bf16

* fix; zero2/bf16 is ok

* check bf16 is available

* test fixes

* enable zero3_bf16

* config files

* docs

* split stage_dtype; merge back to non-dtype-specific config file

* fix doc

* cleanup

* cleanup

* bfloat16 => bf16 to match the PR changes

* s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/

* test fixes/skipping

* move

* fix

* Update docs/source/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* backticks

* cleanup

* cleanup

* cleanup

* new version

* add note about grad accum in bf16
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

580dd87c