Commits · badb9d2aaa58df2fddc09a868d8e3e5655b101a3 · chenpangpang / transformers

"vscode:/vscode.git/clone" did not exist on "d0b942d1dcefd643571cbf1a48a0e51db611406e"

18 Aug, 2022 1 commit

[bnb] Move documentation (#18671) · a123eee9

Younes Belkada authored Aug 18, 2022



* fix bnb documentation

- move bnb documentation to `infer_gpu_many`

* small refactoring

- added text on infer_gpu_one
- added a small note on infer_gpu_many
- added customized multi gpu example on infer_gpu_many

* Update docs/source/en/perf_infer_gpu_many.mdx
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* apply suggestions
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

a123eee9

16 Aug, 2022 1 commit

[bnb] Minor modifications (#18631) · 6d175c11

Younes Belkada authored Aug 17, 2022



* bnb minor modifications

- refactor documentation
- add troubleshooting README
- add PyPi library on DockerFile

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* put in one block

- put bash instructions in one block

* update readme

- refactor a bit hardware requirements

* change text a bit

* Apply suggestions from code review
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* apply suggestions
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* add link to paper

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update tests/mixed_int8/README.md

* Apply suggestions from code review

* refactor a bit

* add instructions Turing & Amperer
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* add A6000

* clarify a bit

* remove small part

* Update tests/mixed_int8/README.md
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

6d175c11

08 Aug, 2022 1 commit
- Update perf_train_gpu_one.mdx (#18532) · f1f5de31
  Mishig Davaadorj authored Aug 08, 2022
  
  f1f5de31
06 Aug, 2022 1 commit

Just re-reading the whole doc every couple of months

😬

(#18489) · 8d1f9039

Julien Chaumond authored Aug 06, 2022

* Delete valohai.yaml

* NLP => ML

* typo

* website supports https

* datasets

* 60k + modalities

* unrelated link fixing for accelerate

* Ok those links were actually broken

* Fix link

* Make `AutoTokenizer` auto-link

* wording tweak

* add at least one non-nlp task

8d1f9039

13 Jul, 2022 1 commit

Enable torchdynamo with torch_tensorrt(fx path) (#17765) · 7ea6ccc2

Wei authored Jul 13, 2022



* enable fx2trt

* Update perf_train_gpu_one.mdx

* Update perf_train_gpu_one.mdx

* add lib check

* update

* format

* update

* fix import check

* fix isort

* improve doc

* refactor ctx manager

* fix isort

* black format

* isort fix

* fix format

* update args

* update black

* cleanups

* Update perf_train_gpu_one.mdx

* code refactor

* code refactor to init

* remove redundancy

* isort

* replace self.args with args
Co-authored-by: Stas Bekman <stas@stason.org>

7ea6ccc2

06 Jul, 2022 1 commit
- Doc to dataset (#18037) · 2e90c3df
  Sylvain Gugger authored Jul 06, 2022
```
* Link to the Datasets doc

* Remove unwanted file
```
  2e90c3df
01 Jul, 2022 1 commit
- Fix typo in perf_train_gpu_one.mdx (#17983) · cb425024
  Billy Cao authored Jul 01, 2022
  
  cb425024
16 May, 2022 1 commit

[WIP] [doc] performance/scalability revamp (#15723) · 71abd3ad

Stas Bekman authored May 16, 2022



* [doc] performance/scalability revamp

* link the new docs

* no :

* mixed precision

* work on the first doc

* expand the main doc

* Trigger CI

* style

* revamp single GPU training section

* work on training performance

* remove files not used anymore or will be added later

* final touches

* fix rebase

* Add hardware section to toctree

* fix toctree again

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove `fast_tokenizers` entry that was copied in rebase

* add warning about DP vs DDP

* remove todo

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix missing closure of codeblock

* Update docs/source/en/perf_train_gpu_many.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* sync with #16860

* update toc
Co-authored-by: leandro <leandro.vonwerra@spoud.io>
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

71abd3ad

20 Apr, 2022 1 commit
- [docs] fix url (#16860) · 67ed0e43
  Stas Bekman authored Apr 20, 2022
  
  67ed0e43
04 Apr, 2022 1 commit

Enable doc in Spanish (#16518) · b9a768b3

Sylvain Gugger authored Apr 04, 2022

* Reorganize doc for multilingual support

* Fix style

* Style

* Toc trees

* Adapt templates

b9a768b3

25 Mar, 2022 1 commit
- Big file_utils cleanup (#16396) · 088c1880
  Sylvain Gugger authored Mar 25, 2022
```
* Big file_utils cleanup

* This one still needs to be treated separately
```
  088c1880
09 Feb, 2022 1 commit

add model scaling section (#15119) · d923f762

Leandro von Werra authored Feb 09, 2022



* add model scaling section

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* integrate reviewer feedback

* initialize GPU properly

* add note about BnB optimizer

* move doc from `scaling.mdx` to `performance.mdx`

* integrate reviewer feedback

* revert section levels
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d923f762

17 Jan, 2022 1 commit
- [doc] new MoE paper (#15184) · edd3fce2
  Stas Bekman authored Jan 17, 2022
```
add new paper
```
  edd3fce2
15 Jan, 2022 1 commit
- [doc] performance: Efficient Software Prebuilds (#15147) · 669e3c50
  Stas Bekman authored Jan 14, 2022
```
* Efficient Software Prebuilds

* improve
```
  669e3c50
10 Jan, 2022 1 commit

[performance doc] Power and Cooling (#14935) · 37bc0b4e

Stas Bekman authored Jan 10, 2022



* [performance doc] Power and Cooling

* more docs

* Update docs/source/performance.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* reword
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

37bc0b4e

22 Dec, 2021 1 commit

Convert rst files (#14888) · 207594be

Sylvain Gugger authored Dec 22, 2021

* Convert all tutorials and guides

* Convert all remaining rst to mdx

* Track and fix bad links

207594be

16 Dec, 2021 1 commit
- Removes images to put them in a dataset (#14781) · 8010fda9
  Lysandre Debut authored Dec 16, 2021
```
* First try

* Update instructions
```
  8010fda9
15 Dec, 2021 1 commit
- [doc] performance: groups of operations by compute-intensity (#14757) · fdf3ce28
  Stas Bekman authored Dec 14, 2021
  
  fdf3ce28
11 Dec, 2021 1 commit
- [doc] document MoE model approach and current solutions (#14725) · 027074f4
  Stas Bekman authored Dec 10, 2021
```
* document MoE model approach

* additional info from Samyam

* fix
```
  027074f4
08 Dec, 2021 1 commit

[bf16 support] tweaks (#14580) · 12286612

Stas Bekman authored Dec 08, 2021



* [bf16 support] tweaks

* corrections
Co-authored-by: Manuel R. Ciosici <manuelrciosici@gmail.com>

12286612

03 Dec, 2021 1 commit

[trainer] add tf32-mode control (#14606) · 71b1bf7e

Stas Bekman authored Dec 03, 2021



* [trainer] add --tf32 support

* it's pt>=.17

* it's pt>=.17

* flip the default to True

* add experimental note

* simplify logic

* style

* switch to 3-state logic

* doc

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* re-style code
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

71b1bf7e

02 Dec, 2021 1 commit

Update doc img links (#14593) · 275402bf

Mishig Davaadorj authored Dec 02, 2021

* Update doc img links

* Rename toctree.yml -> _toctree.yml (#14594)

* Update doc img links

* Update performance.md img link

275402bf

01 Dec, 2021 1 commit

[doc] bf16/tf32 guide (#14579) · fbe278c7

Stas Bekman authored Dec 01, 2021



* [doc] bf16/tf32 guide

* expand

* expand

* Update docs/source/performance.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

fbe278c7

15 Nov, 2021 1 commit
- [doc] performance and parallelism updates (#14391) · 29dfb2db
  Stas Bekman authored Nov 14, 2021
```
* [doc] performance and parallelism doc update

* improve

* improve
```
  29dfb2db
22 Sep, 2021 1 commit

Make gradient_checkpointing a training argument (#13657) · 27d46397

Sylvain Gugger authored Sep 22, 2021



* Make gradient_checkpointing a training argument

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/configuration_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Fix tests

* Style

* document Gradient Checkpointing as a performance feature

* Small rename

* PoC for not using the config

* Adapt BC to new PoC

* Forgot to save

* Rollout changes to all other models

* Fix typo
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>

27d46397

15 Jul, 2021 1 commit
- [doc] performance: batch sizes (#12725) · 31cfcbd3
  Stas Bekman authored Jul 15, 2021
  
  31cfcbd3
22 Jun, 2021 1 commit

[docs] performance (#12258) · bfd5da8e

Stas Bekman authored Jun 22, 2021



* initial performance document

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* rewrites based on suggestions

* 8x multiple is for AMP only

* add contribute section
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

bfd5da8e