Commits · 01c081d1381e20b1f5b2f31d6fa8b0af8092c0b5 · chenpangpang / transformers

20 Dec, 2023 1 commit
- [docs] Trainer docs (#28145) · 01c081d1
  Steven Liu authored Dec 20, 2023
```
* fsdp, debugging, gpu selection

* fix hfoption

* fix
```
  01c081d1
15 Dec, 2023 2 commits
- [docs] MPS (#28016) · ebfdb9ca
  Steven Liu authored Dec 15, 2023
```
* mps docs

* toctree
```
  ebfdb9ca
- [docs] Trainer (#27986) · 0d63d177
  Steven Liu authored Dec 15, 2023
```
* first draft

* add to toctree

* edits

* feedback
```
  0d63d177
27 Nov, 2023 1 commit

docs: replace torch.distributed.run by torchrun (#27528) · ce315081

Peter Pan authored Nov 28, 2023



* docs: replace torch.distributed.run by torchrun

 `transformers` now officially support pytorch >= 1.10.
 The entrypoint `torchrun`` is present from 1.10 onwards.
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>

* Update src/transformers/trainer.py

with @ArthurZucker's suggestion
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

ce315081

24 Nov, 2023 2 commits

Reflect RoCm support in the documentation (#27636) · c13a43aa

fxmarty authored Nov 24, 2023



* reflect RoCm support in the documentation

* Update docs/source/en/main_classes/trainer.md
Co-authored-by: Lysandre Debut <hi@lysand.re>

* fix review comments

* use ROCm instead of RoCm

---------
Co-authored-by: Lysandre Debut <hi@lysand.re>

c13a43aa

Refactoring Trainer, adds `save_only_model` arg and simplifying FSDP integration (#27652) · a761d6e9

Sourab Mangrulkar authored Nov 24, 2023



* add code changes

1. Refactor FSDP
2. Add `--save_only_model` option: When checkpointing, whether to only save the model, or also the optimizer, scheduler & rng state.
3. Bump up the minimum `accelerate` version to `0.21.0`

* quality

* fix quality?

* Revert "fix quality?"

This reverts commit 149330a6abc078827be274db84c8a2d26a76eba1.

* fix fsdp doc strings

* fix quality

* Update src/transformers/training_args.py
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

* please fix the quality issue 😅



* Apply suggestions from code review
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

* address comment

* simplify conditional check as per the comment

* update documentation

---------
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

a761d6e9

20 Nov, 2023 1 commit
- docs: fix 404 link (#27529) · e4280d65
  Peter Pan authored Nov 20, 2023
```
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
```
  e4280d65
31 Oct, 2023 1 commit

[FEAT] Add Neftune into transformers Trainer (#27141) · 309a9066

Younes Belkada authored Oct 31, 2023



* add v1 neftune

* use `unwrap_model` instead

* add test + docs

* Apply suggestions from code review
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

* more details

* fixup

* Update docs/source/en/main_classes/trainer.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* refactor a bit

* more elaborated test

* fix unwrap issue

---------
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

309a9066

30 Oct, 2023 1 commit

Translating `en/main_classes` folder docs to Japanese

🇯🇵

(#26894) · 84724efd

Rockerz authored Oct 30, 2023



* add

* add

* add

* Add deepspeed.md

* Add

* add

* Update docs/source/ja/main_classes/callback.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/main_classes/output.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/main_classes/pipelines.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/main_classes/processors.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/main_classes/processors.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/main_classes/text_generation.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/main_classes/processors.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update  logging.md

* Update toctree.yml

* Update docs/source/ja/main_classes/deepspeed.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Add suggesitons

* m

* Update docs/source/ja/main_classes/trainer.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update toctree.yml

* Update Quantization.md

* Update docs/source/ja/_toctree.yml
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update toctree.yml

* Update docs/source/en/main_classes/deepspeed.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/main_classes/deepspeed.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

84724efd

24 Oct, 2023 1 commit
- add info on TRL docs (#27024) · b18e3140
  Leandro von Werra authored Oct 24, 2023
```
* add info on TRL docs

* add TRL link

* tweak text

* tweak text
```
  b18e3140
29 Aug, 2023 1 commit

Arde/fsdp activation checkpointing (#25771) · 738ecd17

Arup De authored Aug 29, 2023

* add FSDP config option to enable activation-checkpointing

* update docs

* add checks and remove redundant code

* fix formatting error

738ecd17

21 Aug, 2023 1 commit
- fix documentation for CustomTrainer (#25635) · 6f041fcb
  mchau authored Aug 21, 2023
```
fix doc
```
  6f041fcb
21 Jul, 2023 1 commit

fsdp fixes and enhancements (#24980) · f4eb459e

Sourab Mangrulkar authored Jul 21, 2023

* fix fsdp prepare to remove the warnings and fix excess memory usage

* Update training_args.py

* parity for FSDP+XLA

* Update trainer.py

f4eb459e

17 Jul, 2023 1 commit

deprecate `sharded_ddp` training argument (#24825) · 8ba26c18

statelesshz authored Jul 17, 2023



* deprecate fairscale's ShardedDDP

* fix code style

* roll back

* deprecate the `sharded_ddp` training argument

---------
Co-authored-by: jihuazhong <jihuazhong1@huawei.com>

8ba26c18

20 Jun, 2023 1 commit

Migrate doc files to Markdown. (#24376) · eb849f66

Sylvain Gugger authored Jun 20, 2023



* Rename index.mdx to index.md

* With saved modifs

* Address review comment

* Treat all files

* .mdx -> .md

* Remove special char

* Update utils/tests_fetcher.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

---------
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

eb849f66

13 Jun, 2023 2 commits

docs wrt using accelerate launcher with trainer (#24250) · e0603d89

Sourab Mangrulkar authored Jun 14, 2023



* update docs

* missing part

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* address comments

* address Zach's comment

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e0603d89

deprecate `use_mps_device` (#24239) · 3723329d
Sourab Mangrulkar authored Jun 13, 2023

3723329d

26 May, 2023 1 commit
- Update trainer.mdx class_weights example (#23787) · d61d7476
  amitportnoy authored May 26, 2023
```
class_weights tensor should follow model's device
```
  d61d7476
01 Mar, 2023 1 commit
- update FSDP and add XLA-FSDP documentation (#21812) · 571dd693
  Sourab Mangrulkar authored Mar 01, 2023
```
* update FSDP and add XLA-FSDP documentation

* resolving comments

* minor update

* fix xla-fsdp docs
```
  571dd693
07 Nov, 2022 1 commit

docs: Resolve many typos in the English docs (#20088) · 3222fc64

Tom Aarsen authored Nov 07, 2022

* docs: Fix typo in ONNX parser help: 'tolerence' => 'tolerance'

* docs: Resolve many typos in the English docs

Typos found via 'codespell ./docs/source/en'

3222fc64

16 Aug, 2022 1 commit

mac m1 `mps` integration (#18598) · 9cf27468

Sourab Mangrulkar authored Aug 16, 2022



* mac m1 `mps` integration

* Update docs/source/en/main_classes/trainer.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* addressing comments

* Apply suggestions from code review
Co-authored-by: Dan Saattrup Nielsen <47701536+saattrupdan@users.noreply.github.com>

* resolve comment
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Dan Saattrup Nielsen <47701536+saattrupdan@users.noreply.github.com>

9cf27468

08 Aug, 2022 1 commit
- update fsdp docs (#18521) · 2fecde74
  Sourab Mangrulkar authored Aug 08, 2022
```
* updating fsdp documentation

* typo fix
```
  2fecde74
09 Jun, 2022 1 commit
- Mention in the doc we drop support for fairscale (#17610) · 29080643
  Sylvain Gugger authored Jun 09, 2022
  
  29080643
09 May, 2022 1 commit

PyTorch FSDP integration in Trainer (#17136) · 05fc1766

Sourab Mangrulkar authored May 09, 2022



* PyTorch FSDP integration in Trainer

* reformatting

make style and make quality are now compliant.

* Updating dependency check

* Trigger CI
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

05fc1766

05 May, 2022 1 commit
- Fix link to example scripts (#17103) · cad61b68
  Steven Liu authored May 05, 2022
  
  cad61b68
04 Apr, 2022 1 commit

Enable doc in Spanish (#16518) · b9a768b3

Sylvain Gugger authored Apr 04, 2022

* Reorganize doc for multilingual support

* Fix style

* Style

* Toc trees

* Adapt templates

b9a768b3

25 Mar, 2022 1 commit
- Big file_utils cleanup (#16396) · 088c1880
  Sylvain Gugger authored Mar 25, 2022
```
* Big file_utils cleanup

* This one still needs to be treated separately
```
  088c1880
09 Feb, 2022 1 commit
- [trainer docs] document how to select specific gpus (#15551) · dee17d56
  Stas Bekman authored Feb 09, 2022
```
* [trainer docs] document how to select specific gpus

* expand

* add urls

* add accelerate launcher
```
  dee17d56
19 Jan, 2022 1 commit
- Update Trainer code example (#15070) · 80f72960
  NielsRogge authored Jan 19, 2022
```
* Update code example

* Fix code quality

* Add comment
```
  80f72960
28 Dec, 2021 1 commit

Doc styler examples (#14953) · b5e2b183

Sylvain Gugger authored Dec 27, 2021

* Fix bad examples

* Add black formatting to style_doc

* Use first nonempty line

* Put it at the right place

* Don't add spaces to empty lines

* Better templates

* Deal with triple quotes in docstrings

* Result of style_doc

* Enable mdx treatment and fix code examples in MDXs

* Result of doc styler on doc source files

* Last fixes

* Break copy from

b5e2b183

15 Dec, 2021 1 commit

PoC for conserving old links (#14754) · 459677ae

Sylvain Gugger authored Dec 15, 2021



* PoC for conserving old links

* Do the same for other links

* remap the redirects section

* add instructions on how to move sections

* improve
Co-authored-by: Stas Bekman <stas@stason.org>

459677ae

13 Dec, 2021 1 commit
- Convert Trainer doc page to MarkDown (#14753) · 7533d30a
  Sylvain Gugger authored Dec 13, 2021
```
* Convert Trainer doc page to MarkDown

* Fix repo consistency

* Fix the doc build test job
```
  7533d30a