Commits · 4975002df50c472cbb6f8ac3580e475f570606ab · chenpangpang / transformers

23 Mar, 2022 2 commits

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wront move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit

4975002d

Updates the default branch from master to main (#16326) · eca77f47

Lysandre Debut authored Mar 23, 2022



* Updates the default branch from master to main

* Links from `master` to `main`

* Typo

* Update examples/flax/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

eca77f47

21 Mar, 2022 1 commit
- [xtreme-s] Update Minds14 results (#16241) · e226a24f
  Anton Lozhkov authored Mar 21, 2022
```
* update results

* per-language metrics

* Format the per-language metrics
```
  e226a24f
17 Mar, 2022 1 commit
- remove jax.ops.index (#16220) · 93d3fd86
  Suraj Patil authored Mar 17, 2022
  
  93d3fd86
16 Mar, 2022 3 commits

Minor fixes to XTREME-S (#16193) · d35e0c62

Anton Lozhkov authored Mar 16, 2022



* Minor fixes

* Fix vocab union

* Update examples/research_projects/xtreme-s/README.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update README

* unused import
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d35e0c62

Replace all deprecated `jax.ops` operations with jnp's `at` (#16078) · ee27b3d7
Sanchit Gandhi authored Mar 16, 2022
```
* Replace all deprecated `jax.ops` operations with jnp's `at`

* np to jnp scores

* suggested changes
```
ee27b3d7
[Xtreme-S] fix some namings (#16183) · c2dc89be
Patrick von Platen authored Mar 16, 2022

c2dc89be

15 Mar, 2022 1 commit

Add the XTREME-S fine-tuning example (#15985) · 99fd3eb4

Anton Lozhkov authored Mar 16, 2022

* CTC+classification draft

* CTC+classification draft

* style

* multilingual runs

* Fix race condition during processor.from_reatrained

* Merge covost experiments

* Add README

* Quality

* Switch to .all configs

* Fix typos

99fd3eb4

12 Mar, 2022 1 commit

[Deepspeed] add support for bf16 mode (#14569) · 580dd87c

Stas Bekman authored Mar 11, 2022



* [WIP] add support for bf16 mode

* prep for bf16

* prep for bf16

* fix; zero2/bf16 is ok

* check bf16 is available

* test fixes

* enable zero3_bf16

* config files

* docs

* split stage_dtype; merge back to non-dtype-specific config file

* fix doc

* cleanup

* cleanup

* bfloat16 => bf16 to match the PR changes

* s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/

* test fixes/skipping

* move

* fix

* Update docs/source/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* backticks

* cleanup

* cleanup

* cleanup

* new version

* add note about grad accum in bf16
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

580dd87c

10 Mar, 2022 2 commits
- Don't compute metrics in LM examples on TPU (#16029) · 19597998
  Sylvain Gugger authored Mar 10, 2022
  
  19597998
- Update README.md · 6c9010ef
  Sanchit Gandhi authored Mar 10, 2022
  
  6c9010ef
09 Mar, 2022 2 commits
- Fix broken code blocks in README.md (#15967) · 8feede22
  Shotaro Ishihara authored Mar 10, 2022
```
at transformers/examples/pytorch/contrastive-image-text
```
  8feede22
- Swag example: Update doc format (#16014) · e7f34ccd
  Joao Gante authored Mar 09, 2022
  
  e7f34ccd
08 Mar, 2022 2 commits
- Update TF multiple choice example (#15868) · 62d84760
  Joao Gante authored Mar 08, 2022
  
  62d84760
- Speedup training by using numpy instead of jnp for batch shuffling (#15963) · 91fb62d0
  Yeb Havinga authored Mar 08, 2022
```
Speedup training by using numpy instead of jnp for batch shuffling
Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>
```
  91fb62d0
04 Mar, 2022 2 commits
- [FlaxT5 Example] fix flax t5 example pretraining (#15835) · 10b76987
  Patrick von Platen authored Mar 04, 2022
  
  10b76987
- Update README.md · b7147489
  Sanchit Gandhi authored Mar 04, 2022
  
  b7147489
03 Mar, 2022 2 commits
- Fix #15898 (#15928) · c0281feb
  davidleonfdez authored Mar 03, 2022
  
  c0281feb
- v4.18.0.dev.0 · 79d28e80
  Sylvain Gugger authored Mar 03, 2022
  
  79d28e80
02 Mar, 2022 2 commits
- Fix tiny typo (#15884) · e535c389
  Ross Johnstone authored Mar 02, 2022
  
  e535c389
- Update TF QA example (#15870) · 05c237ea
  Joao Gante authored Mar 02, 2022
  
  05c237ea
01 Mar, 2022 1 commit
- Update TF LM examples (#15855) · 3f2e6368
  Joao Gante authored Mar 01, 2022
  
  3f2e6368
25 Feb, 2022 1 commit
- [examples/summarization and translation] fix readme (#15833) · bf1fe328
  Suraj Patil authored Feb 25, 2022
  
  bf1fe328
23 Feb, 2022 1 commit

[Test refactor 1/5] Per-folder tests reorganization (#15725) · 29c10a41

Lysandre Debut authored Feb 23, 2022



* Per-folder tests reorganization
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>

29c10a41

22 Feb, 2022 1 commit
- Fix typo on examples/pytorch/question-answering (#15644) · 3db2e8f9
  Yongrae Jo authored Feb 23, 2022
```
cna -> can
```
  3db2e8f9
21 Feb, 2022 4 commits

TF text classification examples (#15704) · 3956b133
Joao Gante authored Feb 21, 2022
```
* Working example with to_tf_dataset

* updated text_classification

* more comments
```
3956b133

add VisionTextDualEncoder and CLIP fine-tuning script (#15701) · 86119c11

Suraj Patil authored Feb 21, 2022



* begin script

* update script

* fix features and data args

* main

* add requirements

* add column name args

* fix captions

* don't jit transforms

* fix caption

* fix labels, handle attention mask

* convert pixel values to numpy

* labels => input_ids

* transform images on the fly

* use AutoModel class, create the hybird model outside of the script

* fix version message

* add readme

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* adderss review comments

* add more comments

* allow freezing vision and text models
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

86119c11

Fix minor comment typos (#15740) · 5444687f
Ivan Agarský authored Feb 21, 2022

5444687f
Remove input and target reset after preprocessing (#15741) · a63bd367
Simon Sardorf authored Feb 21, 2022
```
Remove input and target reset after preprocessing
```
a63bd367

17 Feb, 2022 2 commits

Add SimMIM (#15586) · 57882177

NielsRogge authored Feb 17, 2022



* Add first draft

* Make model importable

* Make SwinForMaskedImageModeling importable

* Fix imports

* Add missing inits

* Add support for Swin

* Fix bug

* Fix bug

* Fix another bug

* Fix Swin MIM implementation

* Fix default encoder stride

* Fix Swin

* Add print statements for debugging

* Add image_size data argument

* Fix Swin

* Fix image_size

* Add print statements for debugging

* Fix print statement

* Remove print statements

* Improve reshaping of bool_masked_pos

* Add support for DeiT, fix tests

* Improve docstrings

* Apply new black version

* Improve script

* Fix bug

* Improve README

* Apply suggestions from code review

* Remove DS_Store and add to gitignore

* Apply suggestions from code review + fix BEiT Flax

* Revert BEiT changes

* Improve README

* Fix code quality

* Improve README
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
Co-authored-by: Niels Rogge <nielsr...

57882177

Add image classification notebook (#15667) · 0e91f885
NielsRogge authored Feb 17, 2022
```
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
```
0e91f885

15 Feb, 2022 1 commit
- updated with latest PL and Ray (#15653) · 80f1a591
  Shamane Siri authored Feb 16, 2022
  
  80f1a591
11 Feb, 2022 1 commit
- [research_projects] deal with security alerts (#15594) · fcb0f743
  Stas Bekman authored Feb 11, 2022
```
* [research_projects] deal with security alerts

* add a note of the original PL ver and warning
```
  fcb0f743
10 Feb, 2022 1 commit
- Add example batch size to all commands (#15596) · 3d5dea9b
  Patrick von Platen authored Feb 10, 2022
  
  3d5dea9b
09 Feb, 2022 1 commit
- Upgrade black to version ~=22.0 (#15565) · 7732d0fe
  Lysandre Debut authored Feb 09, 2022
```
* Upgrade black to version ~=22.0

* Check copies

* Fix code
```
  7732d0fe
07 Feb, 2022 1 commit

Add ASR CTC streaming example (#15309) · a459f7f9

Anton Lozhkov authored Feb 07, 2022



* Single-epoch run

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Infinite dataset

* Trainer fix + distributed benchmark

* Benchmark fix

* unused import

* interleaved splits

* interleaved splits

* has_length util

* Move to research projects

* Leftover Sized checks

* Bump min version

* Unused import

* Revert trainer changes
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a459f7f9

03 Feb, 2022 1 commit
- [WIP] Add preprocess_logits_for_metrics Trainer param (#15473) · f1a4c4ea
  davidleonfdez authored Feb 03, 2022
```
* Add preprocess_logits_for_metrics Trainer param

* Compute accuracy in LM examples

* Improve comments
```
  f1a4c4ea
02 Feb, 2022 1 commit
- Fix labels stored in model config for token classification examples (#15482) · 45cac3fa
  Sylvain Gugger authored Feb 02, 2022
```
* Playing

* Properly set labels in model config for token classification example

* Port to run_ner_no_trainer

* Quality
```
  45cac3fa
01 Feb, 2022 2 commits
- Harder check for IndexErrors in QA scripts (#15438) · d0b5ed11
  Sylvain Gugger authored Feb 01, 2022
```
* Harder check for IndexErrors in QA scripts

* Make test stronger
```
  d0b5ed11
- Update README.md (#15462) · d2749cf7
  Kamal Raj authored Feb 01, 2022
```
fix typo
```
  d2749cf7