Commits · 4213728067b36457124b069e66f7acb6b7079421 · chenpangpang / transformers

01 Oct, 2021 3 commits

[Examples] Add an official audio classification example (#13722) · 42137280

Anton Lozhkov authored Oct 01, 2021



* Restore broken merge

* Additional args, DDP, remove CommonLanguage

* Update examples for V100, add training results

* Style

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Remove custom datasets for simplicity, apply suggestions from code review

* Add the attention_mask flag, reorganize README
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Update CITATION.cff (#13833) · c4113721
Arfon Smith authored Oct 01, 2021

Fix warning situation: "UserWarning: max_length is ignored when padding=True" (#13829) · 90f980ed

Yuta Hayashibe authored Oct 01, 2021



* Removed wrong warning

* Raise a warning when `max_length` is given with wrong `truncation`

* Update the error message

* Update the warning message
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

30 Sep, 2021 8 commits
- skip gptj slow generate tests for now (#13809) · 8bbb53e2
  Suraj Patil authored Oct 01, 2021
  
- [DPR] Correct init (#13796) · 41436d3d
  Patrick von Platen authored Sep 30, 2021
```
* update

* add to docs and init

* make fix-copies
```
- map only on one process (#13810) · 44eb8bde
  Patrick von Platen authored Sep 30, 2021
  
- Add MultiBERTs conversion script (#13077) · 9a9805fc
  Gunjan Chhablani authored Sep 30, 2021
```
* Init multibert checkpoint conversion script

* Rename conversion script

* Fix MultiBerts Conversion Script

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
- [testing] auto-replay captured streams (#13803) · e1d1c7c0
  Stas Bekman authored Sep 30, 2021
  
- Update doc for v4.11.2 · 5f25855b
  Sylvain Gugger authored Sep 30, 2021
  
- Fix gather for TPU (#13813) · 269c3d14
  Sylvain Gugger authored Sep 30, 2021
  
- [examples/flax] use Repository API for push_to_hub (#13672) · 7db2a79b
  Suraj Patil authored Sep 30, 2021
```
* use Repository for push_to_hub

* update readme

* update other flax scripts

* update readme

* update qa example

* fix push_to_hub call

* fix typo

* fix more typos

* update readme

* use absolute path to get repo name

* fix glue script
```
29 Sep, 2021 10 commits

[examples `run_glue.py`] missing requirements `scipy`, `sklearn` (#13768) · b90096fe
Stas Bekman authored Sep 29, 2021
```
* missing requirement

* list both
```
[docs/gpt-j] add instructions for how to minimize CPU RAM usage (#13795) · bf6118e7
Suraj Patil authored Sep 29, 2021
```
* add a note about tokenizer

* add tips to load model in less RAM

* fix link

* fix more links
```
Merge remote-tracking branch 'origin/master' · 55695df0
Sylvain Gugger authored Sep 29, 2021

Update doc for v4.11.1 · cf4aa359
Sylvain Gugger authored Sep 29, 2021

Add TF notebooks (#13793) · 2a51b155
Matt authored Sep 29, 2021

Fix length of IterableDatasetShard and add test (#13792) · 63cc5bda
Sylvain Gugger authored Sep 29, 2021
```
* Fix length of IterableDatasetShard and add test

* Add comments
```
Enable readme link synchronization (#13785) · 7d84c3a4
Li-Huai (Allan) Lin authored Sep 29, 2021
```
* Enable readme link synchronization

* Style

* Reuse regex pattern

* Apply suggestions

* Update
```
Fix LayoutLM ONNX test error (#13710) · a1ea3adb
Nishant Prabhu authored Sep 29, 2021

Keras callback to push to hub each epoch, or after N steps (#13773) · 3a8a8013

Matt authored Sep 29, 2021



* Keras callback to push to hub each epoch, or after N steps

* Reworked the callback to use Repository

* Use an Enum for save_strategy

* Style pass

* Correct type for tokenizer

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Adding print message to the final upload

* Adding print message to the final upload

* Change how we wait for the last process to finish

* is_done is a property, not a method, derp

* Docstrings and documentation

* Style pass

* Style edit

* Docstring reformat

* Docstring rewrite

* Replacing print with internal logger
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

up (#13777) · aa018a79
Patrick von Platen authored Sep 29, 2021

28 Sep, 2021 2 commits
- Implement len in IterableDatasetShard (#13780) · a21ee1f9
  Sylvain Gugger authored Sep 28, 2021
  
- Fix warning for gradient_checkpointing (#13767) · 83d3dc0f
  Sylvain Gugger authored Sep 28, 2021
  
27 Sep, 2021 9 commits
- Fix filtering in test fetcher utils (#13766) · 5e3b4a70
  Sylvain Gugger authored Sep 27, 2021
  
- Docs for version v4.11.0 · 11c69b80
  Lysandre authored Sep 27, 2021
  
- Release: v4.11.0 · dc193c90
  Lysandre authored Sep 27, 2021
  
- Fix gather for SageMaker model parallel · 1c965000
  Sylvain Gugger authored Sep 27, 2021
  
- Fix in gather for SM distributed · 4e0410e9
  Sylvain Gugger authored Sep 27, 2021
  
- Modified TF train_step (#13678) · 367c2ef5
  Matt authored Sep 27, 2021
```
Allows models to be compiled without a loss, and to use the internal loss computations for training with fit()
```
- Silence warning in gradient checkpointing when it's False (#13734) · e00bc7cd
  Sylvain Gugger authored Sep 27, 2021
  
- Fix loss computation in Trainer (#13760) · 3ffd18a6
  Sylvain Gugger authored Sep 27, 2021
```
Co-authored-by: quantitative-technologies <james.hirschorn@quantitative-technologies.com>
```
- Fix type annotations for `distributed_concat()` (#13746) · 3ccc2701
  Xiaohan Zou authored Sep 27, 2021
```
* Fix type annotations for `distributed_concat()`

* Use Any
```
26 Sep, 2021 4 commits
- [Tests] Cast Hubert test models to fp16 (#13755) · e0d31a89
  Anton Lozhkov authored Sep 26, 2021
  
- [megatron gpt checkpoint conversion] causal mask requires pos_embed dimension (#13735) · 400c5a15
  Stas Bekman authored Sep 26, 2021
  
- [Trainer] Make sure shown loss in distributed training is correctly averaged over all workers (#13681) · 91df4551
  Patrick von Platen authored Sep 26, 2021
```
* push

* improve tr loss gather
```
- Update requirements for speech example (#13745) · 044eff5b
  Sylvain Gugger authored Sep 26, 2021
  
25 Sep, 2021 2 commits
- finish (#13743) · 067413fb
  Patrick von Platen authored Sep 25, 2021
  
- Update test dependence for torch examples (#13738) · a8ec0029
  Sylvain Gugger authored Sep 25, 2021
  
24 Sep, 2021 2 commits
- Update README.md · 469b80d4
  Patrick von Platen authored Sep 24, 2021
  
- up (#13733) · 493643ff
  Patrick von Platen authored Sep 24, 2021
  