- 30 Sep, 2021 3 commits
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Suraj Patil authored
* use Repository for push_to_hub
* update readme
* update other flax scripts
* update readme
* update qa example
* fix push_to_hub call
* fix typo
* fix more typos
* update readme
* use absolute path to get repo name
* fix glue script
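For context, a minimal sketch of the `huggingface_hub` Repository workflow these scripts were moved to; the repo name and local path are placeholders, and the exact API has evolved since:

```python
# Hedged sketch: Repository clones a Hub repo locally and pushes committed
# changes back. "user/my-model" and "my-model" are placeholder names.
from huggingface_hub import Repository

repo = Repository(local_dir="my-model", clone_from="user/my-model")
# ... the training loop saves model/tokenizer files into "my-model" here ...
repo.push_to_hub(commit_message="End of training")
```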
-
- 29 Sep, 2021 10 commits
-
Stas Bekman authored
* missing requirement
* list both
-
Suraj Patil authored
* add a note about tokenizer
* add tips to load the model in less RAM
* fix link
* fix more links
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Matt authored
-
Sylvain Gugger authored
* Fix length of IterableDatasetShard and add test
* Add comments
-
Li-Huai (Allan) Lin authored
* Enable readme link synchronization
* Style
* Reuse regex pattern
* Apply suggestions
* Update
-
Nishant Prabhu authored
Fix LayoutLM ONNX test error
-
Matt authored
* Keras callback to push to hub each epoch, or after N steps
* Reworked the callback to use Repository
* Use an Enum for save_strategy
* Style pass
* Correct type for tokenizer
* Update src/transformers/keras_callbacks.py (applied review suggestions, repeated several times)
* Adding print message to the final upload
* Change how we wait for the last process to finish
* is_done is a property, not a method, derp
* Docstrings and documentation
* Style pass
* Style edit
* Docstring reformat
* Docstring rewrite
* Replacing print with internal logger

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
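For context, a minimal sketch of how the new callback is used; the model name and dataset are placeholders, and argument names follow the merged API but may differ across versions:

```python
from transformers import AutoTokenizer, TFAutoModelForSequenceClassification
from transformers.keras_callbacks import PushToHubCallback

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = TFAutoModelForSequenceClassification.from_pretrained("bert-base-cased")
model.compile(optimizer="adam")

push_callback = PushToHubCallback(
    output_dir="my-finetuned-bert",  # local dir, also used as the Hub repo name
    save_strategy="epoch",           # push each epoch (or "steps" with save_steps=N)
    tokenizer=tokenizer,             # tokenizer files are uploaded with the model
)
# model.fit(tf_train_dataset, epochs=3, callbacks=[push_callback])
```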
-
Patrick von Platen authored
-
- 28 Sep, 2021 2 commits
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
- 27 Sep, 2021 9 commits
-
Sylvain Gugger authored
-
Lysandre authored
-
Lysandre authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Matt authored
Allows models to be compiled without a loss, and to use the internal loss computations for training with fit()
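A short hedged sketch of what this enables; the dataset variable is a placeholder:

```python
# Sketch: with no loss passed to compile(), fit() falls back to the loss the
# model computes internally from the "labels" key of each input batch.
import tensorflow as tf
from transformers import TFAutoModelForSequenceClassification

model = TFAutoModelForSequenceClassification.from_pretrained("bert-base-cased")
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=3e-5))  # no loss argument

# Each element of tf_train_dataset is a dict of tensors that includes "labels".
# model.fit(tf_train_dataset, epochs=3)
```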
-
Sylvain Gugger authored
-
Sylvain Gugger authored
Co-authored-by: quantitative-technologies <james.hirschorn@quantitative-technologies.com>
-
Xiaohan Zou authored
* Fix type annotations for `distributed_concat()`
* Use Any
-
- 26 Sep, 2021 4 commits
-
Anton Lozhkov authored
-
Stas Bekman authored
-
Patrick von Platen authored
[Trainer] Make sure shown loss in distributed training is correctly averaged over all workers (#13681)
* push
* improve tr loss gather
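The idea behind the fix, as a hedged standalone sketch rather than the Trainer's exact code:

```python
import torch
import torch.distributed as dist

def mean_loss_across_workers(local_loss: torch.Tensor) -> torch.Tensor:
    """Average a scalar loss over all workers so the logged value matches
    what a single-process run would report."""
    if dist.is_available() and dist.is_initialized():
        reduced = local_loss.detach().clone()
        dist.all_reduce(reduced, op=dist.ReduceOp.SUM)  # sum over all workers
        reduced /= dist.get_world_size()                # then divide by count
        return reduced
    return local_loss.detach()
```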
-
Sylvain Gugger authored
-
- 25 Sep, 2021 2 commits
-
Patrick von Platen authored
-
Sylvain Gugger authored
-
- 24 Sep, 2021 10 commits
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Gunjan Chhablani authored
* Update run_glue.py
* Update run_glue.py
* Add model creation snippet to other scripts
* Fix style
-
Yuta Hayashibe authored
* Warn for unexpected argument combinations
* Updated the warning message for pad_to_max_length
-
Patrick von Platen authored
-
Nicolas Patry authored
Fixes #13697
-
Tommy Chiang authored
We use `torch.unique` here only to check whether all elements have the same value, so we can use `torch.unique_consecutive` instead. That function eliminates all but the first element from every consecutive group of equivalent elements: applied to `[1, 2, 2, 1]`, it returns `[1, 2, 1]`, which is enough to tell whether all elements are equal. Since `torch.unique_consecutive` does less work, it is much faster: on my machine, 25x faster on GPU and 15x faster on CPU.
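A small illustration of why the substitution is safe for this particular check; both calls agree on whether a tensor holds a single value:

```python
import torch

t = torch.tensor([3, 3, 3, 3])
# torch.unique sorts and deduplicates globally; unique_consecutive only
# collapses runs of equal neighbors, a single cheap pass over the tensor.
assert (len(torch.unique(t)) == 1) == (len(torch.unique_consecutive(t)) == 1)

t = torch.tensor([1, 2, 2, 1])
# unique -> [1, 2] (len 2); unique_consecutive -> [1, 2, 1] (len 3):
# both are longer than 1, so the "all elements equal" verdict still agrees.
assert (len(torch.unique(t)) == 1) == (len(torch.unique_consecutive(t)) == 1)
```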
-
Patrick von Platen authored
-
Josh Devins authored
This moves the assertion checking input dimensions into a block that only runs when the function is actually going to chunk the forward pass. That is often not the case at inference time, and tracing a model through this assertion with PyTorch triggers a tracing warning:

TracerWarning: Converting a tensor to a Python boolean might cause the trace to be incorrect. We can't record the data flow of Python values, so this value will be treated as a constant in the future. This means that the trace might not generalize to other inputs! `input_tensor.shape[chunk_dim] == tensor_shape for input_tensor in input_tensors`
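A simplified, hedged sketch of the pattern, modeled on transformers' `apply_chunking_to_forward` with details trimmed:

```python
import torch

def apply_chunking_to_forward(forward_fn, chunk_size, chunk_dim, *input_tensors):
    if chunk_size > 0:
        # Shape check only on the chunking path, so tracing the common
        # no-chunking path never converts tensor shapes to Python booleans.
        tensor_shape = input_tensors[0].shape[chunk_dim]
        assert all(
            t.shape[chunk_dim] == tensor_shape for t in input_tensors
        ), "All input tensors have to be of the same shape"
        num_chunks = tensor_shape // chunk_size
        chunked = [t.chunk(num_chunks, dim=chunk_dim) for t in input_tensors]
        outputs = [forward_fn(*chunk) for chunk in zip(*chunked)]
        return torch.cat(outputs, dim=chunk_dim)
    return forward_fn(*input_tensors)  # the path traced at inference time
```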
-
Patrick von Platen authored
* up
* rename
* add asr example
* add auto feature extractor
* some more fixes
* correct layerdrop
* correct for multi-gpu dist
* clean up
* refactor
* more fixes
* clean-up
* finish
* Apply suggestions from code review (several rounds)
* fix isort
* update
* add note
* apply surajs suggestions
* isort
* small change
* add hubert
* Update examples/pytorch/speech-recognition/run_speech_recognition_ctc.py

Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
-