- 05 Oct, 2021 6 commits
-
-
Michael Benayoun authored
* Symbolic trace dynamic axes support for BERT-like models (albert, bert, distilbert, mobilebert, electra, megatron-bert)
* Sanity checks before tracing that make sure the model to trace is supported
* Adapted to PyTorch 1.9

Co-authored-by: Michael Benayoun <michael@huggingface.co>
-
Alex Hedges authored
* Improve error message when loading models from Hub
* Adjust error message wording
-
Nicolas Patry authored
* Fixing empty prompts for text-generation when BOS exists.
* Fixing odd case with Pegasus.
* Fixing Bert is Assertion Error.
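The empty-prompt behavior described in this entry can be sketched in a few lines (illustrative Python only; `encode_prompt`, the token ids, and the stand-in tokenizer are hypothetical, not the pipeline's actual code):

```python
def encode_prompt(prompt, bos_token_id=0, tokenize=lambda s: [ord(c) for c in s]):
    # When the prompt is empty and the model defines a BOS token,
    # start generation from BOS instead of an empty input sequence.
    # `tokenize` is a stand-in for a real tokenizer.
    if prompt:
        return tokenize(prompt)
    return [bos_token_id] if bos_token_id is not None else []
```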
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Nicolas Patry authored
-
Sam Hardwick authored
* Update Tatoeba conversion
-
- 04 Oct, 2021 6 commits
-
-
Bram Vanroy authored
* Update no_* argument: change the order so that the no_* argument is created after the original argument, and set the default for this no_* argument to False
* import copy
* update test
* make style
* Use kwargs to set default=False
* make style
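The argument ordering described above can be illustrated with plain argparse (a minimal sketch; `add_bool_arg` is a hypothetical helper, not HfArgumentParser's implementation):

```python
import argparse

def add_bool_arg(parser, name, default=True):
    # Create the original flag first, then the paired no_* flag,
    # which always defaults to False regardless of the original default.
    parser.add_argument(f"--{name}", action="store_true", default=default)
    parser.add_argument(f"--no_{name}", action="store_true", default=False)

parser = argparse.ArgumentParser()
add_bool_arg(parser, "cache")
args = parser.parse_args(["--no_cache"])  # cache keeps its default; no_cache is set
```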
-
Nathan Raw authored
* ✨ update image classification example
* 📌 update reqs
-
Evgeniy Zheltonozhskiy authored
* Fix broken link to distill models
* Missing symbol
* Fix spaces
-
Sidd Karamcheti authored
* Add layer-wise scaling
* Add reorder & upcasting argument
* Add OpenAI GPT-2 weight initialization scheme
* start `layer_idx` count at zero for consistency
* disentangle attn and reordered and upscaled attn function
* rename `scale_attn_by_layer` to `scale_attn_by_layer_id`
* make autocast from amp compatible with pytorch<1.6
* fix docstring
* style fixes
* Add fixes from PR feedback, style tweaks
* Fix doc whitespace
* Reformat
* First pass scale_attn_by_layer_idx and reorder_and_upcast_attn tests
* Rename scale_attn_by_layer_idx, add tip
* Remove extra newline
* add test for weight initialization
* update code format
* add assert check weights are fp32
* remove assert
* Fix incorrect merge
* Fix shape mismatch in baddbmm
* Add generation test for Mistral flags

Co-authored-by: leandro <leandro.vonwerra@spoud.io>
Co-authored-by: Keshav Santhanam <keshav2@stanford.edu>
Co-authored-by: J38 <jebolton@stanford.edu>
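The per-layer attention scaling mentioned above (the `scale_attn_by_layer_idx` flag) can be sketched as follows, assuming the extra factor is `layer_idx + 1` on top of the usual `1/sqrt(head_dim)` scaling (an illustration of the idea, not the GPT-2 source):

```python
import math

def scaled_attn_weights(scores, head_dim, layer_idx, scale_attn_by_layer_idx=True):
    # Standard 1/sqrt(d) attention scaling, optionally divided further
    # by (layer_idx + 1) for numerical stability in deeper layers.
    scale = math.sqrt(head_dim)
    if scale_attn_by_layer_idx:
        scale *= float(layer_idx + 1)
    return [s / scale for s in scores]
```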
-
Yaser Abdelaziz authored
-
Gunjan Chhablani authored
-
- 01 Oct, 2021 5 commits
-
-
Stas Bekman authored
-
Silviu Oprea authored
In BartForConditionalGeneration.forward, if labels are provided, decoder_input_ids are set to the labels shifted to the right. This is problematic: if decoder_inputs_embeds is also set, the call to self.model, which eventually reaches BartDecoder.forward, raises an error. The fix is simple and mirrors what BartModel.forward already does: do not compute decoder_input_ids when decoder_inputs_embeds is provided.

Co-authored-by: Silviu Vlad Oprea <silviuvo@amazon.co.uk>
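The guard described in this entry can be sketched like so (a hypothetical helper with made-up token ids, not the actual Bart source; real shifting also handles padding):

```python
def prepare_decoder_inputs(labels, decoder_input_ids, decoder_inputs_embeds,
                           decoder_start_token_id=2):
    # Only derive decoder_input_ids from labels when neither form of
    # decoder input was passed explicitly, mirroring BartModel.forward.
    if labels is not None and decoder_input_ids is None and decoder_inputs_embeds is None:
        # Shift labels one position to the right, prepending the start token.
        decoder_input_ids = [decoder_start_token_id] + list(labels[:-1])
    return decoder_input_ids, decoder_inputs_embeds
```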
-
Anton Lozhkov authored
* Restore broken merge
* Additional args, DDP, remove CommonLanguage
* Update examples for V100, add training results
* Style
* Apply suggestions from code review
* Remove custom datasets for simplicity, apply suggestions from code review
* Add the attention_mask flag, reorganize README

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Arfon Smith authored
-
Yuta Hayashibe authored
* Removed wrong warning
* Raise a warning when `max_length` is given with wrong `truncation`
* Update the error message
* Update the warning message

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
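The check this entry adds can be approximated as follows (a sketch of the idea, not the tokenizer's actual code or wording):

```python
import warnings

def check_truncation_args(max_length=None, truncation=False):
    # Warn when max_length is passed but truncation is disabled,
    # since max_length would otherwise be silently ignored.
    if max_length is not None and not truncation:
        warnings.warn(
            "`max_length` has no effect when `truncation` is not enabled.",
            UserWarning,
        )
```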
-
- 30 Sep, 2021 8 commits
-
-
Suraj Patil authored
-
Patrick von Platen authored
* update
* add to docs and init
* make fix-copies
-
Patrick von Platen authored
-
Gunjan Chhablani authored
* Init multibert checkpoint conversion script
* Rename conversion script
* Fix MultiBerts Conversion Script
* Apply suggestions from code review

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Stas Bekman authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Suraj Patil authored
* use Repository for push_to_hub
* update readme
* update other flax scripts
* update readme
* update qa example
* fix push_to_hub call
* fix typo
* fix more typos
* update readme
* use absolute path to get repo name
* fix glue script
-
- 29 Sep, 2021 10 commits
-
-
Stas Bekman authored
* missing requirement
* list both
-
Suraj Patil authored
* add a note about the tokenizer
* add tips to load the model with less RAM
* fix link
* fix more links
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Matt authored
-
Sylvain Gugger authored
* Fix length of IterableDatasetShard and add test
* Add comments
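The length fix mentioned above amounts to computing how many samples each shard yields; a plausible sketch under assumed semantics (not the actual IterableDatasetShard code):

```python
import math

def shard_length(total, num_shards, drop_last=False):
    # With drop_last, trailing samples that don't fill a full round
    # across shards are dropped; otherwise the last round is padded
    # so every shard yields the same (rounded-up) number of samples.
    if drop_last:
        return total // num_shards
    return math.ceil(total / num_shards)
```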
-
Li-Huai (Allan) Lin authored
* Enable readme link synchronization
* Style
* Reuse regex pattern
* Apply suggestions
* Update
-
Nishant Prabhu authored
Fix LayoutLM ONNX test error
-
Matt authored
* Keras callback to push to hub each epoch, or after N steps
* Reworked the callback to use Repository
* Use an Enum for save_strategy
* Style pass
* Correct type for tokenizer
* Update src/transformers/keras_callbacks.py
* Adding print message to the final upload
* Change how we wait for the last process to finish
* is_done is a property, not a method, derp
* Docstrings and documentation
* Style pass
* Style edit
* Docstring reformat
* Docstring rewrite
* Replacing print with internal logger

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Patrick von Platen authored
-
- 28 Sep, 2021 2 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
- 27 Sep, 2021 3 commits
-
-
Sylvain Gugger authored
-
Lysandre authored
-
Lysandre authored
-