Commits · 4be8b95a9f0665a94589231cf4dd50d379ac1710 · chenpangpang / transformers

02 May, 2022 8 commits

Disable Flax GPU tests on push (#17042) · 4be8b95a
Sylvain Gugger authored May 02, 2022

4be8b95a
add torch.no_grad when in eval mode (#17020) · bdd690a7
yujun authored May 02, 2022
```
* add torch.no_grad when in eval mode

* make style quality
```
bdd690a7
Fix typo in RetriBERT docstring (#17018) · 9586e222
Martin Pömsl authored May 02, 2022

9586e222
[Flax(Speech)EncoderDecoder] Fix bug in `decoder_module` (#17036) · 93b802c4
Sanchit Gandhi authored May 02, 2022
```
* [FlaxSpeechEncoderDecoder] Fix bug in `decoder_module`

* [FlaxEncoderDecoder] Fix bug in `decoder_module`
```
93b802c4
Fix style · 1ae182d9
Sylvain Gugger authored May 02, 2022

1ae182d9

Michael Benayoun authored May 02, 2022

* Add meta proxy

* Uses meta data to trace data dependent control-flow

* Remove commented class

* Handles torch creating functions

* Added type annotation to fix tracing

* Tracing works for everything but T5 and GPT-J

* Almost all previously supported models pass

* All architectures can be traced except T5

* Intermediate commit to have a trace of the comparison operators for HFProxy

* Everything works, except loss computation

* Everything works

* Removed unused import

* Overriden methods do not use underlying ops (linear and torch.matmul), and model attributes are copied to the traced version

* Fix torch_matmul_override

* Change attributes reference to deepcopy

* Remove breakpoint and add torch_index_override

* Small fix

* Fix typo

* Replace asserts by explicit exceptions

2c2a2169

[FlaxGenerate] Fix bug in decoder_start_token_id (#17035) · ff846e9b
Sanchit Gandhi authored May 02, 2022

ff846e9b
update docs of length_penalty (#17022) · eb877f1f
Manan Dey authored May 02, 2022

eb877f1f

30 Apr, 2022 2 commits

Add translating guide (#17004) · da47c264
Omar U. Espejel authored Apr 30, 2022
```
* Add translating guide
```
da47c264

Add a check on config classes docstring checkpoints (#17012) · ede5e041

Yih-Dar authored Apr 30, 2022



* Add the check

* add missing ckpts

* add a list to ignore

* call the added check script

* better regex pattern
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ede5e041

29 Apr, 2022 9 commits
- Result of new doc style with fixes (#17015) · 7152ed2b
  Sylvain Gugger authored Apr 29, 2022
```
* Result of new doc style with fixes

* Add last two files

* Bump hf-doc-builder
```
  7152ed2b
- Replace dict/BatchEncoding instance checks by Mapping (#17014) · 18df4407
  Sylvain Gugger authored Apr 29, 2022
```
* Replace dict/BatchEncoding instance checks by Mapping

* Typo
```
  18df4407
- Revert "Updating variable names. (#16445)" (#17011) · b8dffd1f
  Nicolas Patry authored Apr 29, 2022
```
This reverts commit 4f3a14e3.
```
  b8dffd1f
- Updating variable names. (#16445) · 4f3a14e3
  Nicolas Patry authored Apr 29, 2022
  
  4f3a14e3
- Update README_zh-hans.md (#16977) · 20fb5d51
  tarzan authored Apr 29, 2022
  
  20fb5d51
- Make create_extended_attention_mask_for_decoder static method (#16893) · 63fbed5c
  Pavel Belevich authored Apr 29, 2022
  
  63fbed5c
- TF: XLA bad words logits processor and list of processors (#16974) · fb0ae129
  Joao Gante authored Apr 29, 2022
  
  fb0ae129
- Update all require decorators to use skipUnless when possible (#16999) · 57e6464a
  Zachary Mueller authored Apr 29, 2022
  
  57e6464a
- use scale=1.0 in floats_tensor called in speech model testers (#17007) · e952e049
  Yih-Dar authored Apr 29, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  e952e049
28 Apr, 2022 6 commits
- Update README to latest release (#16997) · e6f00a11
  Sylvain Gugger authored Apr 28, 2022
  
  e6f00a11
- Fix savedir for by epoch (#16996) · 3486a92a
  Zachary Mueller authored Apr 28, 2022
  
  3486a92a
- set eos_token_id to None to generate until max length (#16989) · 5af5735f
  Yih-Dar authored Apr 28, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  5af5735f
- Rename a class to reflect framework pattern AutoModelXxx -> TFAutoModelXxx (#16993) · 01562dac
  amyeroberts authored Apr 28, 2022
  
  01562dac
- Add parameter --config_overrides for run_mlm_wwm.py (#16961) · 1be8d56e
  conan1024hao authored Apr 28, 2022
```
* dd parameter --config_overrides for run_mlm_wwm.py

* linter
```
  1be8d56e
- Update check_models_are_tested to deal with Windows path (#16973) · 1f9e8625
  Yih-Dar authored Apr 28, 2022
```
* fix

* Apply suggestions from code review
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  1f9e8625
27 Apr, 2022 14 commits

Update tokenization_bertweet.py (#16941) · dced2624

Dat Quoc Nguyen authored Apr 28, 2022

The emoji version must be either 0.5.4 or 0.6.0. Newer emoji versions have been updated to newer versions of the Emoji Charts, thus not consistent with the one used for pre-processing the pre-training Tweet corpus (i.e. not consistent with the vocab).

dced2624

Add -e flag to some GH workflow yml files (#16959) · 992996e9

Yih-Dar authored Apr 27, 2022



* Add -e flag

* add check

* create new keys

* run python setup.py build install

* add comments

* change to develop
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

992996e9

Fix check_all_models_are_tested (#16970) · 596afb42
Yih-Dar authored Apr 27, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
596afb42
Fix doc notebooks links (#16969) · 691cdbb7
Sylvain Gugger authored Apr 27, 2022
```
* Fix doc notebooks links

* Remove missing section
```
691cdbb7
Fixup no_trainer save logic (#16968) · 60e1d883
Zachary Mueller authored Apr 27, 2022
```
* Fixup all examples
```
60e1d883
Fix multiple deletions of the same files in save_pretrained (#16947) · c79bbc3b
Sylvain Gugger authored Apr 27, 2022
```
* Fix multiple deletions of the same files in save_pretrained

* Add is_main_process argument
```
c79bbc3b
Fix add-new-model-like when model doesn't support all frameworks (#16966) · bfbec177
Sylvain Gugger authored Apr 27, 2022

bfbec177
Update custom_models.mdx (#16964) · cf8a7c24
Mishig Davaadorj authored Apr 27, 2022
```
BertModelForSequenceClassification -> BertForSequenceClassification
```
cf8a7c24
Fix `distributed_concat` with scalar tensor (#16963) · 5896b3ec
Antoni Baum authored Apr 27, 2022
```
* Fix `distributed_concat` with scalar tensor

* Update trainer_pt_utils.py
```
5896b3ec

[HF Argparser] Fix parsing of optional boolean arguments (#16946) · 084c38c5

NielsRogge authored Apr 27, 2022



* Add fix

* Apply suggestion from code review
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

084c38c5

Misc. fixes for Pytorch QA examples: (#16958) · c82e017a

Leonid Boytsov authored Apr 27, 2022

1. Fixes evaluation errors popping up when you train/eval on squad v2 (one was newly encountered and one that was previously reported Running SQuAD 1.0 sample command raises IndexError #15401 but not completely fixed).
2. Removes boolean arguments that don't use store_true. Please, don't use these: *ANY non-empty string is being converted to True in this case and this clearly is not the desired behavior (and it creates a LOT of confusion).
3. All no-trainer test scripts are now saving metric values in the same way (with the right prefix eval_), which is consistent with the trainer-based versions.
4. Adds forgotten model.eval() in the no-trainer versions. This improved some results, but not everything (see the discussion in the end). Please, see the F1 scores and the discussion below.

c82e017a

Fix HubertRobustTest PT/TF equivalence test on GPU (#16943) · 49d5bcb0
Yih-Dar authored Apr 27, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
49d5bcb0

Add semantic script, trainer (#16834) · 479fdc49

NielsRogge authored Apr 27, 2022

* Add first draft

* Improve script and README

* Improve README

* Apply suggestions from code review

* Improve script, add link to resulting model

* Add corresponding test

* Adjust learning rate

479fdc49

[Research] Speed up evaluation for XTREME-S (#16785) · a4a88fa0
Anton Lozhkov authored Apr 27, 2022
```
* Avoid repeated per-lang filtering

* Language groups and logits preprocessing

* Style
```
a4a88fa0

26 Apr, 2022 1 commit
- use original loaded keys to find mismatched keys (#16920) · 2d91e3c3
  Yongliang Shen authored Apr 27, 2022
  
  2d91e3c3