Commits · 8e13b7359388882d93af5fe312efe56b6556fa23 · chenpangpang / transformers

11 Feb, 2021 5 commits

Update README.md · 8e13b735
Patrick von Platen authored Feb 11, 2021

8e13b735
Update ADD_BIG_BIRD.md · d6b4f48e
Patrick von Platen authored Feb 11, 2021

d6b4f48e

[Wav2Vec2] Improve Tokenizer & Model for batched inference (#10117) · 495c157d

Patrick von Platen authored Feb 11, 2021

* save intermediate

* finish batch the same as fairseq

* add normalization

* fix batched input

* add better comment

* Update src/transformers/models/wav2vec2/modeling_wav2vec2.py

* add nice docstring

* add tokenizer tests

* make all slow tests pass

* finish PR

* correct import

495c157d
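The bullets for #10117 mention "add normalization" and "fix batched input" without giving details. As a rough illustration only, the sketch below assumes per-waveform zero-mean, unit-variance normalization followed by right-padding with an attention mask. The function names `normalize` and `pad_batch`, the `eps` value, and the padding scheme are assumptions made for this example, not taken from the commit.

```python
import math
from typing import List, Tuple


def normalize(waveform: List[float], eps: float = 1e-5) -> List[float]:
    """Scale one waveform to zero mean and (approximately) unit variance."""
    mean = sum(waveform) / len(waveform)
    var = sum((x - mean) ** 2 for x in waveform) / len(waveform)
    return [(x - mean) / math.sqrt(var + eps) for x in waveform]


def pad_batch(
    waveforms: List[List[float]], pad_value: float = 0.0
) -> Tuple[List[List[float]], List[List[int]]]:
    """Normalize each waveform on its own, then right-pad to the longest one.

    Returns the padded batch and a mask with 1 for real samples, 0 for padding.
    """
    normalized = [normalize(w) for w in waveforms]
    max_len = max(len(w) for w in normalized)
    padded = [w + [pad_value] * (max_len - len(w)) for w in normalized]
    mask = [[1] * len(w) + [0] * (max_len - len(w)) for w in normalized]
    return padded, mask


# Two toy "waveforms" of different lengths (hypothetical data).
batch, attention_mask = pad_batch([[0.1, 0.3, 0.5, 0.7], [1.0, -1.0]])
```

Normalizing before padding keeps the padding values out of each waveform's mean and variance, which is one plausible reason to treat batched input separately from single inputs.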

Add new community notebook - Blenderbot (#10126) · 2f3b5f4d

Tanmay Thakur authored Feb 11, 2021

* Update:community.md, new nb add

* feat: updated grammar on nb description

* Update: Train summarizer for BlenderBotSmall

2f3b5f4d

Update run_xnli.py to use Datasets library (#9829) · 8dcfaea0

Qbiwan authored Feb 11, 2021

* remove xnli_compute_metrics, add load_dataset, load_metric, set_seed,metric.compute,load_metric

* fix

* fix

* fix

* push

* fix

* everything works

* fix init

* fix

* special treatment for sepconv1d

* style

* 🙏🏽

* add doc and cleanup


* fix doc

* fix doc again

* fix doc again

* Apply suggestions from code review

* make style

* Proposal that should work

* Remove needless code

* Fix test

* Apply suggestions from code review

* remove xnli_compute_metrics, add load_dataset, load_metric, set_seed,metric.compute,load_metric

* amend README

* removed data_args.task_name and replaced with task_name = "xnli"; use split function to load train and validation dataset separately; remove __post_init__; remove flag --task_name from README.

* removed dict task_to_keys, use str "xnli" instead of variable task_name, change preprocess_function to use examples["premise"], examples["hypothesis"] directly, remove sentence1_key and sentence2_key, change compute_metrics function to cater only to accuracy metric, add condition for train_language is None when using dataset.load_dataset()

* removed `torch.distributed.barrier()` and `import torch` as `from_pretrained` is able to do the work; amend README

8dcfaea0
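The last bullets of #9829 describe two concrete changes: `preprocess_function` reads `examples["premise"]` and `examples["hypothesis"]` directly, and `compute_metrics` reports accuracy only. A minimal sketch of both ideas follows. `toy_tokenizer` and `compute_accuracy` are hypothetical names for this example, and the tokenizer is a stand-in rather than a real one.

```python
from typing import Callable, Dict, List, Sequence


def preprocess_function(examples: Dict[str, List[str]], tokenizer: Callable[..., Dict]) -> Dict:
    """Tokenize premise/hypothesis pairs, reading the two columns by name."""
    return tokenizer(examples["premise"], examples["hypothesis"])


def compute_accuracy(logits: Sequence[Sequence[float]], labels: Sequence[int]) -> Dict[str, float]:
    """Accuracy only: the fraction of rows whose arg-max class equals the label."""
    predictions = [max(range(len(row)), key=row.__getitem__) for row in logits]
    correct = sum(int(p == y) for p, y in zip(predictions, labels))
    return {"accuracy": correct / len(labels)}


def toy_tokenizer(first: List[str], second: List[str]) -> Dict[str, List[List[str]]]:
    """Hypothetical stand-in for a real tokenizer: whitespace split with a [SEP] marker."""
    return {"tokens": [a.split() + ["[SEP]"] + b.split() for a, b in zip(first, second)]}


examples = {"premise": ["A man eats."], "hypothesis": ["Someone is eating."]}
encoded = preprocess_function(examples, toy_tokenizer)

# Two rows of made-up logits over three classes, with gold labels 1 and 2.
metrics = compute_accuracy([[0.1, 0.7, 0.2], [0.9, 0.05, 0.05]], [1, 2])
```

Hard-coding the two column names removes the need for a task-to-keys lookup, which matches the removal of `task_to_keys`, `sentence1_key` and `sentence2_key` described in the bullets.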

10 Feb, 2021 9 commits

[DeepSpeed] restore memory for evaluation (#10114) · 77b86284
Stas Bekman authored Feb 10, 2021
```
* free up memory at the end of train

* rework tests

* consistent formatting

* correction
```
77b86284

remove adjust_logits_during_generation method (#10087) · c130e67d

Suraj Patil authored Feb 10, 2021

* add forced logits processors

* delete adjust_logits method

* add forced_eos_token_id argument in config

* add tests for forced logits processors

* update gen utils tests

* add forced option to tf generate

* remove adjust_logits method from tf models

* update adjust_logits for marian

* delete _force_token_id_to_be_generated method

* style

* import warnings

* pass max_length to _get_logits_processor

* set forced_eos_token_id to None

* set forced attributes in conf utils

* typo

* fix rag generate

* add forced_eos_token_id in rag config

* remove force_bos_token_to_be_generated from BartConfig

* remove _force_token_ids_generation from FSMT

* nit

* fix negative constant

* apply suggestions from code review

c130e67d
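The bullets for #10087 replace per-model `adjust_logits_during_generation` hooks with "forced logits processors" driven by a `forced_eos_token_id` config argument. The sketch below shows one way such a processor could behave, assuming it forces the EOS token on the last decoding step before `max_length`. The class name `ForcedEOSSketch` and the plain-list interface are hypothetical and are not the library's implementation.

```python
import math
from typing import List


class ForcedEOSSketch:
    """Force the EOS token once the sequence is one step short of max_length."""

    def __init__(self, max_length: int, eos_token_id: int):
        self.max_length = max_length
        self.eos_token_id = eos_token_id

    def __call__(self, input_ids: List[int], scores: List[float]) -> List[float]:
        # Only intervene on the final decoding step; otherwise pass scores through.
        if len(input_ids) != self.max_length - 1:
            return scores
        # Mask every token except EOS so that EOS is the only possible choice.
        forced = [-math.inf] * len(scores)
        forced[self.eos_token_id] = 0.0
        return forced


processor = ForcedEOSSketch(max_length=4, eos_token_id=2)
early = processor([0, 5], [0.3, 0.1, -1.0, 2.0])     # not the last step yet
final = processor([0, 5, 7], [0.3, 0.1, -1.0, 2.0])  # last step: EOS is forced
```

Expressing the rule as a standalone processor means the generation loop can apply it to any model, which is consistent with the bullets that delete model-specific methods such as `_force_token_id_to_be_generated`.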

Fix TF LED/Longformer attentions computation (#10007) · 22a32cf4

Julien Plu authored Feb 10, 2021

* Fix test

* Remove commented test

* Fix name

* Apply style

* Fix check copies

* Remove prints

* Restore boolean

* Fix reshape

22a32cf4

Line endings should be LF across repo and not CRLF (#10119) · 0d8e554d
Lysandre Debut authored Feb 10, 2021

0d8e554d
add deepspeed fairscale (#10116) · 937f6707
Stas Bekman authored Feb 10, 2021

937f6707
[CI] build docs faster (#10115) · d478257d
Stas Bekman authored Feb 10, 2021
```
I assume the CI machine should have at least 4 cores, so let's build docs faster
```
d478257d

[DeepSpeed docs] new information (#9610) · 7c07a47d

Stas Bekman authored Feb 09, 2021

* how to specify a specific gpu

* new paper

* expand on buffer sizes

* style

* where to find config examples

* specific example

* small updates

7c07a47d

Fix tokenizers training in notebook (#10110) · 1fbaa3c1
Anthony MOI authored Feb 09, 2021

1fbaa3c1
Remove speed metrics from default compute objective (#10107) · 85395e49
Shiva Zamani authored Feb 09, 2021

85395e49

09 Feb, 2021 16 commits
- doc: update W&B related doc (#10086) · 7c7962ba
  Boris Dayma authored Feb 09, 2021
```
* doc: update W&B related doc

* doc(wandb): mention report_to

* doc(wandb): commit suggestion
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* doc(wandb): fix typo

* doc(wandb): remove WANDB_DISABLED
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  7c7962ba
- Fix TFConvBertModelIntegrationTest::test_inference_masked_lm Test (#10104) · 480a9d6b
  abhishek thakur authored Feb 09, 2021
  
  480a9d6b
- Add patch releases to the doc · 0c3d23df
  Sylvain Gugger authored Feb 09, 2021
  
  0c3d23df
- [RAG] fix generate (#10094) · 3e0c62b6
  Suraj Patil authored Feb 10, 2021
```
* fix rag generate and tests

* put back adjust_logits_during_generation

* tests are okay
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  3e0c62b6
- fix import (#10103) · 226973a9
  Patrick von Platen authored Feb 09, 2021
  
  226973a9
- Update ADD_BIG_BIRD.md · 4cda2d73
  Patrick von Platen authored Feb 09, 2021
  
  4cda2d73
- Replace strided slice with tf.expand_dims (#10078) · b82fe7d2
  Julien Plu authored Feb 09, 2021
```
* Replace tf.newaxis -> tf.expand_dims

* Fix tests

* Fix tests

* Use reshape when a tensor needs a double expand

* Fix GPT2

* Fix GPT2
```
  b82fe7d2
- Add head_mask and decoder_head_mask to TF LED (#9988) · e7381c45
  Daniel Stancl authored Feb 09, 2021
```
* Add head masking to TF LED

* Add head_mask to Longformer + one doc piece to LED

* Fix integration tests
```
  e7381c45
- Fix some edge cases in report_to and add deprecation warnings (#10100) · 77c0ce8c
  Sylvain Gugger authored Feb 09, 2021
  
  77c0ce8c
- Logging propagation (#10092) · 78f4a0e7
  Lysandre Debut authored Feb 09, 2021
```
* Enable propagation by default

* Document enable/disable default handler
```
  78f4a0e7
- [examples/s2s] add test set predictions (#10085) · 63fddcf6
  Suraj Patil authored Feb 09, 2021
```
* add do_predict, pass eval_beams during eval

* update help

* apply suggestions from code review
```
  63fddcf6
- Fix naming (#10095) · c6d5e565
  Julien Plu authored Feb 09, 2021
  
  c6d5e565
- Fix example in Wav2Vec2 documentation (#10096) · 4ed76377
  abhishek thakur authored Feb 09, 2021
```
* Fix example in Wav2Vec2 documentation

* fix style
```
  4ed76377
- Docs for v4.3.1 release · bf1a06a4
  Lysandre authored Feb 09, 2021
  
  bf1a06a4
- Deprecate Wav2Vec2ForMaskedLM and add Wav2Vec2ForCTC (#10089) · b972125c
  Patrick von Platen authored Feb 09, 2021
```
* add wav2vec2CTC and deprecate for maskedlm

* remove from docs
```
  b972125c
- Fix deployment script · ba542ffb
  Lysandre authored Feb 09, 2021
  
  ba542ffb
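The "Logging propagation (#10092)" entry in the list above enables propagation by default. For readers unfamiliar with the term, the sketch below uses only the Python standard library to show what propagation changes: with `propagate = True`, records from a library logger also reach handlers attached to the root logger. The logger name `my_library` and the `ListHandler` class are hypothetical and stand in for whatever the library actually configures.

```python
import logging


class ListHandler(logging.Handler):
    """Collect messages in a list so the effect is easy to inspect."""

    def __init__(self):
        super().__init__()
        self.messages = []

    def emit(self, record: logging.LogRecord) -> None:
        self.messages.append(record.getMessage())


root_handler = ListHandler()
logging.getLogger().addHandler(root_handler)

# "my_library" is a hypothetical logger name used only for this sketch.
library_logger = logging.getLogger("my_library")
library_logger.setLevel(logging.INFO)

# With propagation on, the record travels up to the root logger's handlers.
library_logger.propagate = True
library_logger.info("seen by the root handler")

# With propagation off, the root handlers never see the record.
library_logger.propagate = False
library_logger.info("stays inside the library logger")

logging.getLogger().removeHandler(root_handler)
```

After running this, `root_handler.messages` holds only the first message, which is why an application that configures its own root handlers benefits from propagation being enabled.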
08 Feb, 2021 10 commits

Integration test for electra model (#10073) · 263fac71
sandip authored Feb 09, 2021

263fac71
transition to new tests dir (#10080) · 781220ac
Stas Bekman authored Feb 08, 2021

781220ac
remove token_type_ids from TokenizerBertGeneration output (#10070) · 84acf0c7
demSd authored Feb 08, 2021

84acf0c7

Removing run_pl_glue.py from text classification docs, include run_xnli.py & run_tf_text_classification.py (#10066) · e4bf9910

Juan Cruz-Benito authored Feb 08, 2021

* Removing run_pl_glue.py from seq classification docs

* Adding run_tf_text_classification.py

* Using :prefix_link: to refer local files

* Applying "make style" to the branch

* Update docs/source/task_summary.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Removing last underscores
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e4bf9910

Docs for v4.3.0 · 0dd579c9
Lysandre authored Feb 08, 2021

0dd579c9
[trainer] deepspeed bug fixes and tests (#10039) · 322037e8
Stas Bekman authored Feb 08, 2021
```
* deepspeed bug fixes and tests

* manual wrap?
```
322037e8
Update tokenizers requirement (#10077) · f285e4c3
Anthony MOI authored Feb 08, 2021

f285e4c3

Fix mlflow param overflow clean (#10071) · ddaafd78

noise-field authored Feb 08, 2021

* Unify logging with f-strings

* Get limits from MLflow rather than hardcode

* Add a check for parameter length overflow

Also constants are marked as internal

* Don't stop run in on_train_end

This causes bad behaviour when there is a separate validation step:
validation gets recorded as a separate run.

* Fix style

ddaafd78
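The bullets for #10071 add "a check for parameter length overflow" and take the limits from MLflow instead of hardcoding them. A minimal sketch of such a check is shown below, assuming overlong values are skipped rather than truncated. The constant value and the function name `split_loggable` are illustrative assumptions, and no MLflow call is made here.

```python
from typing import Dict, List, Tuple

# Illustrative value only; per the commit, the real limit is read from MLflow itself.
MAX_PARAM_VAL_LENGTH = 250


def split_loggable(
    params: Dict[str, object], max_len: int = MAX_PARAM_VAL_LENGTH
) -> Tuple[Dict[str, object], List[str]]:
    """Separate parameters that fit the value-length limit from those that overflow."""
    kept: Dict[str, object] = {}
    dropped: List[str] = []
    for name, value in params.items():
        # The limit applies to the stringified value, so compare on str(value).
        if len(str(value)) > max_len:
            dropped.append(name)
        else:
            kept[name] = value
    return kept, dropped


# Hypothetical training arguments; "id2label" is deliberately too long.
params = {"learning_rate": 5e-5, "id2label": "x" * 300, "seed": 42}
kept, dropped = split_loggable(params)
```

Returning the dropped names separately lets the caller emit a warning for each skipped parameter instead of failing the whole logging call.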

[s2s examples] Replace -100 token ids with the tokenizer pad_id for compute_metrics (#10046) · ece6c514
Olivier authored Feb 08, 2021
```
* replace -100 token ids with the tokenizer pad_id for compute_metrics

* fixed typo for label_ids
```
ece6c514
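The change in #10046 replaces `-100` label ids with the tokenizer's pad id before metrics are computed. `-100` is the usual "ignore" placeholder for loss computation, not a vocabulary id, so it presumably has to be swapped out before labels are decoded back to text. A minimal sketch with plain lists follows. The function name `restore_pad_ids` and the pad id `0` are assumptions for this example.

```python
from typing import List

IGNORE_INDEX = -100  # the placeholder id named in the commit title


def restore_pad_ids(label_ids: List[List[int]], pad_token_id: int) -> List[List[int]]:
    """Return a copy of label_ids with every -100 replaced by pad_token_id."""
    return [
        [pad_token_id if tok == IGNORE_INDEX else tok for tok in row]
        for row in label_ids
    ]


# Made-up label ids, padded with -100 as they would be for the loss.
labels = [[12, 7, 3, -100, -100], [5, 9, -100, -100, -100]]
cleaned = restore_pad_ids(labels, pad_token_id=0)
```

The function builds new lists rather than editing in place, so the original labels remain usable for the loss.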
Model templates (#10072) · c9df1b1d
Lysandre Debut authored Feb 08, 2021

c9df1b1d