Commits · b013842244df7be96b8cc841491bd1e35e475e36 · chenpangpang / transformers

02 Mar, 2021 1 commit

Changed `num_beams` to `num_beams // num_beam_groups` when initialising... · b0138422

Martin Schmitt authored Mar 02, 2021

Changed `num_beams` to `num_beams // num_beam_groups` when initialising `PrefixConstrainedLogitsProcessor` in `_get_logits_processor` to fix compatibility issue when constrained decoding is used together with grouped beam search (#10475)

b0138422

01 Mar, 2021 5 commits

Add I-BERT to README (#10462) · 0c232519
Lysandre Debut authored Mar 01, 2021

0c232519
Remove Anthony from the bug reports in Transformers · 9248e270
Lysandre Debut authored Mar 01, 2021

9248e270
[Wav2Vec2FeatureExtractor] smal fixes (#10455) · a106bde5
Suraj Patil authored Mar 01, 2021
```
* smal fixes

* don't check for None
```
a106bde5
remove feature extraction config (#10457) · 11655faf
Patrick von Platen authored Mar 01, 2021

11655faf

Add Fine-Tuning for Wav2Vec2 (#10145) · 0234de84

Patrick von Platen authored Mar 01, 2021



* add encode labels function to tokenizer

* start adding finetuning

* init dropout

* upload

* correct convert script

* apply changes

* fix second typo

* make first dummy training run

* adapt convert script

* push confg for comparison

* remove conf

* finish training

* adapt data collator

* add research folder

* update according to fairseq feedback

* some minor corrections

* refactor masking indices a bit

* some minor changes

* clean tokenizer

* finish clean-up

* remove previous logic

* update run script

* correct training

* finish changes

* finish model

* correct bug

* fix training a bit more

* add some tests

* finish gradient checkpointing

* finish example

* correct gradient checkpointing

* improve tokenization method

* revert changes in tokenizer

* revert general change

* adapt fine-tuning

* update

* save intermediate test

* Update README.md

* finish finetuning

* delete conversion script

* Update src/transformers/models/wav2vec2/configuration_wav2vec2.py

* Update src/transformers/models/wav2vec2/processing_wav2vec2.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* finish wav2vec2 script

* finish wav2vec2 fine-tuning

* finalize test

* correct test

* adapt tests

* finish

* remove test file
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

0234de84

28 Feb, 2021 3 commits

Update ibert.rst (#10445) · 3c733f32
Patrick von Platen authored Feb 28, 2021

3c733f32

Adds terms to Glossary (#10443) · aeba4f95

Darigov Research authored Feb 28, 2021

* feat: Adds three definitions to glossary from @cronoik

Needed a definition for transformer which in turn needed 2 more definitions

To do with issue https://github.com/huggingface/transformers/issues/9078

* fix: Adjusts definition of neural network to make it easier to read

aeba4f95

Introduce save_strategy training argument (#10286) · 256482ac

Tanmay Garg authored Feb 28, 2021

* Introduce save_strategy training argument

* deprecate EvaluationStrategy

* collapse EvaluationStrategy and LoggingStrategy into a single
  IntervalStrategy enum

* modify tests to use modified enum

256482ac

27 Feb, 2021 5 commits
- updated logging and saving metrics (#10436) · aca6288f
  Bhadresh Savani authored Feb 27, 2021
```
* updated logging and saving metrics

* space removal
```
  aca6288f
- [run_seq2seq.py] restore functionality: saving to test_generations.txt (#10428) · f52a1589
  Stas Bekman authored Feb 27, 2021
```
This PR restores the original functionality that for some reason was modified.

Fixes: https://github.com/huggingface/transformers/issues/10381

@sgugger
```
  f52a1589
- Fix conda-build (#10431) · 311b7048
  Lysandre Debut authored Feb 27, 2021
  
  311b7048
- [examples] better model example (#10427) · ee04b698
  Stas Bekman authored Feb 26, 2021
```
* refactors

* typo
```
  ee04b698
- Ray Tune Integration Bug Fixes (#10406) · a85eb616
  Amog Kamsetty authored Feb 26, 2021
```
* fixes

* update resources

* formatting

* remove import

* add log statement

* use fstring

* add period

* Update src/transformers/integrations.py
```
  a85eb616
26 Feb, 2021 4 commits
- Add Ray Tune hyperparameter search integration test (#10414) · 98569d4b
  Kai Fricke authored Feb 26, 2021
  
  98569d4b
- [LED] Correct Docs (#10419) · d03695f3
  Patrick von Platen authored Feb 26, 2021
```
* correct docs

* correct tf model docs as well
```
  d03695f3
- Sagemaker Model Parallel tensoboard writing fix (#10403) · 7fc686ef
  Mansi Mane authored Feb 26, 2021
```
* Added tb fix

* Removed local rank condition

* Updated reference to args
```
  7fc686ef
- [ci, flax] non-existing models are unlikely to pass tests (#10409) · 83d2d55c
  Julien Chaumond authored Feb 26, 2021
```
😂
```
  83d2d55c
25 Feb, 2021 11 commits

Fix run_glue evaluation when model has a label correspondence (#10401) · 17b6e0d4
Sylvain Gugger authored Feb 25, 2021

17b6e0d4
Make Barthez tokenizer tests a bit faster (#10399) · 26f8b2cb
Sylvain Gugger authored Feb 25, 2021
```
* Make Barthez tokenizer tests a bit faster

* Quality
```
26f8b2cb

Fix None in add_token_positions - issue #10210 (#10374) · b040e6ef

Andrea Bacciu authored Feb 25, 2021

* Fix None in add_token_positions - issue #10210

Fix None in add_token_positions related to the issue #10210

* add_token_positions fix None values in end_positions vector

add_token_positions fix None in end_positions vector as proposed by @joeddav

b040e6ef

Add support for ZeRO-2/3 and ZeRO-offload in fairscale (#10354) · 9d14be5c

Sylvain Gugger authored Feb 25, 2021



* Ass support for ZeRO-2/3 and ZeRO-offload in fairscale

* Quality

* Rework from review comments

* Add doc

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Address review comments
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

9d14be5c

Ignore unexpected weights from PT conversion (#10397) · 88cc26dc
Lysandre Debut authored Feb 25, 2021

88cc26dc

I-BERT model support (#10153) · 63645b3b

Sehoon Kim authored Feb 26, 2021



* IBertConfig, IBertTokentizer added

* IBert Model names moified

* tokenizer bugfix

* embedding -> QuantEmbedding

* quant utils added

* quant_mode added to configuration

* QuantAct added, Embedding layer + QuantAct addition

* QuantAct added

* unused path removed, QKV quantized

* self attention layer all quantized, except softmax

* temporarl commit

* all liner layers quantized

* quant_utils bugfix

* bugfix: requantization missing

* IntGELU added

* IntSoftmax added

* LayerNorm implemented

* LayerNorm implemented all

* names changed: roberta->ibert

* config not inherit from ROberta

* No support for CausalLM

* static quantization added, quantize_model.py removed

* import modules uncommented

* copyrights fixed

* minor bugfix

* quant_modules, quant_utils merged as one file

* import * fixed

* unused runfile removed

* make style run

* configutration.py docstring fixed

* refactoring: comments removed, function name fixed

* unused dependency removed

* typo fixed

* comments(Copied from), assertion string added

* refactoring: super(..) -> super(), etc.

* refactoring

* refarctoring

* make style

* refactoring

* cuda -> to(x.device)

* weight initialization removed

* QuantLinear set_param removed

* QuantEmbedding set_param removed

* IntLayerNorm set_param removed

* assert string added

* assertion error message fixed

* is_decoder removed

* enc-dec arguments/functions removed

* Converter removed

* quant_modules docstring fixed

* conver_slow_tokenizer rolled back

* quant_utils docstring fixed

* unused aruments e.g. use_cache removed from config

* weight initialization condition fixed

* x_min, x_max initialized with small values to avoid div-zero exceptions

* testing code for ibert

* test emb, linear, gelu, softmax added

* test ln and act added

* style reformatted

* force_dequant added

* error tests overrided

* make style

* Style + Docs

* force dequant tests added

* Fix fast tokenizer in init

* Fix doc

* Remove space

* docstring, IBertConfig, chunk_size

* test_modeling_ibert refactoring

* quant_modules.py refactoring

* e2e integration test added

* tokenizers removed

* IBertConfig added to tokenizer_auto.py

* bugfix

* fix docs & test

* fix style num 2

* final fixes
Co-authored-by: Sehoon Kim <sehoonkim@berkeley.edu>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

63645b3b

[PretrainedFeatureExtractor] + Wav2Vec2FeatureExtractor, Wav2Vec2Processor,... · cb38ffcc

Patrick von Platen authored Feb 25, 2021

[PretrainedFeatureExtractor] + Wav2Vec2FeatureExtractor, Wav2Vec2Processor, Wav2Vec2Tokenizer (#10324)

* push to show

* small improvement

* small improvement

* Update src/transformers/feature_extraction_utils.py

* Update src/transformers/feature_extraction_utils.py

* implement base

* add common tests

* make all tests pass for wav2vec2

* make padding work & add more tests

* finalize feature extractor utils

* add call method to feature extraction

* finalize feature processor

* finish tokenizer

* finish general processor design

* finish tests

* typo

* remove bogus file

* finish docstring

* add docs

* finish docs

* small fix

* correct docs

* save intermediate

* load changes

* apply changes

* apply changes to doc

* change tests

* apply surajs recommend

* final changes

* Apply suggestions from code review

* fix typo

* fix import

* correct docstring

cb38ffcc

Remove unused variable in example for Q&A (#10392) · 9dc78257
abhishek thakur authored Feb 25, 2021

9dc78257

Bugfix: Removal of padding_idx in BartLearnedPositionalEmbedding (#10200) · 894db670

mingruimingrui authored Feb 25, 2021



* Assumption of padding_idx <2 might not stand

* Use offset instead of 2

* Fix with black

* Change behavior to warning instead for backward compatibility.

* Fix with black

* Remove warning

* Make padding_idx non-required

* padding_idx fix for blenderbot

* padding_idx fix for blenderbot_small

* padding_idx fix for led

* padding_idx fix for mbart

* Remove extra whitespaces

* padding_idx fix for template

* Fix padding_idx passed to nn.Embedding mistake

* Fixed padding_idx passed to positional embedding in template

* Remove padding_idx from pytorch learned positional embeddings

* Remove accidentally added quotes

* Remove padding_idx from tf learned positional embeddings

* Remove zeroing of weights in __init__
Co-authored-by: Wang Ming Rui <mingrui.wang@C02CJTUYMD6M.local>

894db670

Only run model templates tests once (#10388) · 55fe80d0
Lysandre Debut authored Feb 25, 2021

55fe80d0
Run GA on every push even on forks (#10383) · 22bd047e
Lysandre Debut authored Feb 25, 2021

22bd047e

24 Feb, 2021 6 commits

v4.3.3 docs · 35918443
Lysandre authored Feb 24, 2021

35918443
[trainer] move secondary methods into a separate file (#10363) · bdbb2c75
Stas Bekman authored Feb 24, 2021
```
* move secondary methods into a separate file

* cleanup

* style
```
bdbb2c75

fix deprecated ref to `tokenizer.max_len` (#10220) · 5f2a3d72

Poedator authored Feb 24, 2021

This is to fix deprecated reference to `tokenizer.max_len` with `tokenizer.model_max_length` - similar to [issue 8739](https://github.com/huggingface/transformers/issues/8739) and [PR 8604](https://github.com/huggingface/transformers/pull/8604). 
Example [here](https://colab.research.google.com/gist/poedator/f8776349e5c625ce287fc6fcd312fa1e/tokenizer-max_len-error-in-transformers_glue.ipynb). The error happens when `glue_convert_examples_to_features` is called without `max_length` parameter specified. In that case line 119 with wrong reference gets called. This simple fix should  do it.

5f2a3d72

Rework casts (#10274) · cdcdd5f0
Julien Plu authored Feb 24, 2021

cdcdd5f0

ConvBERT fix torch <> tf weights conversion (#10314) · 2d458b2c

abhishek thakur authored Feb 24, 2021



* convbert conversion test

* fin

* fin

* fin

* clean up tf<->pt conversion

* remove from_pt
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

2d458b2c

[Trainer/Deepspeed] handle get_last_lr() before first step() (#10362) · 3437d121

Stas Bekman authored Feb 23, 2021

* handle get_last_lr() before first step()

* abstract away the lr getting logic

* cleanup

* add test

* move to utils

3437d121

23 Feb, 2021 3 commits
- [bert-base-german-cased] cp to hardcoded urls (#10353) · 4a1ab7cb
  Julien Chaumond authored Feb 23, 2021
  
  4a1ab7cb
- Fix broken examples/seq2seq/README.md markdown (#10344) · 23e87c27
  Akmal authored Feb 23, 2021
  
  23e87c27
- Easier self-scheduled debugging · 83f890dd
  Lysandre authored Feb 23, 2021
  
  83f890dd
22 Feb, 2021 2 commits

Fix evaluation with label smoothing in Trainer (#10338) · 461e8cac
Sylvain Gugger authored Feb 22, 2021

461e8cac

[trainer] add Trainer methods for metrics logging and saving (#10266) · 622a8c59

Stas Bekman authored Feb 22, 2021



* make logging and saving trainer built-in

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

622a8c59