"vscode:/vscode.git/clone" did not exist on "d6b6ab11f04eec306a7386d2b52bd31018902088"
- 04 Mar, 2021 3 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
This reverts commit f3660613.
-
Sylvain Gugger authored
-
- 03 Mar, 2021 9 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Fix gradient accumulation for SM Model Parallelism * Style and divide loss by grad accum steps
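The second bullet is the essence of the fix: when gradients are accumulated over several micro-batches, each micro-batch loss must be divided by the number of accumulation steps so the summed gradient matches one large-batch step. A minimal, framework-free sketch of the idea (names are illustrative, not the Trainer's actual code):

```python
def accumulate_loss(losses, accumulation_steps):
    """Sum per-micro-batch losses the way gradient accumulation does:
    each loss is divided by accumulation_steps before being added, so the
    accumulated gradient equals that of a single large-batch step."""
    total = 0.0
    for loss in losses:
        total += loss / accumulation_steps  # scale BEFORE the backward pass
    return total

# Four micro-batches with loss 2.0 and accumulation_steps=4 accumulate
# to the same effective loss as one batch whose mean loss is 2.0.
```

Without the division, the effective learning rate is silently multiplied by the number of accumulation steps.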
-
felixgwu authored
* fix all_hidden_states * use output_states instead of next_kv
-
Stas Bekman authored
* remap classes to strings * missing new util * style * doc * move the autogenerated file * Trigger CI
-
Sylvain Gugger authored
* Refactor checkpoint name in BERT and MobileBERT * Add option to check copies * Add QuestionAnswering * Add last models * Make black happy
-
Jeff Yang authored
* feat(docs): navigate with left/right arrow keys * fix: add missing comma
-
Patrick von Platen authored
* fix speed degradation bug t5 * fix for all models * fix code quality
-
WybeKoper authored
Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>
-
Mehrad Moradshahi authored
-
- 02 Mar, 2021 1 commit
-
-
Martin Schmitt authored
Changed `num_beams` to `num_beams // num_beam_groups` when initialising `PrefixConstrainedLogitsProcessor` in `_get_logits_processor`, fixing a compatibility issue when constrained decoding is used together with grouped beam search (#10475)
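The arithmetic behind the fix: grouped (diverse) beam search partitions the beams into `num_beam_groups` groups and applies the logits processors once per group, so a processor initialised with the full `num_beams` would see the wrong beam count. A hedged sketch of the computation (the helper name is hypothetical):

```python
def beams_per_group(num_beams, num_beam_groups):
    """Number of beams each group processes in grouped beam search.
    A per-beam logits processor must be built with this value rather
    than with the full num_beams."""
    if num_beams % num_beam_groups != 0:
        raise ValueError("num_beams must be divisible by num_beam_groups")
    return num_beams // num_beam_groups
```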
-
- 01 Mar, 2021 5 commits
-
-
Lysandre Debut authored
-
Lysandre Debut authored
-
Suraj Patil authored
* small fixes * don't check for None
-
Patrick von Platen authored
-
Patrick von Platen authored
* add encode labels function to tokenizer
* start adding finetuning
* init dropout
* upload
* correct convert script
* apply changes
* fix second typo
* make first dummy training run
* adapt convert script
* push config for comparison
* remove conf
* finish training
* adapt data collator
* add research folder
* update according to fairseq feedback
* some minor corrections
* refactor masking indices a bit
* some minor changes
* clean tokenizer
* finish clean-up
* remove previous logic
* update run script
* correct training
* finish changes
* finish model
* correct bug
* fix training a bit more
* add some tests
* finish gradient checkpointing
* finish example
* correct gradient checkpointing
* improve tokenization method
* revert changes in tokenizer
* revert general change
* adapt fine-tuning
* update
* save intermediate test
* Update README.md
* finish finetuning
* delete conversion script
* Update src/transformers/models/wav2vec2/configuration_wav2vec2.py
* Update src/transformers/models/wav2vec2/processing_wav2vec2.py
* finish wav2vec2 script
* finish wav2vec2 fine-tuning
* finalize test
* correct test
* adapt tests
* finish
* remove test file

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
- 28 Feb, 2021 3 commits
-
-
Patrick von Platen authored
-
Darigov Research authored
* feat: Adds three definitions to glossary from @cronoik. Needed a definition for "transformer", which in turn needed two more definitions. Relates to issue https://github.com/huggingface/transformers/issues/9078
* fix: Adjusts definition of neural network to make it easier to read
-
Tanmay Garg authored
* Introduce save_strategy training argument * deprecate EvaluationStrategy * collapse EvaluationStrategy and LoggingStrategy into a single IntervalStrategy enum * modify tests to use modified enum
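The collapse described above amounts to one enum serving evaluation, logging, and saving alike. A sketch of the idea (the values mirror the commit message, but this is not the exact library code):

```python
from enum import Enum

class IntervalStrategy(Enum):
    """Single strategy enum replacing the separate EvaluationStrategy
    and LoggingStrategy: the same three choices apply to evaluation,
    logging, and checkpoint saving."""
    NO = "no"        # never run the action
    STEPS = "steps"  # run every N optimizer steps
    EPOCH = "epoch"  # run at the end of each epoch
```

Because `Enum` lookup by value works out of the box, string arguments such as `save_strategy="epoch"` can be validated with a single `IntervalStrategy(value)` call.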
-
- 27 Feb, 2021 5 commits
-
-
Bhadresh Savani authored
* updated logging and saving metrics * space removal
-
Stas Bekman authored
This PR restores the original functionality that for some reason was modified. Fixes: https://github.com/huggingface/transformers/issues/10381 @sgugger
-
Lysandre Debut authored
-
Stas Bekman authored
* refactors * typo
-
Amog Kamsetty authored
* fixes * update resources * formatting * remove import * add log statement * use fstring * add period * Update src/transformers/integrations.py
-
- 26 Feb, 2021 4 commits
-
-
Kai Fricke authored
-
Patrick von Platen authored
* correct docs * correct tf model docs as well
-
Mansi Mane authored
* Added tb fix * Removed local rank condition * Updated reference to args
-
Julien Chaumond authored
😂
-
- 25 Feb, 2021 10 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Make Barthez tokenizer tests a bit faster * Quality
-
Andrea Bacciu authored
* Fix None in add_token_positions (issue #10210)
* Fix None values in the end_positions vector, as proposed by @joeddav
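The `None` values appear because a tokenizer's `char_to_token` returns `None` when a character index falls on whitespace or past the truncation point; the proposed fix steps backwards until a token is found. A simplified, self-contained sketch of that recovery logic (function names are illustrative, not the tutorial's exact code):

```python
def resolve_end_position(char_to_token, end_char, fallback):
    """Map the answer's end character to a token index.
    char_to_token returns a token index or None; step left from
    end_char until a token is found, else clamp to `fallback`
    (e.g. the last valid position in a truncated sequence)."""
    pos = None
    shift = 0
    while pos is None and end_char - shift >= 0:
        pos = char_to_token(end_char - shift)
        shift += 1
    return pos if pos is not None else fallback
```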
-
Sylvain Gugger authored
* Add support for ZeRO-2/3 and ZeRO-offload in fairscale * Quality * Rework from review comments * Add doc * Apply suggestions from code review * Address review comments

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
-
Lysandre Debut authored
-
Sehoon Kim authored
* IBertConfig, IBertTokenizer added
* IBert model names modified
* tokenizer bugfix
* embedding -> QuantEmbedding
* quant utils added
* quant_mode added to configuration
* QuantAct added, Embedding layer + QuantAct addition
* QuantAct added
* unused path removed, QKV quantized
* self-attention layer all quantized, except softmax
* temporary commit
* all linear layers quantized
* quant_utils bugfix
* bugfix: requantization missing
* IntGELU added
* IntSoftmax added
* LayerNorm implemented
* LayerNorm implemented all
* names changed: roberta -> ibert
* config does not inherit from RoBERTa
* no support for CausalLM
* static quantization added, quantize_model.py removed
* import modules uncommented
* copyrights fixed
* minor bugfix
* quant_modules, quant_utils merged as one file
* `import *` fixed
* unused runfile removed
* make style run
* configuration.py docstring fixed
* refactoring: comments removed, function name fixed
* unused dependency removed
* typo fixed
* comments (Copied from), assertion string added
* refactoring: super(..) -> super(), etc.
* refactoring
* refactoring
* make style
* refactoring
* cuda -> to(x.device)
* weight initialization removed
* QuantLinear set_param removed
* QuantEmbedding set_param removed
* IntLayerNorm set_param removed
* assert string added
* assertion error message fixed
* is_decoder removed
* enc-dec arguments/functions removed
* Converter removed
* quant_modules docstring fixed
* convert_slow_tokenizer rolled back
* quant_utils docstring fixed
* unused arguments, e.g. use_cache, removed from config
* weight initialization condition fixed
* x_min, x_max initialized with small values to avoid div-zero exceptions
* testing code for ibert
* test emb, linear, gelu, softmax added
* test ln and act added
* style reformatted
* force_dequant added
* error tests overridden
* make style
* Style + Docs
* force dequant tests added
* Fix fast tokenizer in init
* Fix doc
* Remove space
* docstring, IBertConfig, chunk_size
* test_modeling_ibert refactoring
* quant_modules.py refactoring
* e2e integration test added
* tokenizers removed
* IBertConfig added to tokenizer_auto.py
* bugfix
* fix docs & test
* fix style num 2
* final fixes

Co-authored-by: Sehoon Kim <sehoonkim@berkeley.edu>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
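Layers like QuantAct and QuantEmbedding in the commit above rest on symmetric uniform quantization: derive one scale from the observed maximum magnitude and map floats onto a small integer range. A generic sketch of that building block (this is the general technique, not the I-BERT implementation itself):

```python
def quantize_symmetric(values, num_bits=8):
    """Symmetric uniform quantization: map floats into the signed
    integer range [-(2**(b-1)-1), 2**(b-1)-1] using a single scale
    derived from the largest absolute value observed."""
    qmax = 2 ** (num_bits - 1) - 1
    max_abs = max(abs(v) for v in values) or 1.0  # avoid div-zero on all-zeros
    scale = max_abs / qmax
    quantized = [round(v / scale) for v in values]
    return quantized, scale

def dequantize(quantized, scale):
    """Recover approximate float values from integers and the scale."""
    return [q * scale for q in quantized]
```

The "x_min, x_max initialized with small values to avoid div-zero exceptions" bullet addresses exactly the all-zeros corner case hedged with `or 1.0` here.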
-
Patrick von Platen authored
[PretrainedFeatureExtractor] + Wav2Vec2FeatureExtractor, Wav2Vec2Processor, Wav2Vec2Tokenizer (#10324)
* push to show
* small improvement
* small improvement
* Update src/transformers/feature_extraction_utils.py
* Update src/transformers/feature_extraction_utils.py
* implement base
* add common tests
* make all tests pass for wav2vec2
* make padding work & add more tests
* finalize feature extractor utils
* add call method to feature extraction
* finalize feature processor
* finish tokenizer
* finish general processor design
* finish tests
* typo
* remove bogus file
* finish docstring
* add docs
* finish docs
* small fix
* correct docs
* save intermediate
* load changes
* apply changes
* apply changes to doc
* change tests
* apply Suraj's recommendations
* final changes
* Apply suggestions from code review
* fix typo
* fix import
* correct docstring
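At its core, the padding support mentioned above turns variable-length raw waveforms into a rectangular batch plus an attention mask. A simplified pure-Python sketch of that behaviour (not the actual Wav2Vec2FeatureExtractor code, which also handles normalization and tensor conversion):

```python
def pad_batch(batch, padding_value=0.0):
    """Pad variable-length 1-D float sequences to the batch maximum
    and return an attention mask (1 = real sample, 0 = padding),
    mimicking what a feature extractor's padding=True produces."""
    max_len = max(len(seq) for seq in batch)
    padded, mask = [], []
    for seq in batch:
        pad = max_len - len(seq)
        padded.append(list(seq) + [padding_value] * pad)
        mask.append([1] * len(seq) + [0] * pad)
    return padded, mask
```

The mask lets the model ignore padded positions, which matters for raw audio where sequence lengths vary by orders of magnitude.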
-
abhishek thakur authored
-
mingruimingrui authored
* Assumption of padding_idx < 2 might not stand
* Use offset instead of 2
* Fix with black
* Change behavior to warning instead for backward compatibility
* Fix with black
* Remove warning
* Make padding_idx non-required
* padding_idx fix for blenderbot
* padding_idx fix for blenderbot_small
* padding_idx fix for led
* padding_idx fix for mbart
* Remove extra whitespaces
* padding_idx fix for template
* Fix padding_idx passed to nn.Embedding mistake
* Fixed padding_idx passed to positional embedding in template
* Remove padding_idx from pytorch learned positional embeddings
* Remove accidentally added quotes
* Remove padding_idx from tf learned positional embeddings
* Remove zeroing of weights in __init__

Co-authored-by: Wang Ming Rui <mingrui.wang@C02CJTUYMD6M.local>
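The offset this commit introduces replaces a hard-coded `padding_idx < 2` assumption: Bart-family learned positional embeddings reserve the first two embedding rows, so position ids are shifted by an explicit offset rather than relying on the padding index being small. A sketch of the resulting position-id computation (the function name is illustrative):

```python
def position_ids_with_offset(seq_len, past_length=0, offset=2):
    """Compute position ids for a learned positional embedding whose
    first `offset` rows are reserved: every position is shifted by the
    offset instead of assuming padding_idx < 2."""
    return [past_length + i + offset for i in range(seq_len)]
```

With `offset=2`, a fresh 3-token sequence looks up embedding rows 2, 3 and 4, leaving rows 0 and 1 untouched regardless of what the tokenizer uses as its padding index.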
-
Lysandre Debut authored
-