- 18 Jan, 2022 3 commits
-
-
Li-Huai (Allan) Lin authored
* REALM initial commit
* Retriever OK (Update new_gelu).
* Encoder prediction score OK
* Encoder pretrained model OK
* Update retriever comments
* Update docs, tests, and imports
* Prune unused models
* Make embedder as a module `RealmEmbedder`
* Add RealmRetrieverOutput
* Update tokenization
* Pass all tests in test_modeling_realm.py
* Prune RealmModel
* Update docs
* Add training test.
* Remove completed TODO
* Style & Quality
* Prune `RealmModel`
* Fixup
* Changes: 1. Remove RealmTokenizerFast 2. Update docstrings 3. Add a method to RealmTokenizer to handle candidates tokenization.
* Fix up
* Style
* Add tokenization tests
* Update `from_pretrained` tests
* Apply suggestions
* Style & Quality
* Copy BERT model
* Fix comment to avoid docstring copying
* Make RealmBertModel private
* Fix bug
* Style
* Basic QA
* Save
* Complete reader logits
* Add searcher
* Complete searcher & reader
* Move block records init to constructor
* Fix training bug
* Add some outputs to RealmReader
* Add finetuned checkpoint variable names parsing
* Fix bug
* Update REALM config
* Add RealmForOpenQA
* Update convert_tfrecord logits
* Fix bugs
* Complete imports
* Update docs
* Update naming
* Add brute-force searcher
* Pass realm model tests
* Style
* Exclude RealmReader from common tests
* Fix
* Fix
* convert docs
* up
* up
* more make style
* up
* upload
* up
* Fix
* Update src/transformers/__init__.py
* adapt testing
* change modeling code
* fix test
* up
* up
* up
* correct more
* make retriever work
* update
* make style
* finish main structure
* Resolve merge conflict
* Make everything work
* Style
* Fixup
* Fixup
* Update training test
* fix retriever
* remove hardcoded path
* Fix
* Fix modeling test
* Update model links
* Initial retrieval test
* Fix modeling test
* Complete retrieval tests
* Fix
* style
* Fix tests
* Fix docstring example
* Minor fix of retrieval test
* Update license headers and docs
* Apply suggestions from code review
* Style
* Apply suggestions from code review
* Add an example to RealmEmbedder
* Fix

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Yih-Dar authored
* fix doc example - NameError: name 'PATH' is not defined
* fix name 'TFRagModel' is not defined
* correct TFRagRagSequenceForGeneration
* fix name 'tf' is not defined
* fix style

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Nicolas Patry authored
* `is_ctc` needs to be updated to `self.type == "ctc"`.
* Adding fast test for this functionality.
-
- 17 Jan, 2022 5 commits
-
-
Yih-Dar authored
* fix doc example - cannot import name 'UniSpeechSatFeatureEncoder'
* fix ckpt name

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
-
Stas Bekman authored
add new paper
-
Matt authored
-
MrinalTyagi authored
-
- 16 Jan, 2022 1 commit
-
-
Patrick von Platen authored
-
- 15 Jan, 2022 1 commit
-
-
Stas Bekman authored
* Efficient Software Prebuilds * improve
-
- 14 Jan, 2022 11 commits
-
-
Joao Gante authored
-
Sylvain Gugger authored
* Better dummies
* See if this fixes the issue
* Fix quality
* Style
* Add doc for DummyObject
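The "dummies" in this commit refer to a common placeholder pattern: when an optional backend is missing, imports of the library still succeed, and a clear error is raised only when the placeholder is actually used. A minimal stdlib sketch of that pattern (class and attribute names are illustrative, not the actual transformers implementation):

```python
class DummyObject(type):
    """Metaclass for placeholder classes standing in for objects whose
    optional backend (e.g. torch) is not installed: importing the
    placeholder succeeds, but using it raises a helpful ImportError."""

    def __getattr__(cls, key):
        # Let machinery probing private/dunder attributes through untouched.
        if key.startswith("_"):
            raise AttributeError(key)
        raise ImportError(
            f"{cls.__name__} requires the {cls._backends} backend(s), "
            "which are not installed in this environment."
        )


# Placeholder that would be generated when torch is unavailable
# (illustrative name).
class BertModel(metaclass=DummyObject):
    _backends = ["torch"]


# Importing/referencing the class is fine; using it fails loudly:
try:
    BertModel.from_pretrained("bert-base-uncased")
except ImportError as err:
    print(err)
```

Because the check lives in the metaclass, every attribute access on the dummy class produces the same actionable message instead of an opaque `AttributeError`.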
-
Nicolas Patry authored
* Fixing flaky test (hopefully). * tf compliant.
-
Joao Gante authored
* TF Bert inference - support np.ndarray optional arguments * apply np input tests to all TF architectures
-
AK391 authored
* update XLMProphetNet link
* update DPR link
* change prophetnet link
* change link MBART
* change link GPT
* update gpt2 link
* ctrl update link
* update Transformer-XL link
* Update Reformer link
* update xlnet link
* bert update link
* update albert link
* roberta update link
* update distilbert link
* update convbert link
* update XLM link
* xlm roberta update link
* update Flaubert link
* update electra link
* update funnel transformer and longformer
* bart update link
* pegasus update link
* update marianmt link
* t5 update link
* mt5 update link
-
novice authored
-
SaulLu authored
* add new test
* fix in init
* more relevant test
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Make sure all submodules are properly registered
* Try to fix tests
* Fix tests
-
Joao Gante authored
-
Sylvain Gugger authored
* Check the repo consistency in model templates test
* Fix doc template
* Fix docstrings
* Fix last docstring
-
- 13 Jan, 2022 8 commits
-
-
Sylvain Gugger authored
-
Stas Bekman authored
-
Yanming Wang authored
* Multiple fixes of trainer class with XLA GPU
* Make fp16 valid for xla:gpu
* Add mark_step in should_log to reduce compilation overhead
-
Carlos Aguayo authored
Fix typo
-
Manuel R. Ciosici authored
* Add AdamW deprecation warning
* Add --optim to Trainer
* Update src/transformers/optimization.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/optimization.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/optimization.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/optimization.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/training_args.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/training_args.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/training_args.py
* fix style
* fix
* Regroup adamws together
  Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Change --adafactor to --optim adafactor
* Use Enum for optimizer values
* fixup! Change --adafactor to --optim adafactor
* fixup! Change --adafactor to --optim adafactor
* fixup! Change --adafactor to --optim adafactor
* fixup! Use Enum for optimizer values
* Improved documentation for --adafactor
  Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Add mention of no_deprecation_warning
  Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Rename OptimizerOptions to OptimizerNames
* Use choices for --optim
* Move optimizer selection code to a function and add a unit test
* Change optimizer names
* Rename method
  Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Rename method
  Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Remove TODO comment
  Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Rename variable
  Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Rename variable
  Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Rename function
* Rename variable
* Parameterize the tests for supported optimizers
* Refactor
* Attempt to make tests pass on CircleCI
* Add a test with apex
* rework to add apex to parameterized; add actual train test
* fix import when torch is not available
* fix optim_test_params when torch is not available
* fix optim_test_params when torch is not available
* re-org
* small re-org
* fix test_fused_adam_no_apex
* Update src/transformers/training_args.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/training_args.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/training_args.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Remove .value from OptimizerNames
* Rename optimizer strings s|--adam_|--adamw_|
* Also rename Enum options
* small fix
* Fix instantiation of OptimizerNames. Remove redundant test
* Use ExplicitEnum instead of Enum
* Add unit test with string optimizer
* Change optimizer default to string value

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>
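The final shape this commit converges on — string-valued optimizer names backed by an ExplicitEnum, so `--optim` accepts plain strings and rejects unknown ones with a clear message — is a general pattern that can be sketched as follows (member names here are illustrative, not the actual OptimizerNames values):

```python
from enum import Enum


class ExplicitEnum(str, Enum):
    """Enum whose members compare equal to their string values and which
    gives a clearer error message for an unknown value."""

    @classmethod
    def _missing_(cls, value):
        raise ValueError(
            f"{value!r} is not a valid {cls.__name__}, "
            f"please select one of {[m.value for m in cls]}"
        )


class OptimizerNames(ExplicitEnum):
    # Illustrative members only.
    ADAMW_HF = "adamw_hf"
    ADAMW_TORCH = "adamw_torch"
    ADAFACTOR = "adafactor"


# A plain string from the CLI maps directly onto an enum member:
opt = OptimizerNames("adafactor")
print(opt is OptimizerNames.ADAFACTOR)  # True
print(opt == "adafactor")               # True, thanks to the str mixin
```

The `str` mixin is what lets the default be a plain string ("Change optimizer default to string value") while still validating choices centrally.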
-
Stas Bekman authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Lysandre Debut authored
* Updated script
* Commit everything
* Ready for review!
* Update .github/workflows/build_documentation.yml
  Co-authored-by: Julien Chaumond <julien@huggingface.co>

Co-authored-by: Julien Chaumond <julien@huggingface.co>
-
- 12 Jan, 2022 8 commits
-
-
Edoardo Federici authored
* Update run_summarization.py
* Fixed languages and added missing code
* fixed obj, docs, removed source_lang and target_lang
* make style, run_summarization.py reformatted
-
Jake Tae authored
* refactor: wrap forward pass around no_grad context
* Update tests/test_modeling_distilbert.py
* fix: rm `no_grad` from non-integration tests
* chore: rm whitespace change
-
lewtun authored
* Add ONNX classes to main package
* Remove permalinks from ONNX guide
* Fix ToC entry
* Revert "Add ONNX classes to main package" (reverts commit eb794a5b00d66b0b4eab234987301676d8357630)
* Add ONNX classes to main doc
* Fix syntax highlighting in doc
* Fix text
* Add FeaturesManager to doc
* Use paths to reference ONNX classes
* Add FeaturesManager to init
* Add missing ONNX paths
-
Sylvain Gugger authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Leandro von Werra authored
-
Russell Klopfer authored
* use block_size instead of max_seq_length
* fixup
* remove pad_to_block_size

Co-authored-by: Russell Klopfer <russell@kloper.us>
-
Nicolas Patry authored
* Pipeline ASR with LM.
* Revamped into `self.decoder`.
* Fixing.
* 2nd fix.
* Update src/transformers/pipelines/__init__.py
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Fixing.

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
- 11 Jan, 2022 3 commits
-
-
Sylvain Gugger authored
-
Matt authored
* Update TF test_step to match train_step * Update compile() warning to be clearer about what to pass
-
Vladimir Maryasin authored
All tokenizer-specific config properties must be passed to the base class (XLMTokenizer) in order to be saved. This was not the case for the do_lowercase property: it was not saved by the save_pretrained() method, so saving and reloading the tokenizer changed its behaviour. This commit fixes that.
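The underlying failure mode generalizes: a base class can only persist the init kwargs its subclasses actually forward to it, so a property kept local to the subclass silently disappears on save/reload. A stdlib-only sketch (hypothetical class names, not the transformers API):

```python
import json


class BaseTokenizer:
    """Records every init kwarg it receives so they can be round-tripped."""

    def __init__(self, **kwargs):
        self.init_kwargs = kwargs

    def save_pretrained(self):
        # Only kwargs that reached the base class end up in the config.
        return json.dumps(self.init_kwargs)

    @classmethod
    def from_pretrained(cls, config):
        return cls(**json.loads(config))


class BrokenTokenizer(BaseTokenizer):
    def __init__(self, do_lowercase=False, **kwargs):
        super().__init__(**kwargs)  # bug: do_lowercase not forwarded
        self.do_lowercase = do_lowercase


class FixedTokenizer(BaseTokenizer):
    def __init__(self, do_lowercase=False, **kwargs):
        super().__init__(do_lowercase=do_lowercase, **kwargs)  # the fix
        self.do_lowercase = do_lowercase


broken = BrokenTokenizer(do_lowercase=True)
fixed = FixedTokenizer(do_lowercase=True)
print(BrokenTokenizer.from_pretrained(broken.save_pretrained()).do_lowercase)  # False: setting lost
print(FixedTokenizer.from_pretrained(fixed.save_pretrained()).do_lowercase)    # True: setting survives
```

The one-line fix mirrors the commit: forward the subclass-specific kwarg to `super().__init__` so it lands in the saved config.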
-