Commits · cebb96f53a0a8cc2e56c106ae64288ef397abffe · chenpangpang / transformers

18 May, 2021 4 commits

Add more subsections to main doc (#11758) · cebb96f5
Patrick von Platen authored May 18, 2021
```
* add headers to main doc

* Apply suggestions from code review

* update

* upload
```
Fix incorrect newline in #11650 (#11757) · da7e73b7
Tommy Chiang authored May 18, 2021

Fix checkpoint deletion (#11748) · a515caa3
Sylvain Gugger authored May 18, 2021


[TokenClassification] Label realignment for subword aggregation (#11680) · b88e0e01

Nicolas Patry authored May 18, 2021

* [TokenClassification] Label realignment for subword aggregation

Tentative replacement for https://github.com/huggingface/transformers/pull/11622/files



- Added `AggregationStrategy`
- `ignore_subwords` and `grouped_entities` arguments are now fused
  into `aggregation_strategy`. This makes more sense because
  `ignore_subwords=True` with `grouped_entities=False` had no
  meaning.
- Added two new aggregation strategies: MAX and AVERAGE
- AVERAGE requires a bit more information than the others. For now this
  case is slightly specific; we should keep that in mind for future
  changes.
- Testing has been modified to reflect the new argument, and to check the
  correct deprecation and the new `aggregation_strategy`.
- Put the testing arguments and testing results for `aggregation_strategy`
  close together, so that readers can understand what is supposed to
  happen.
- `aggregate` is now only tested on a small model, as it does not mean
  anything to test it globally for all models.
- Previous tests are unchanged in desired output.
- Added a new test case that better showcases the difference between the
  FIRST, MAX and AVERAGE strategies.

* Wrong framework.

* Addressing three issues.

1- Tags might not follow the B-, I- convention, so any tag should work now
(it is assumed to be B-TAG).
2- Fixed an issue with AVERAGE that led to a substantial code change.
3- The testing suite was not checking for the "index" key for the "none"
strategy. This is now fixed.

The issue was that "O" could not be chosen by the AVERAGE strategy because
those tokens were filtered out beforehand, so their relative scores were
not counted in the average. Filtering on ignore_labels now happens at the
very end of the pipeline, which fixes that issue.
It is a bit hard to make sure this stays that way because we do
not have an end-to-end test for that behavior.

* Formatting.

* Adding formatting to code + cleaner handling of B-, I- tags.
Co-authored-by: Francesco Rubbo <rubbo.francesco@gmail.com>
Co-authored-by: elk-cloner <rezakakhki.rk@gmail.com>

* Typo.
Co-authored-by: Francesco Rubbo <rubbo.francesco@gmail.com>
Co-authored-by: elk-cloner <rezakakhki.rk@gmail.com>
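
A minimal sketch of how the FIRST, MAX and AVERAGE strategies described in this commit can differ for one word that was split into several subword tokens. The `Strategy` enum, the `aggregate_word` helper and the score values are illustrative assumptions, not the pipeline's actual code. The example also shows why "O" has to stay in the scores until the end: with AVERAGE it can win even though no single token prefers it outright.

```python
from enum import Enum
from typing import Dict, List, Tuple


class Strategy(Enum):
    # Hypothetical names mirroring the strategies listed in the commit message.
    FIRST = "first"
    MAX = "max"
    AVERAGE = "average"


def aggregate_word(
    token_scores: List[Dict[str, float]], strategy: Strategy
) -> Tuple[str, float]:
    """Pick one (label, score) for a word from its per-subword label scores."""
    if strategy is Strategy.FIRST:
        # Use only the first subword of the word.
        scores = token_scores[0]
        label = max(scores, key=scores.get)
        return label, scores[label]
    if strategy is Strategy.MAX:
        # Use the subword whose best label has the highest score.
        best = max(token_scores, key=lambda s: max(s.values()))
        label = max(best, key=best.get)
        return label, best[label]
    if strategy is Strategy.AVERAGE:
        # Average every label's score over all subwords, then take the best.
        labels = token_scores[0].keys()
        avg = {
            lab: sum(s[lab] for s in token_scores) / len(token_scores)
            for lab in labels
        }
        label = max(avg, key=avg.get)
        return label, avg[label]
    raise ValueError(f"unknown strategy: {strategy}")


# Made-up scores for a single word split into three subword tokens.
word_tokens = [
    {"O": 0.40, "PER": 0.50, "ORG": 0.10},
    {"O": 0.40, "PER": 0.00, "ORG": 0.60},
    {"O": 0.45, "PER": 0.45, "ORG": 0.10},
]

for strategy in Strategy:
    print(strategy.name, aggregate_word(word_tokens, strategy))
```

With these scores the three strategies each pick a different label, and AVERAGE picks "O". If the "O" scores were dropped before averaging, that outcome would be impossible, which is the issue the commit message describes.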


17 May, 2021 7 commits
- push (#11750) · c73e3532
  Patrick von Platen authored May 17, 2021
  
- Use new evaluation loop in TrainerQA (#11746) · 936b5715
  Sylvain Gugger authored May 17, 2021
  
- [BigBird Pegasus] Make tests faster (#11744) · 73893fc7
  Patrick von Platen authored May 17, 2021
```
* improve tests

* remove bogus file

* make style
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
```
- fixed shape issue for T5 tracing (#11742) · a0531c8a
  Michael Benayoun authored May 17, 2021
```
Co-authored-by: Michael Benayoun <michael@huggingface.co>
```
- Add visual + link to Premium Support webpage (#11740) · 0fc56df5
  Julien Chaumond authored May 17, 2021
```
* Update README.md

* Update index.rst
```
- Remove tapas model card (#11739) · 2f88bd9c
  Julien Chaumond authored May 17, 2021
  
- Improvements to Flax finetuning script (#11727) · 726e953d
  Marc van Zee authored May 17, 2021
```
* Add Cloud details to README

* Flax script and readme updates

* Some simplifications of Flax script
```
14 May, 2021 4 commits
- Experimental symbolic tracing feature with torch.fx for BERT, ELECTRA and T5 (#11475) · 86d5fb0b
  Michael Benayoun authored May 14, 2021
```
Symbolic tracing feature for BERT, ELECTRA and T5
Co-authored-by: Michael Benayoun <michael@huggingface.co>
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
- Add Cloud details to README (#11706) · 94a23487
  Marc van Zee authored May 14, 2021
```
* Add Cloud details to README

* Flax script and readme updates
```
- correct example script (#11726) · 113eaa75
  Patrick von Platen authored May 14, 2021
  
- Fix T5 beam search using parallelize (#11717) · bd3b599c
  Oyvind Tafjord authored May 14, 2021
  
13 May, 2021 8 commits

Fix loading the best model on the last stage of training (#11718) · 218d552f
Volodymyr Byno authored May 13, 2021

Fix v4.6.0 doc · 25208200
Sylvain Gugger authored May 13, 2021

Fix doc deployment · cbbf49f6
Sylvain Gugger authored May 13, 2021


[T5] Add 3D attention mask to T5 model (2) (#9643) (#11197) · 91cf2915

lexhuismans authored May 13, 2021

* Add 3D attention mask to T5 model (#9643)

Added code for a 3D attention mask in the T5 model, similar to the BERT model.

* Add test for 3D attention mask

Added a test for the 3D attention mask: test_decoder_model_past_with_3d_attn_mask().
It uses 3D masks of shape [batch_size, seq_length, seq_length] for both the
attention mask and the decoder attention mask. The test is passing.
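
A rough sketch of what accepting a 3D mask means in practice: a 2D mask of shape [batch_size, seq_length] and a 3D mask of shape [batch_size, seq_length, seq_length] are both expanded to a 4D additive mask that broadcasts over attention heads. The function name `extend_attention_mask`, the nested-list representation and the `MASKED` value are assumptions made for illustration; the real model code works on tensors.

```python
from typing import List, Sequence

# Assumption: a large negative number added to masked attention scores
# before the softmax, so that masked positions get near-zero weight.
MASKED = -1e4


def _to_additive(row: Sequence[int]) -> List[float]:
    """Turn a row of 1 (attend) / 0 (ignore) flags into additive biases."""
    return [0.0 if flag else MASKED for flag in row]


def extend_attention_mask(mask: Sequence) -> List:
    """Expand a 2D or 3D 0/1 mask to a 4D additive mask.

    2D input [batch][key]        -> [batch][1][1][key]
    3D input [batch][query][key] -> [batch][1][query][key]
    The singleton second axis is where the attention heads broadcast.
    """
    extended = []
    for item in mask:
        if item and isinstance(item[0], (list, tuple)):
            # 3D case: every query position has its own row of key flags.
            extended.append([[_to_additive(row) for row in item]])
        else:
            # 2D case: one row of key flags shared by all query positions.
            extended.append([[_to_additive(item)]])
    return extended


# 2D padding mask: the last token of the only sequence is padding.
padding_mask = [[1, 1, 0]]
# 3D mask: a causal pattern for a sequence of length 2.
causal_mask = [[[1, 0], [1, 1]]]

print(extend_attention_mask(padding_mask))
print(extend_attention_mask(causal_mask))
```

The 3D form is what lets a caller express per-query patterns (for example causal or block-wise attention) that a single padding row cannot describe.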


add everything (#11651) · 6ee1a4fd
Vasudev Gupta authored May 13, 2021


[Flax] Fix BERT initialization & token_type_ids default (#11695) · 57b6a80d

Patrick von Platen authored May 13, 2021



* fix some stuff

* fix roberta & electra as well

* del run bug
Co-authored-by: Patrick von Platen <patrick@huggingface.co>


Fix gpt-2 warnings (#11709) · daf0d6a9
Lysandre Debut authored May 13, 2021


Enable option for subword regularization in more tokenizers. (#11417) · 37ed3ab7

Philip May authored May 13, 2021

* improve slow class tok usage at xlm rob

* add subword regularization for barthez

* improve barthez tok. test

* fix tokenizer tests

* add subword regularization for camembert

* add subword regularization for deberta v2 tokenizer

* add more doc to deberta v2 tokenizer

* add subword regularization for speech to text tok.

* fix sp_model_kwargs type in speech 2 text tok.

* add subword regularization for M2M100 tok.

* add more concrete type hints

* fix tests for m2m100 and s2t tok.

* add missing Any import

* fix syntax error in m2m100 tok.

* fix unpickle of m2m100 and s2t tok.

* fix test of m2m100 and s2t tok.

* improve unpickle of deberta v2 tok.

* add test for pickle of barthez & camembert

* fix pickle of barthez & camembert

* add test for deberta v2 tok. pickle

* fix m2m100 tok. pickle

* fix s2t tok. pickle

* add subword regularization to albert tok.

* refactor subword reg. test into TokenizerTesterMixin

improve albert tok. test

remove sample argument from albert tok.

check subword reg. using TokenizerTesterMixin

improve tok. tests

improve xlm roberta tok. tests

improve xlm roberta tok. tests

* add subword regularization for big bird t.

* improve xlm roberta tok. test

* add subword regularization for mbart50 tok.

* add subword regularization for pegasus tok.

* add subword regularization for reformer tok.

* add subword regularization for T5 tok.

* fix t5 tok. test formatting

* add subword regularization for xlm_proph. tok.

* add subword regularization for xlnet tok.

* add subword regularization for bert_gen tok.

* add typing to tokenizers

* add typing to xlm rob. tok

* add subword regularization for marian tok.

* add reverse tok. test

* fix marian tok test

* fix marian tok test

* fix casing in tok. tests

* fix style of tok. common test

* fix deberta v2 tok test

* add type annotations to tok. tests

* add type annotations to tok. __init__

* add typing to tokenizer

* add type annotations to tok. __init__

* don't specify the default when it's None

* fix barthez tok. doc

* move sentencepiece tok. tests to TokenizerTesterMixin

* fix unused imports

* fix albert tok. test

* add comment to sentencepiece test options

* fix Any import at big bird tok.

* fix Any import at xlm prophetnet tok.

* empty commit to trigger CI
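
For readers unfamiliar with the term, here is a toy sketch of the idea behind subword regularization: instead of always using one fixed segmentation, a segmentation is sampled from the candidates in proportion to its unigram probability raised to a smoothing exponent. The vocabulary, its log-probabilities and the helper names are invented for illustration. The tokenizers touched by this commit delegate the actual sampling to SentencePiece through `sp_model_kwargs`; this sketch does not reproduce that code.

```python
import math
import random
from typing import Dict, List


def segmentations(word: str, vocab: Dict[str, float]) -> List[List[str]]:
    """Enumerate every way to split `word` into pieces found in `vocab`."""
    if not word:
        return [[]]
    results = []
    for end in range(1, len(word) + 1):
        piece = word[:end]
        if piece in vocab:
            for rest in segmentations(word[end:], vocab):
                results.append([piece] + rest)
    return results


def sample_segmentation(
    word: str, vocab: Dict[str, float], alpha: float, rng: random.Random
) -> List[str]:
    """Sample one segmentation with probability proportional to P(seg) ** alpha.

    `vocab` maps each piece to a log-probability. alpha = 0 samples uniformly;
    a larger alpha concentrates on the most likely segmentation.
    """
    candidates = segmentations(word, vocab)
    weights = [
        math.exp(alpha * sum(vocab[piece] for piece in seg)) for seg in candidates
    ]
    return rng.choices(candidates, weights=weights, k=1)[0]


# Hypothetical unigram vocabulary with made-up log-probabilities.
toy_vocab = {
    "hello": -2.0, "hell": -3.0, "he": -2.5, "llo": -3.5, "ll": -3.0,
    "lo": -3.0, "h": -4.0, "e": -4.0, "l": -4.0, "o": -4.0,
}

rng = random.Random(0)
for _ in range(5):
    print(sample_segmentation("hello", toy_vocab, alpha=0.5, rng=rng))
```

Every sampled segmentation still concatenates back to the original word, so the model sees the same text under varying tokenizations during training.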


12 May, 2021 9 commits

Vit deit fixes (#11309) · fa84540e

NielsRogge authored May 12, 2021



* Improve docs of DeiT and ViT, add community notebook

* Add gitignore for test_samples

* Add notebook with Trainer
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


Docs for v4.7.0.dev0 · d77eb0cf
Lysandre authored May 12, 2021

Release: v4.6.0 · 64e78564
Lysandre authored May 12, 2021


[Lazy init] Force fall back to slow init for composite models (#11705) · fd6204b2

Patrick von Platen authored May 12, 2021



* fix encoder-decoder & RAG

* finalize

* Update src/transformers/models/encoder_decoder/modeling_encoder_decoder.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/models/rag/modeling_rag.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


fix example in config doc (#11696) · 5c1cda9d
Suraj Patil authored May 12, 2021

remove defaults to None if optional (#11703) · 77f4c46b
Philip May authored May 12, 2021

Updates README and fixes bug (#11701) · 6797cdc0
Marc van Zee authored May 12, 2021

Fix clip docs (#11694) · f063c56d
Suraj Patil authored May 12, 2021
```
* fix doc url

* fix example
```

CLIP (#11445) · 8719afa1

Suraj Patil authored May 12, 2021



* begin second draft

* fix import, style

* add loss

* fix embeds, logits_scale, and projection

* fix imports

* add conversion script

* add feature_extractor and processor

* style

* add tests for tokenizer, extractor and processor

* add vision model tests

* add weight init

* add more tests

* fix save_load test

* model output, docstrings, causal mask

* config doc

* add clip model tests

* return dict

* begin integration test

* add integration tests

* fix-copies

* fix init

* Clip => CLIP

* fix module name

* docs

* fix doc

* output_dim => projection_dim

* fix checkpoint names

* remove fast tokenizer file

* fix conversion script

* fix tests, quality

* put causal mask on device

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix attribute test

* style

* address Sylvain's comments

* style

* fix docstrings

* add quick_gelu in activations, docstrings

* clean-up attention test

* fix act fun

* fix config

* fix torchscript tests

* even batch_size

* remove comment

* fix output to_tuple

* fix save load tests

* fix add tokens test

* add fast tokenizer

* update copyright

* new processor API

* fix docs

* docstrings

* docs

* fix doc

* fix doc

* fix tokenizer

* fix import in doc example

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* check types of config

* valhalla => openai

* load image using url

* fix test

* typo
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
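
A small sketch of the quick GELU activation that this commit adds to the activations. The formula used here, x * sigmoid(1.702 * x), is the commonly cited sigmoid approximation of GELU; the function names and the comparison against the exact GELU are illustrative and are not taken from the commit itself.

```python
import math


def _sigmoid(z: float) -> float:
    """Numerically stable logistic function for plain floats."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def quick_gelu(x: float) -> float:
    """Sigmoid approximation of GELU: x * sigmoid(1.702 * x)."""
    return x * _sigmoid(1.702 * x)


def exact_gelu(x: float) -> float:
    """Reference GELU using the Gaussian CDF, for comparison only."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


for value in (-2.0, -0.5, 0.0, 0.5, 2.0):
    print(value, quick_gelu(value), exact_gelu(value))
```

The approximation avoids evaluating the error function and stays close to the exact curve over typical activation ranges.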


11 May, 2021 8 commits

Adds Flax BERT finetuning example on GLUE (#11564) · 4ce6bcc3

Marc van Zee authored May 11, 2021



* Adds Flax BERT finetuning example

* fix traced jax tensor type

* Use Optax losses and learning schedulers

* Add 1GPU training results

* merge into master & make style

* fix input

* del file

* Fix bug in loss and add torch runs

* finish bert flax fine-tune

* Update examples/flax/text-classification/README.md

* Update examples/flax/text-classification/run_flax_glue.py

* add requirements

* finalize

* finalize
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick@huggingface.co>


Test checkpointing (#11682) · f13f1f8f
Sylvain Gugger authored May 11, 2021
```
* Add test and see where CI is unhappy

* Load with strict=False
```
Fix TF Roberta for mixed precision training (#11675) · d9b28627
Julien Plu authored May 11, 2021


Auto modelcard (#11599) · a135f595

Sylvain Gugger authored May 11, 2021



* Autogenerate model cards from the Trainer

* ModelCard deprecated

* Fix test

* Style

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Address review comments

* Quality

* With all metadata

* Metadata

* Post-merge conflict mess

* Data args and all examples

* Default license and languages when possible
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


Grammar and style edits for the frontpage README (#11679) · b3429ab6

Matt authored May 11, 2021



* Grammar and style edits for the frontpage README

* Going all-in on em-dashes because you only live once

* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


Fix docstring of description about input_ids (#11672) · 901153c6
nxznm authored May 11, 2021

Add --text_column to run_summarization_no_trainer (#11673) · 64232bc0
Jonathan Chang authored May 11, 2021

Add MacOS TF version (#11674) · 024cd19b
Julien Plu authored May 11, 2021
```
Co-authored-by: Julien Plu <jplu@argos.local>
```