Commits · cd7961b632c66299a700d8d54912f63a0348d58d · chenpangpang / transformers

14 Jun, 2021 5 commits

Use text_column_name variable instead of "text" (#12132) · cd7961b6

Nicholas Broad authored Jun 14, 2021



* Use text_column_name variable instead of "text"

`text_column_name` was already defined above the lines I changed and is also used below them.

This is a very minor change. If a dataset does not use "text" as the column name, `tokenize_function` now uses whatever column is assigned to `text_column_name`, which falls back to the first column name when "text" is not a column. It makes the function a little more robust, though I would assume that 90%+ of datasets use "text" anyway.

* black formatting

* make style
Co-authored-by: Nicholas Broad <nicholas@nmbroad.com>

cd7961b6
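
The fallback described in this commit message can be sketched in plain Python, without `datasets` or a real tokenizer. Only the names `text_column_name` and `tokenize_function` come from the commit message; `pick_text_column`, `make_tokenize_function`, and the whitespace tokenizer are hypothetical stand-ins, not code from the example scripts.

```python
def pick_text_column(column_names):
    """Return "text" if present, otherwise fall back to the first column."""
    return "text" if "text" in column_names else column_names[0]


def make_tokenize_function(text_column_name):
    # Hypothetical stand-in for a real tokenizer: split on whitespace.
    def tokenize_function(examples):
        # Read from the selected column instead of a hard-coded "text" key.
        return {"tokens": [line.split() for line in examples[text_column_name]]}

    return tokenize_function


# A batch whose text lives in a column called "sentence" rather than "text".
batch = {"sentence": ["hello world", "flax is fun"], "label": [0, 1]}
text_column_name = pick_text_column(list(batch.keys()))
tokenize_function = make_tokenize_function(text_column_name)
tokenized = tokenize_function(batch)
```

With a hard-coded `examples["text"]` lookup, the same batch would raise a `KeyError`, which is the case the commit guards against.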

Don't log anything before logging is setup in examples (#12121) · b8ab5413
Sylvain Gugger authored Jun 14, 2021
```
* Don't log anything before logging is setup in examples

* Last example
```
b8ab5413
[Flax] Add links to google colabs (#12146) · 7566fefa
Patrick von Platen authored Jun 14, 2021
```
* fix_torch_device_generate_test

* remove @

* add colab links
```
7566fefa

add readme for flax clm (#12111) · d36fce82

Suraj Patil authored Jun 14, 2021



* add readme for flax clm

* use section link for tokenizer

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update metrics
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d36fce82

Add mlm pretraining xla torch readme (#12011) · 16c0efca

Patrick von Platen authored Jun 14, 2021



* fix_torch_device_generate_test

* remove @

* upload

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* Update examples/flax/language-modeling/README.md

* add more info

* finish

* fix
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

16c0efca

11 Jun, 2021 1 commit

Flax CLM script (#12023) · 15b498f3

Suraj Patil authored Jun 11, 2021

* first draft

* max_seq_length => block_size

* fix arg names

* fix typos

* fix loss calculation

* add max examples, fix train eval steps, metrics

* optimizer mask

* fix perplexity, metric logging

* fix logging

* data_collator => data_loader

* refactor loss_fn

* support single GPU

* pass distributed to write_metric

* fix jitting

* fix single device training

* fix single device metrics

* close inner progress bars once finished

* add overwrite_cache arg

* fix dataset caching issue

* add more logs

* few small fixes

* address nicholas suggestions

* fix docstr

* address patricks suggestions

* make flake happy

* pass new new_dropout_rng to apply_gradients

* reset train metrics after every epoch

* remove distributed logic, small fixes

15b498f3

10 Jun, 2021 7 commits

add relevant description to tqdm in examples (#11927) · d2753dcb
Bhavitvya Malik authored Jun 11, 2021
```
* add relevant `desc` in examples

* require_version datasets>=1.8.0
```
d2753dcb
Appending label2id and id2label to models to ensure inference works properly (#12102) · bebbdd0f
Matt authored Jun 10, 2021

bebbdd0f
Minor style edits · 4cda08de
Matt authored Jun 10, 2021

4cda08de
Update README.md to cover the TF GLUE example. · 7f08dbd1
Matt authored Jun 10, 2021

7f08dbd1
Fix quality · d72e5a3a
Sylvain Gugger authored Jun 10, 2021

d72e5a3a

New TF GLUE example (#12028) · 73a53265

Matt authored Jun 10, 2021



* Pushing partially-complete new GLUE example

* First draft of the new TF GLUE example! Needs a little more testing to be sure but it's almost ready.

* Fix to the fit() call

* Bugfixes, making sure TPU and multi-GPU support is ready

* Remove logger line that depends on Pytorch

* Style pass

* Deleting old TF GLUE example

* Include label2id and id2label in the saved model config

* Don't clobber the existing model.config.label2id

* Style fixes

* Update examples/tensorflow/text-classification/run_glue.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

73a53265

Add text_column_name and label_column_name to run_ner and run_ner_no_trainer args (#12083) · 472a8676
kumapo authored Jun 10, 2021
```
* Add text_column_name and label_column_name to run_ner args

* Minor fix: grouping for text and label column name
```
472a8676

09 Jun, 2021 5 commits

rm require_version_examples (#12088) · 61e19198
Stas Bekman authored Jun 09, 2021

61e19198
pass decay_mask fn to optimizer (#12087) · d1500d91
Suraj Patil authored Jun 09, 2021

d1500d91

Wav2Vec2 Pretraining (#11306) · d472bd7b

Anton Lozhkov authored Jun 09, 2021



* Working quantizer forward

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Remove custom outputs from the shared ones

* correct conversion

* correct bug

* add first pretrain script

* save intermediate

* static shapes

* save intermediate

* finish first pretrain script version

* more refactor

* remove wandb

* refactor more

* improve test

* correct perplexity compute bug

* finish model implementation

* add to docs

* finish docs

* finish pretraining script

* finish pretraining script

* remove wandb

* finish PR for merge

* finish config

* finish

* make deepspeed work

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply suggestions

* fix flaky test
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d472bd7b

sync LayerDrop for Wav2Vec2Encoder + tests (#12076) · d14e0af2
Stas Bekman authored Jun 09, 2021

d14e0af2
Update run_ner.py with id2label config (#12001) · 82a2b76c
Koichi Yasuoka authored Jun 09, 2021

82a2b76c

08 Jun, 2021 6 commits

[Deepspeed Wav2vec2] integration (#11638) · 11d86d3d

Stas Bekman authored Jun 08, 2021

* wip

* wip - but working with https://github.com/microsoft/DeepSpeed/pull/1044

* cleanup

* workaround

* working 5/8 modes

* solve fp32 distributed zero3

* style

* sync

* sync

* rework

* deprecation

* cleanup

* https://github.com/microsoft/DeepSpeed/pull/1044 pr was merged

* clean up

* add a guide

* more prose

* more prose

* fix

* more prose

* sub_group_size was too big

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* refactor

* bug fix

* make the true check explicit

* new deepspeed release
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

11d86d3d

Properly indent block_size (#12070) · fd690283
Sylvain Gugger authored Jun 08, 2021

fd690283

Add torch to requirements.txt in language-modeling (#12040) · 49bee0ae

cdleong authored Jun 08, 2021



* Add torch to requirements.txt in language-modeling

* Update examples/pytorch/language-modeling/requirements.txt
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

49bee0ae

Replace legacy tensor.Tensor with torch.tensor/torch.empty (#12027) · f5eec0d8
Mario Šaško authored Jun 08, 2021
```
* Replace legacy torch.Tensor constructor with torch.{tensor, empty}

* Remove torch.Tensor in examples
```
f5eec0d8

updated the original RAG implementation to be compatible with latest Pytorch-Lightning (#11806) · e33085d6

Shamane Siri authored Jun 09, 2021

* updated the original RAG implementation to be compatible with the latest PL version

* updated the requirements.txt file

* execute make style

* code quality test

* code quality

* conflict resolved in requirements.txt

* code quality

* changed the MyDDP class name to CustomDDP

e33085d6

adds metric prefix. (#12057) · e363e1d9
Russell Klopfer authored Jun 07, 2021
```
* adds metric prefix.

* update tests to include prefix
```
e363e1d9

03 Jun, 2021 2 commits

[Flax] Refactor MLM (#12013) · 242ec31a

Patrick von Platen authored Jun 03, 2021



* fix_torch_device_generate_test

* remove @

* finish refactor
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

242ec31a

Fix weight decay masking in `run_flax_glue.py` (#11964) · 4674061b

Nicholas Vadivelu authored Jun 03, 2021



* Fix weight decay masking in `run_flax_glue.py`

Issues with the previous implementation:
- The `dict` from `traverse_util.flatten_dict` has keys which are tuples of strings, not one long string with the path separated by periods.
- `optax.masked` applies the transformation wherever the mask is True, so the previous masks were inverted.
- Flax's LayerNorm calls the scale parameter `scale`, not `weight`.

* Fix formatting with black

* adapt results
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

4674061b
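
The three issues listed in this commit message can be illustrated with a small dependency-free sketch. The `flatten_dict` below is a pure-Python stand-in that only mimics the tuple-key behavior the message attributes to `traverse_util.flatten_dict`; `decay_mask` and the toy `params` tree are hypothetical and are not taken from `run_flax_glue.py`.

```python
def flatten_dict(tree, prefix=()):
    """Stand-in flattener: keys each leaf by a tuple of strings (its path)."""
    flat = {}
    for key, value in tree.items():
        path = prefix + (key,)
        if isinstance(value, dict):
            flat.update(flatten_dict(value, path))
        else:
            flat[path] = value
    return flat


def decay_mask(params):
    """True where weight decay SHOULD apply; False for biases and LayerNorm scales."""
    flat = flatten_dict(params)
    return {
        # Compare tuple elements, not a dotted string, and look for "scale".
        path: path[-1] != "bias" and path[-2:] != ("LayerNorm", "scale")
        for path in flat
    }


# Toy parameter tree (assumed layout, for illustration only).
params = {
    "dense": {"kernel": 1.0, "bias": 0.0},
    "LayerNorm": {"scale": 1.0, "bias": 0.0},
}
mask = decay_mask(params)
```

Because the message states that `optax.masked` applies the transformation wherever the mask is True, the mask marks the parameters to decay rather than the ones to skip.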

02 Jun, 2021 1 commit

Bump urllib3 from 1.25.8 to 1.26.5 in /examples/research_projects/lxmert (#11983) · 6db3a87d

dependabot[bot] authored Jun 02, 2021

Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.25.8 to 1.26.5.
- [Release notes](https://github.com/urllib3/urllib3/releases)
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst)
- [Commits](https://github.com/urllib3/urllib3/compare/1.25.8...1.26.5)

---
updated-dependencies:
- dependency-name: urllib3
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

6db3a87d

01 Jun, 2021 2 commits

modify qa-trainer (#11872) · 7e73601f
Fan Zhang authored Jun 01, 2021
```
* modify qa-trainer

* fix flax model
```
7e73601f

RAG-2nd2end-revamp (#11893) · 9ec0f01b

Shamane Siri authored Jun 01, 2021



* initial

* code quality test

* code quality

* added test functions in test_modeling_rag.py and test_retrieval_rag.py to test end2end retriever

* minor change in test_modeling_rag

* fixed tests

* Update examples/research_projects/rag-end2end-retriever/README.md

typo corrected as suggested by lhoestq
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

* Update examples/research_projects/rag-end2end-retriever/finetune_rag.py

type change suggested by lhoestq
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

* Update src/transformers/models/rag/retrieval_rag.py

Adding this change as mentioned by lhoestq.
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

* completed the minor changes suggested by the reviewers
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

9ec0f01b

31 May, 2021 2 commits

Add MT5ForConditionalGeneration as supported arch. to summarization README (#11961) · cfca638a
Philip May authored May 31, 2021
```
* Add MT5ForConditionalGeneration as supported arch.

* Update README.md
```
cfca638a

Remove redundant `nn.log_softmax` in `run_flax_glue.py` (#11920) · 1ab147d6

Nicholas Vadivelu authored May 31, 2021

* Remove redundant `nn.log_softmax` in `run_flax_glue.py`

`optax.softmax_cross_entropy` expects unnormalized logits and already calls `nn.log_softmax` itself, so I believe the extra call is not needed here. `nn.log_softmax` is idempotent, so mathematically it shouldn't have made a difference.

* Remove unused 'flax.linen' import

1ab147d6
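
The idempotence claim in this commit message can be checked numerically with a short standard-library sketch. The `log_softmax` helper here is a hypothetical list-based reimplementation for illustration, not the Flax or optax function.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    shift = max(logits)
    log_norm = shift + math.log(sum(math.exp(x - shift) for x in logits))
    return [x - log_norm for x in logits]


logits = [2.0, -1.0, 0.5]
once = log_softmax(logits)
# Applying it again subtracts log(sum(exp(once))) = log(1) = 0.
twice = log_softmax(once)
```

After the first application the exponentiated values already sum to 1, so a second application changes nothing beyond floating-point rounding.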

26 May, 2021 1 commit

Link official Cloud TPU JAX docs (#11892) · 2df54691
Avital Oliver authored May 26, 2021

2df54691

25 May, 2021 4 commits

[Examples] create model with custom config on the fly (#11798) · 1b653010

Stas Bekman authored May 25, 2021



* create custom model on the fly

* better wording

* add update_from_string

* cleanup

* cleanup

* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* more bool options

* style

* fix logger

* add test

* add the doc

* assert on conflict of options
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

1b653010

[lm examples] fix overflow in perplexity calc (#11855) · 6287c929
Stas Bekman authored May 25, 2021
```
* fix overflow in perplexity calc

* use inf

* fix
```
6287c929
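
The commit bullets above mention an overflow in the perplexity calculation and the use of `inf`. A minimal sketch of that kind of guard, assuming perplexity is computed as the exponential of the evaluation loss (the actual diff is not shown on this page, and `safe_perplexity` is a hypothetical name):

```python
import math


def safe_perplexity(eval_loss):
    """Return exp(eval_loss), or infinity when the result overflows a float."""
    try:
        return math.exp(eval_loss)
    except OverflowError:
        # math.exp raises for very large inputs; report infinity instead.
        return float("inf")


small = safe_perplexity(1.0)
huge = safe_perplexity(1000.0)
```

Returning infinity keeps metric logging from crashing when a diverged run produces a very large loss.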
Add option to log only once in multinode training (#11819) · f086652b
Sylvain Gugger authored May 25, 2021
```
* Add option to log only once in multinode training

* Use an alternate property
```
f086652b
typo (#11858) · b8344a27
Wang Ran (汪然) authored May 25, 2021

b8344a27

24 May, 2021 1 commit

[Flax] Fix PyTorch import error (#11839) · f5806041
Patrick von Platen authored May 24, 2021
```
* fix_torch_device_generate_test

* remove @

* change pytorch import to flax import
```
f5806041

21 May, 2021 3 commits

Add flax text class colab (#11824) · da22245e
Patrick von Platen authored May 21, 2021
```
* fix_torch_device_generate_test

* remove @

* add flax glue link
```
da22245e

[Flax] Small fixes in `run_flax_glue.py` (#11820) · 82335185

Patrick von Platen authored May 21, 2021



* fix_torch_device_generate_test

* remove @

* correct best seed for flax fine-tuning
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

82335185

[Flax] Align GLUE training script with mlm training script (#11778) · bd987165

Patrick von Platen authored May 21, 2021



* speed up flax glue

* remove unnecessary line

* remove folder

* remove run in loop
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

bd987165