Commits · 09af5bdea350061a46161c3f79c23c9777990206 · chenpangpang / transformers

06 Jul, 2021 6 commits

Replace `nn.Moudle` by `nn.Module` (#12541) · 09af5bde
SaulLu authored Jul 06, 2021

09af5bde
Update README.md · f42a0abf
Patrick von Platen authored Jul 06, 2021

f42a0abf
Update README (#12540) · 029b9d3f
Suzana Ilić authored Jul 06, 2021

029b9d3f

Suraj Patil authored Jul 06, 2021

* flax gpt neo

* fix query scaling

* update generation test

* use flax model for test

7a259c19

[RoFormer] Fix some issues (#12397) · 626a0a01

yujun authored Jul 06, 2021



* add RoFormerTokenizerFast into AutoTokenizer

* fix typo in roformer docs

* make onnx export happy

* update RoFormerConfig embedding_size

* use jieba not rjieba

* fix 12244 and make test_alignement passed

* update ARCHIVE_MAP

* make style & quality & fixup

* update

* make style & quality & fixup

* make style quality fixup

* update

* suggestion from LysandreJik
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* make style

* use rjieba
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

626a0a01

[Flax] Fix hybrid clip (#12519) · f5b0c1ec
Suraj Patil authored Jul 06, 2021
```
* fix saving and loading

* update readme
```
f5b0c1ec

05 Jul, 2021 12 commits
- [Wav2Vec2] Flax - Adapt wav2vec2 script (#12520) · 7d6285a9
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* adapt flax pretrain script
```
  7d6285a9
- [Flax] Fix another bug in logging steps (#12516) · 4605b2b8
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* up
```
  4605b2b8
- [Flax] Correct logging steps flax (#12515) · d0f7508a
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* push
```
  d0f7508a
- [Flax] Correct flax training scripts (#12514) · bb4ac2b5
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* add logging steps

* correct training scripts

* correct readme

* correct
```
  bb4ac2b5
- NER example for Tensorflow (#12469) · ea556750
  Matt authored Jul 05, 2021
```
* NER example for Tensorflow

* Style pass

* Style pass

* Added metric computation on the evaluation set

* Style pass

* Fixed label masking

* Style pass

* Style pass
```
  ea556750
- [Flax] Dataset streaming example (#12470) · 9b908105
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* upload

* finish dataset streaming

* adapt readme

* finish

* up

* up

* up

* up

* Apply suggestions from code review

* finish

* make style

* make style2

* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
```
  9b908105
- flax.linen.apply takes state as the first param, followed by the input (#12510) · eceb1042
  Navjot authored Jul 05, 2021
  
  eceb1042
- [Flax] ViT training example (#12300) · f1c81d6b
  Suraj Patil authored Jul 05, 2021
```
* begin script

* clean example, add readme

* update readme

* remove decay mask

* remove masking

* update readme & make flake happy
```
  f1c81d6b
- [Flax] Fix wav2vec2 pretrain arguments (#12498) · e799e0f1
  Akmal authored Jul 05, 2021
  
  e799e0f1
- create LxmertModelIntegrationTest Pytorch (#9989) · 0e1718af
  sadakmed authored Jul 05, 2021
```
* create LxmertModelIntegrationTest

* implementation using numpy seeding to fix inputs params.

* fix code quality

* isort check
```
  0e1718af
- [examples/flax] clip style image-text training example (#12491) · 23ab0b69
  Suraj Patil authored Jul 05, 2021
```
* clip style example

* fix post init

* add requirements

* update readme, few small fixes
```
  23ab0b69
- Add `Repository` import to the FLAX example script (#12501) · 89a8739f
  Lysandre Debut authored Jul 05, 2021
  
  89a8739f
04 Jul, 2021 1 commit
- Update README.md · 2df63282
  Patrick von Platen authored Jul 04, 2021
  
  2df63282
02 Jul, 2021 7 commits
- Add guide on how to build demos for the Flax sprint (#12468) · a76eebfc
  Omar Sanseviero authored Jul 02, 2021
  
  a76eebfc
- Update README.md · b21905e0
  Patrick von Platen authored Jul 02, 2021
  
  b21905e0
- Update README.md · d24a5231
  Patrick von Platen authored Jul 02, 2021
  
  d24a5231
- Update README.md · e3fce2f8
  Patrick von Platen authored Jul 02, 2021
```
Thanks a lot @BirgerMoell
```
  e3fce2f8
- Fix TAPAS test uncovered by #12446 (#12480) · b889d3f6
  Lysandre Debut authored Jul 02, 2021
  
  b889d3f6
- fixed typo in flax-projects readme (#12466) · b4ecc6be
  Matthew LeMay authored Jul 02, 2021
  
  b4ecc6be
- Rework notebooks and move them to the Notebooks repo (#12471) · e52288a1
  Sylvain Gugger authored Jul 02, 2021
  
  e52288a1
01 Jul, 2021 12 commits

[roberta] fix lm_head.decoder.weight ignore_key handling (#12446) · 2d1d9218

Stas Bekman authored Jul 01, 2021



* fix lm_head.decoder.weight ignore_key handling

* fix the mutable class variable

* Update src/transformers/models/roberta/modeling_roberta.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* replicate the comment

* make deterministic
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

2d1d9218

Fixing bug with param count without embeddings (#12461) · 7f0027db

Teven authored Jul 01, 2021



* fixing bug with param count without embeddings

* Update src/transformers/modeling_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

7f0027db

Validation split added: custom data files @sgugger, @patil-suraj (#12407) · d5b8fe3b

Souvic Chakraborty authored Jul 01, 2021



* Validation split added: custom data files

Validation split added in case of no validation file and loading custom data

* Updated documentation with custom file usage

Updated documentation with custom file usage

* Update README.md

* Update README.md

* Update README.md

* Made some suggested stylistic changes

* Used logger instead of print.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Made similar changes to add validation split

In case of a missing validation file, a validation split will be used now.

* max_train_samples to be used for training only

max_train_samples got misplaced, now corrected so that it is applied on training data only, not whole data.

* styled

* changed ordering

* Improved language of documentation
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Improved language of documentation
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Fixed styling issue

* Update run_mlm.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d5b8fe3b
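
The two behaviours this commit describes (falling back to a validation split when no validation file is supplied, and applying `max_train_samples` to the training data only) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the code from `run_mlm.py`: the helper name `split_and_cap`, the `validation_percentage` parameter and the list-based data are hypothetical and chosen only for this example.

```python
from typing import List, Optional, Sequence, Tuple


def split_and_cap(
    train_rows: Sequence[str],
    validation_rows: Optional[Sequence[str]] = None,
    validation_percentage: int = 5,
    max_train_samples: Optional[int] = None,
) -> Tuple[List[str], List[str]]:
    """Hypothetical helper: return (train, validation) lists."""
    train = list(train_rows)
    if validation_rows is None:
        # No validation file given: carve the first N% of the data off
        # as the validation split and train on the remainder.
        n_val = len(train) * validation_percentage // 100
        validation, train = train[:n_val], train[n_val:]
    else:
        validation = list(validation_rows)
    # The cap is applied after the split, so it limits the training
    # data only and never shrinks the validation split.
    if max_train_samples is not None:
        train = train[:max_train_samples]
    return train, validation


rows = [f"line {i}" for i in range(100)]
train, validation = split_and_cap(
    rows, validation_percentage=10, max_train_samples=20
)
print(len(train), len(validation))  # prints "20 10"
```

Capping before the split would shrink the validation split along with the training data, which is the misplacement the commit message says was corrected.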

Import check_inits handling of duplicate definitions. (#12467) · f929462b
Thibault FEVRY authored Jul 01, 2021
```
* Import fix_inits handling of duplicate definitions.

* Style fix
```
f929462b

Add TPU README (#12463) · 7f87bfc9

Patrick von Platen authored Jul 01, 2021



* Add TPU README

* Apply suggestions from code review

* Update examples/research_projects/jax-projects/README.md

* Update examples/research_projects/jax-projects/README.md
Co-authored-by: Stefan Schweter <stefan@schweter.it>

7f87bfc9

Update README.md · 1457839f
Patrick von Platen authored Jul 01, 2021

1457839f
Added talk details (#12465) · c18af5d4
Suzana Ilić authored Jul 01, 2021

c18af5d4
Fix training_args.py barrier for torch_xla (#12464) · 6c5b20aa
Jin Young (Daniel) Sohn authored Jul 01, 2021
```
torch_xla currently has its own synchronization primitives, so use
xm.rendezvous(tag) instead.
```
6c5b20aa
Comment fast GPU TF tests (#12452) · 2a501ac9
Lysandre Debut authored Jul 01, 2021

2a501ac9
[Wav2Vec2, Hubert] Fix ctc loss test (#12458) · 27d348f2
Patrick von Platen authored Jul 01, 2021
```
* fix_torch_device_generate_test

* remove @

* fix test
```
27d348f2

[Flax community event] How to use hub during training (#12447) · b655f16d

Patrick von Platen authored Jul 01, 2021



* fix_torch_device_generate_test

* remove @

* upload

* finish doc

* Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* finish
Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

b655f16d

Add test for a WordLevel tokenizer model (#12437) · 3aa37b94
SaulLu authored Jul 01, 2021
```
* add a test for a WordLevel tokenizer

* adapt common test to new tokenizer
```
3aa37b94

30 Jun, 2021 2 commits

[Flax] Add wav2vec2 (#12271) · 0d1f67e6

Patrick von Platen authored Jun 30, 2021



* fix_torch_device_generate_test

* remove @

* start flax wav2vec2

* save intermediate

* forward pass has correct shape

* add weight norm

* add files

* finish ctc

* make style

* finish gumbel quantizer

* correct docstrings

* correct some more files

* fix vit

* finish quality

* correct tests

* correct docstring

* correct tests

* start wav2vec2 pretraining script

* save intermediate

* start pretraining script

* finalize pretraining script

* finish

* finish

* small typo

* finish

* correct

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* make style

* push
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

0d1f67e6

[JAX/Flax readme] add philosophy doc (#12419) · 3f36a2c0

Suraj Patil authored Jun 30, 2021



* add philosophy doc

* fix typos

* update doc

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* address Patricks suggestions

* add a training example and fix typos

* jit the training step

* jit train step

* fix example code

* typo

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

3f36a2c0