Commits · 2ce0fb84cc500a26b0c45bec1f8a42e33d13e05d · chenpangpang / transformers

04 May, 2021 1 commit

Make quality scripts work when one backend is missing. (#11573) · 2ce0fb84

Sylvain Gugger authored May 04, 2021

* Make quality scripts work when one backend is missing.

* Check env variable is properly set

* Add default

* With print statements

* Fix typo

* Set env variable

* Remove debug code


23 Apr, 2021 4 commits

Use 3 workers for torch tests · 81a6c7cd
Sylvain Gugger authored Apr 23, 2021

Wrong branch Sylvain... · ca6b80ca
Sylvain Gugger authored Apr 23, 2021

Try to trigger failure more · 3951fc55
Sylvain Gugger authored Apr 23, 2021


Trainer push to hub (#11328) · bf2e0cf7

Sylvain Gugger authored Apr 23, 2021



* Initial support for upload to hub

* push -> upload

* Fixes + examples

* Fix torchhub test

* Torchhub test I hate you

* push_model_to_hub -> push_to_hub

* Apply mixin to other pretrained models

* Remove ABC inheritance

* Add tests

* Typo

* Run tests

* Install git-lfs

* Change approach

* Add push_to_hub to all

* Staging test suite

* Typo

* Maybe like this?

* More deps

* Cache

* Adapt name

* Quality

* MOAR tests

* Put it in testing_utils

* Docs + torchhub last hope

* Styling

* Wrong method

* Typos

* Update src/transformers/file_utils.py
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Address review comments

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Julien Chaumond <julien@huggingface.co>


21 Apr, 2021 1 commit

Examples reorg (#11350) · dabeb152

Sylvain Gugger authored Apr 21, 2021



* Base move

* Examples reorganization

* Update references

* Put back test data

* Move conftest

* More fixes

* Move test data to test fixtures

* Update path

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments and clean
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


13 Apr, 2021 1 commit
- Document v4.5.1 · 893e51a5
  Sylvain Gugger authored Apr 13, 2021
  
09 Apr, 2021 2 commits

Reactivate Megatron tests and use fewer workers · 26212c14
Sylvain Gugger authored Apr 09, 2021


Add a special tokenizer for CPM model (#11068) · fb41f9f5

Kevin Canwen Xu authored Apr 10, 2021



* Add a special tokenizer for CPM model

* make style

* fix

* Add docs

* styles

* cpm doc

* fix ci

* fix the overview

* add test

* make style

* typo

* Custom tokenizer flag

* Add README.md
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>


08 Apr, 2021 1 commit
- [setup] extras[docs] must include 'all' (#11148) · 97ccf67b
  Stas Bekman authored Apr 08, 2021
```
* extras[doc] must include 'all'

* fix

* better

* regroup
```
06 Apr, 2021 1 commit
- Development on v4.6.0dev0 · 9853c5dd
  Lysandre authored Apr 06, 2021
  
05 Apr, 2021 1 commit
- Add a script to check inits are consistent (#11024) · b0d49fd5
  Sylvain Gugger authored Apr 04, 2021
  
01 Apr, 2021 1 commit

Add Vision Transformer and ViTFeatureExtractor (#10950) · 30677dc7

NielsRogge authored Apr 01, 2021



* Squash all commits into one

* Update ViTFeatureExtractor to use image_utils instead of torchvision

* Remove torchvision and add Pillow

* Small docs improvement

* Address most comments by @sgugger

* Fix tests

* Clean up conversion script

* Pooler first draft

* Fix quality

* Improve conversion script

* Make style and quality

* Make fix-copies

* Minor docs improvements

* Should use fix-copies instead of manual handling

* Revert "Should use fix-copies instead of manual handling"

This reverts commit fd4e591bce4496d41406425c82606a8fdaf8a50b.

* Place ViT in alphabetical order
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


31 Mar, 2021 1 commit

Add more metadata to the user agent (#10972) · d0b3797a

Sylvain Gugger authored Mar 31, 2021

* Add more metadata to the user agent

* Fix typo

* Use DISABLE_TELEMETRY

* Address review comments

* Use global env

* Add clean envs on circle CI


23 Mar, 2021 1 commit
- Update stable docs · 3f48b2bc
  Lysandre authored Mar 23, 2021
  
19 Mar, 2021 1 commit

Sort init import (#10801) · 21e86f99

Sylvain Gugger authored Mar 19, 2021



* Initial script

* Add script to properly sort imports in init.

* Add to the CI

* Update utils/custom_init_isort.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Separate scripts that change content from quality

* Move class_mapping_update to style_checks
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


18 Mar, 2021 1 commit
- Document v4.4.2 · dcebe254
  Sylvain Gugger authored Mar 18, 2021
  
16 Mar, 2021 3 commits
- Docs for v4.4.1 · 73fe4089
  Lysandre authored Mar 16, 2021
  
- Development on v4.5.0dev0 · 1b5ce1e6
  Lysandre authored Mar 16, 2021
  
- Flax testing should not run the full torch test suite (#10725) · 9f8619c6
  Patrick von Platen authored Mar 16, 2021
```
* make flax tests pytorch independent

* fix typo

* finish

* improve circle ci

* fix return tensors

* correct flax test

* re-add sentencepiece

* last tokenizer fixes

* finish maybe now
```
10 Mar, 2021 1 commit

Speech2TextTransformer (#10175) · d26b37e7

Suraj Patil authored Mar 10, 2021



* s2t

* fix config

* conversion script

* fix import

* add tokenizer

* fix tok init

* fix tokenizer

* first version working

* fix embeds

* fix lm head

* remove extra heads

* fix convert script

* handle encoder attn mask

* style

* better enc attn mask

* override _prepare_attention_mask_for_generation

* handle attn_mask in encoder and decoder

* input_ids => input_features

* enable use_cache

* remove old code

* expand embeddings if needed

* remove logits bias

* masked_lm_loss => loss

* hack tokenizer to support feature processing

* fix model_input_names

* style

* fix error message

* doc

* remove inputs_embeds

* remove input_embeds

* remove unnecessary docstring

* quality

* SpeechToText => Speech2Text

* style

* remove shared_embeds

* subsample => conv

* remove Speech2TextTransformerDecoderWrapper

* update output_lengths formula

* fix table

* remove max_position_embeddings

* update conversion scripts

* add possibility to do upper case for now

* add FeatureExtractor and Processor

* add tests for extractor

* require_torch_audio => require_torchaudio

* add processor test

* update import

* remove classification head

* attention mask is now 1D

* update docstrings

* attention mask should be of type long

* handle attention mask from generate

* always return attention_mask

* fix test

* style

* doc

* Speech2TextTransformer => Speech2Text

* Speech2TextTransformerConfig => Speech2TextConfig

* remove dummy_inputs

* nit

* style

* multilingual tok

* fix tokenizer

* add tgt_lang setter

* save lang_codes

* fix tokenizer

* add forced_bos_token_id to tokenizer

* apply review suggestions

* add torchaudio to extra deps

* add speech deps to CI

* fix dep

* add libsndfile to ci

* libsndfile1

* add speech to extras all

* libsndfile1 -> libsndfile1

* libsndfile

* libsndfile1-dev

* apt update

* add sudo to install

* update deps table

* install libsndfile1-dev on CI

* tuple to list

* init conv layer

* add model tests

* quality

* add integration tests

* skip_special_tokens

* add speech_to_text_transformer in toctree

* fix tokenizer

* fix fp16 tests

* add tokenizer tests

* fix copyright

* input_values => input_features

* doc

* add model in readme

* doc

* change checkpoint names

* fix copyright

* fix code example

* add max_model_input_sizes in tokenizer

* fix integration tests

* add do_lower_case to tokenizer

* remove clamp trick

* fix "Add modeling imports here"

* fix copyrights

* fix tests

* SpeechToTextTransformer => SpeechToText

* fix naming

* fix table formatting

* fix typo

* style

* fix typos

* remove speech dep from extras[testing]

* fix copies

* rename doc file

* put imports under is_torch_available

* run feat extract tests when torch is available

* dummy objects for processor and extractor

* fix imports in tests

* fix import in modeling test

* fix imports

* fix torch import

* fix imports again

* fix positional embeddings

* fix typo in import

* adapt new extractor refactor

* style

* fix torchscript test

* doc

* doc

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix docs, copied from, style

* fix docstring

* handle imports

* remove speech from all extra deps

* remove s2t from seq2seq lm mapping

* better names

* skip training tests

* add install instructions

* List => Tuple

* doc

* fix conversion script

* fix urls

* add instruction for libsndfile

* fix fp16 test
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


05 Mar, 2021 3 commits
- Fix embeddings for PyTorch 1.8 (#10549) · 7da995c0
  Sylvain Gugger authored Mar 05, 2021
```
* Fix embeddings for PyTorch 1.8

* Try with PyTorch 1.8.0

* Fix embeddings init

* Fix copies

* Typo

* More typos
```
- Pin torch to 1.7.1 in tests while we resolve issues · dc9aaa38
  Lysandre authored Mar 05, 2021
  
- Update scatter to use torch 1.8.0 · 093b88f4
  Lysandre authored Mar 05, 2021
  
24 Feb, 2021 1 commit
- v4.3.3 docs · 35918443
  Lysandre authored Feb 24, 2021
  
10 Feb, 2021 1 commit
- [CI] build docs faster (#10115) · d478257d
  Stas Bekman authored Feb 10, 2021
```
I assume the CI machine should have at least 4 cores, so let's build docs faster
```
09 Feb, 2021 3 commits
- Add patch releases to the doc · 0c3d23df
  Sylvain Gugger authored Feb 09, 2021
  
- Docs for v4.3.1 release · bf1a06a4
  Lysandre authored Feb 09, 2021
  
- Fix deployment script · ba542ffb
  Lysandre authored Feb 09, 2021
  
08 Feb, 2021 2 commits
- Docs for v4.3.0 · 0dd579c9
  Lysandre authored Feb 08, 2021
  
- A few fixes in the documentation (#10033) · 45aaf5f7
  Sylvain Gugger authored Feb 08, 2021
  
05 Feb, 2021 2 commits
- Update doc deployment script path · ad2c4310
  Lysandre authored Feb 05, 2021
  
- Update doc deployment script · 95a5f271
  Lysandre authored Feb 05, 2021
  
04 Feb, 2021 1 commit

Update doc for pre-release (#10014) · 3be965c5

Sylvain Gugger authored Feb 04, 2021

* Update doc for pre-release

* Use stable as default

* Use the right commit :facepalms:


21 Jan, 2021 1 commit
- Temporarily deactivate TPU tests while we work on fixing them (#9720) · 910aa896
  Lysandre Debut authored Jan 21, 2021
  
13 Jan, 2021 1 commit
- v4.2.0 documentation · 33a8497d
  Lysandre authored Jan 13, 2021
  
17 Dec, 2020 2 commits
- v4.1.1 docs · bd40345d
  Lysandre authored Dec 17, 2020
  
- v4.1.0 docs · f83d9c8d
  Lysandre authored Dec 17, 2020
  
15 Dec, 2020 1 commit

[WIP] Tapas v4 (tres) (#9117) · 1551e2dc

NielsRogge authored Dec 15, 2020



* First commit: adding all files from tapas_v3

* Fix multiple bugs including soft dependency and new structure of the library

* Improve testing by adding torch_device to inputs and adding dependency on scatter

* Use Python 3 inheritance rather than Python 2

* First draft model cards of base sized models

* Remove model cards as they are already on the hub

* Fix multiple bugs with integration tests

* All model integration tests pass

* Remove print statement

* Add test for convert_logits_to_predictions method of TapasTokenizer

* Incorporate suggestions by Google authors

* Fix remaining tests

* Change position embeddings sizes to 512 instead of 1024

* Comment out positional embedding sizes

* Update PRETRAINED_VOCAB_FILES_MAP and PRETRAINED_POSITIONAL_EMBEDDINGS_SIZES

* Added more model names

* Fix truncation when no max length is specified

* Disable torchscript test

* Make style & make quality

* Quality

* Address CI needs

* Test the Masked LM model

* Fix the masked LM model

* Truncate when overflowing

* More much needed docs improvements

* Fix some URLs

* Some more docs improvements

* Test PyTorch scatter

* Set to slow + minify

* Calm flake8 down

* Add add_pooling_layer argument to TapasModel

Fix comments by @sgugger and @patrickvonplaten

* Fix issue in docs + fix style and quality

* Clean up conversion script and add task parameter to TapasConfig

* Revert the task parameter of TapasConfig

Some minor fixes

* Improve conversion script and add test for absolute position embeddings

* Fix bug with reset_position_index_per_cell arg of the conversion cli

* Add notebooks to the examples directory and fix style and quality

* Apply suggestions from code review

* Move from `nielsr/` to `google/` namespace

* Apply Sylvain's comments
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Rogge Niels <niels.rogge@howest.be>
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


11 Dec, 2020 1 commit
- Remove docs only check (#9065) · 91fa7072
  Lysandre Debut authored Dec 11, 2020
  