- 25 Mar, 2021 1 commit
-
-
Amir Tahmasbi authored
* Added embeddings layer
* Added LayoutLM layers, main model, masked LM, and token classification classes
* Added model classes to TF auto models
* Added model to PT-to-TF conversion script
* Added model to doc README
* Added tests
* Removed unused imports
* Added LayoutLM model, test, and doc for sequence classification, and fixed imports in __init__.py
* Made tests pass!
* Fixed typos in imports and docs
* Fixed a typo in embeddings layer
* Fixed formatting issues, imports, tests
* Removed duplicate imports from main __init__.py
* Changed default arg to True for adding pooling layer to TF LayoutLM
* Style
* Added "copied from" to classes copied from BERT
* Fixed docstring examples to work with LayoutLM inputs
* Removed PyTorch reference in docstring example
* Added integration tests
* Cleaned up initialization file
* Updated model checkpoint identifiers
* Fixed imports

Co-authored-by: Amir Tahmasbi <amir@ehsai.ca>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
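A minimal usage sketch of the new TF classes, assuming the public microsoft/layoutlm-base-uncased checkpoint; the words and bounding boxes are made up for illustration (LayoutLM expects boxes normalized to a 0-1000 scale):

```python
import tensorflow as tf
from transformers import LayoutLMTokenizer, TFLayoutLMForSequenceClassification

tokenizer = LayoutLMTokenizer.from_pretrained("microsoft/layoutlm-base-uncased")
model = TFLayoutLMForSequenceClassification.from_pretrained("microsoft/layoutlm-base-uncased")

words = ["Hello", "world"]
# One normalized (0-1000) box per word; values here are illustrative.
word_boxes = [[637, 773, 693, 782], [698, 773, 733, 782]]

encoding = tokenizer(" ".join(words), return_tensors="tf")
# Both words stay single wordpieces here, so we only add [CLS]/[SEP] boxes.
token_boxes = [[0, 0, 0, 0]] + word_boxes + [[1000, 1000, 1000, 1000]]
bbox = tf.convert_to_tensor([token_boxes])

outputs = model(
    input_ids=encoding["input_ids"],
    bbox=bbox,
    attention_mask=encoding["attention_mask"],
)
print(outputs.logits.shape)  # (1, num_labels)
```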
-
- 24 Mar, 2021 4 commits
-
-
Sidd Karamcheti authored
-
Sylvain Gugger authored
* Remove version warning in pretrained BART models
* Put it at the base model
-
Lysandre Debut authored
* Removes overflowing bad word IDs
* Raise warning
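The idea behind the fix, sketched with illustrative names (not the exact generation code): drop any banned sequence that references a token id outside the vocabulary, and warn instead of indexing out of bounds.

```python
import logging

logger = logging.getLogger(__name__)

def filter_bad_words_ids(bad_words_ids, vocab_size):
    """Keep only banned sequences whose token ids all fit in the vocabulary."""
    kept = [seq for seq in bad_words_ids if all(0 <= t < vocab_size for t in seq)]
    if len(kept) < len(bad_words_ids):
        logger.warning(
            "Removed %d bad_words_ids entries containing token ids >= vocab_size (%d).",
            len(bad_words_ids) - len(kept), vocab_size,
        )
    return kept
```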
-
imzhengzx authored
The original code on line 246 is
```
tokenizer: Optional["PreTrainedTokenizerBase"] = None,
```
but it should be
```
tokenizer: Optional[PreTrainedTokenizerBase] = None,
```
-
- 23 Mar, 2021 7 commits
-
-
Philipp Schmid authored
* Rewrote is_sagemaker_model_parallel_available
* Added is_sagemaker_model_parallel_available to SageMakerTrainer
* Removed unnecessary mp_parameters from TrainingArguments
* Make style happy
* Added mp_parameters again to parse MP-specific args
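A rough sketch of what such a check can look like; the SM_HP_MP_PARAMETERS variable and the "partitions" key are assumptions based on how SageMaker passes model-parallel options to the container:

```python
import importlib.util
import json
import os

def is_sagemaker_model_parallel_available() -> bool:
    # SageMaker hands model-parallel settings to the container as a JSON blob.
    raw = os.getenv("SM_HP_MP_PARAMETERS", "{}")
    try:
        if "partitions" not in json.loads(raw):
            return False
    except json.JSONDecodeError:
        return False
    # smdistributed is only installed on SageMaker model-parallel images.
    return importlib.util.find_spec("smdistributed") is not None
```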
-
RafaelWO authored
-
Stas Bekman authored
* import refactor
* fix the fallback
-
Marta Maślankowska authored
-
Bhadresh Savani authored
-
Stas Bekman authored
-
Sylvain Gugger authored
-
- 22 Mar, 2021 5 commits
-
-
Patrick von Platen authored
* push
* finish
* make fix copies
* change name
-
Ruan Chaves authored
* Modify the _hp_search_setup method on the Trainer class to handle the wandb argument passed by Ray Tune to model config.
* Reformat single quotes as double quotes.
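A simplified illustration of the fix (not the exact Trainer code): Ray Tune injects a `wandb` entry into the trial config for its logging mixin, and that key must be skipped rather than copied onto the model config.

```python
def apply_trial_params(target, params):
    """Copy trial hyperparameters onto a config-like object, ignoring Ray
    Tune's internal 'wandb' entry."""
    for key, value in params.items():
        if key == "wandb":
            continue  # Ray Tune logging config, not a model hyperparameter
        setattr(target, key, value)
```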
-
Boris Dayma authored
* feat: ensure unique artifact id
* feat: allow manual init
* fix: simplify reinit logic
* fix: no dropped value + immediate commits
* fix: wandb use in sagemaker
* docs: improve documentation and formatting
* fix: typos
* docs: improve formatting
-
Sidd Karamcheti authored
Add a simple one-character fix so that on_step_begin and on_step_end are called at the right times (#10839)
-
Sebastian Olsson authored
-
- 19 Mar, 2021 3 commits
-
-
Sylvain Gugger authored
* Initial script
* Add script to properly sort imports in init.
* Add to the CI
* Update utils/custom_init_isort.py
* Separate scripts that change content from quality
* Move class_mapping_update to style_checks

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Philipp Schmid authored
* add uuid.hex to user_agent
* add log
* changed order of it
* renamed as session id
* renamed variable
* reverted naming of the const
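A minimal sketch of the end result described here (the constant and function names are illustrative): one random hex id per Python session, appended to the HTTP user-agent.

```python
import uuid

SESSION_ID = uuid.uuid4().hex  # generated once per Python session

def http_user_agent(base: str) -> str:
    return f"{base}; session_id/{SESSION_ID}"

# e.g. http_user_agent("transformers/4.5.0")
# -> "transformers/4.5.0; session_id/3f2a..."
```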
-
Théo Matussière authored
* fix backend tokenizer args override: key mismatch
* no touching the docs
* fix mpnet
* add mpnet to test
* fix test

Co-authored-by: theo <theo@matussie.re>
-
- 18 Mar, 2021 7 commits
-
-
Sylvain Gugger authored
* Fix distributed evaluation
* Use logger
-
Vimarsh Chaturvedi authored
* Added check to ensure the model name passed to from_pretrained and the model are the same
* Added test to check from_pretrained throws an assert error when passed an incompatible model name
* Modified assert in from_pretrained with f-strings; modified test to ensure the desired assert message is being generated
* Added check to ensure config and model have model_type
* Fix FlauBERT heads

Co-authored-by: vimarsh chaturvedi <vimarsh chaturvedi>
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
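A hedged sketch of the kind of check described above (not the exact from_pretrained code): compare the model_type stored in the loaded config with the one the model class expects, and complain on mismatch.

```python
import logging

logger = logging.getLogger(__name__)

def check_model_type(config, model_cls):
    """Warn when a config's model_type doesn't match the model class."""
    expected = model_cls.config_class.model_type
    if config.model_type != expected:
        logger.warning(
            f"You are instantiating a model of type {expected} from a config "
            f"of type {config.model_type}; this may yield errors."
        )
```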
-
Julien Chaumond authored
* do not gobble certain kinds of requests.ConnectionError
* Apply review comments

Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
-
James Thomin authored
This commit fixes a bug in the LengthGroupedSampler where, if model_input_name is not set, the default value is None instead of "input_ids".
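The gist of the one-line fix, sketched outside the real class: default the field instead of storing None.

```python
class LengthGroupedSamplerSketch:
    """Illustrative stand-in for the real sampler's constructor."""

    def __init__(self, lengths, batch_size, model_input_name=None):
        self.lengths = lengths
        self.batch_size = batch_size
        # Before the fix the attribute could silently stay None:
        self.model_input_name = model_input_name if model_input_name is not None else "input_ids"
```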
-
Mohamed El-Geish authored
* wav2vec2: support datasets other than LibriSpeech
* Formatting run_asr.py to pass code quality test
* bundled orthography options and added verbose logs
* fixing a typo in timit fine-tuning script
* update comment for clarity
* resize_lm_head and load custom vocab from file
* adding a max_duration_in_seconds filter
* do not assign `duration_filter` lambda, use a def
* log untransliterated text as well
* fix base model for arabic
* fix duration filter when target_sr is not set
* drop duration_in_seconds when unneeded
* script for wav2vec2-large-lv60-timit-asr
* fix for "tha" in arabic corpus (huggingface#10581)
* adding more options to work with common_voice
* PR feedback (huggingface#10581)
* small README change
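A sketch of the max_duration_in_seconds filter under assumed column names (`speech` holding 16 kHz audio samples); note the named def instead of a lambda, and the temporary column dropped when no longer needed, as the bullets describe.

```python
def add_duration(batch, sampling_rate=16_000):
    batch["duration_in_seconds"] = len(batch["speech"]) / sampling_rate
    return batch

def filter_by_duration(dataset, max_duration_in_seconds=20.0):
    dataset = dataset.map(add_duration)

    def duration_filter(example):  # a def, not a lambda, so it caches cleanly
        return example["duration_in_seconds"] <= max_duration_in_seconds

    dataset = dataset.filter(duration_filter)
    return dataset.remove_columns(["duration_in_seconds"])  # drop when unneeded
```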
-
Patrick von Platen authored
* Create modeling_flax_electra with code copied from modeling_flax_bert
* Add ElectraForMaskedLM and ElectraForPretraining
* Add modeling test for Flax Electra and fix naming and arg in Flax Electra model
* Add documentation
* Fix code style
* Fix code quality
* Adjust tol in assert_almost_equal due to very small difference between model outputs, ranging 0.0010 - 0.0016
* Remove redundant ElectraPooler
* save intermediate
* adapt
* correct bert flax design
* adapt roberta as well
* finish roberta flax
* finish
* apply suggestions

Co-authored-by: Chris Nguyen <anhtu2687@gmail.com>
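A minimal usage sketch for the new Flax classes, assuming the public google/electra-small-generator checkpoint:

```python
from transformers import ElectraTokenizerFast, FlaxElectraForMaskedLM

tokenizer = ElectraTokenizerFast.from_pretrained("google/electra-small-generator")
model = FlaxElectraForMaskedLM.from_pretrained("google/electra-small-generator")

inputs = tokenizer("The quick brown [MASK] jumps over the lazy dog.", return_tensors="np")
logits = model(**inputs).logits  # JAX arrays, no torch involved
```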
-
Funtowicz Morgan authored
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
-
- 17 Mar, 2021 6 commits
-
-
Mansi Mane authored
* Added debug prints
* Added config
* Added extra samples to SequentialDistributedSampler; updated the SequentialDistributedSampler call
* Removed extra prints
* Making predictions and labels a multiple of batch size
* Updated number of microbatches
* Made start_remainder similar to DistributedSamplerWithLoop
* Minor spacing update
* Test and styling
* Rename test

Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
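A simplified sketch of the padding logic these commits describe: extend the index list so every process receives the same number of samples and that number is a round multiple of the batch size.

```python
import math

def pad_indices(indices, num_replicas, batch_size):
    per_replica = math.ceil(len(indices) / (num_replicas * batch_size)) * batch_size
    total_size = per_replica * num_replicas
    # Recycle indices from the front to reach the padded length; callers then
    # truncate predictions back to the true dataset size after gathering.
    return indices + indices[: total_size - len(indices)]
```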
-
Sylvain Gugger authored
* Apply black before checking copies
* Fix for class methods
* Deal with lonely brackets
* Remove debug and add forward changes
* Separate copies and fix test
* Add black as a test dependency
-
Stas Bekman authored
-
Stas Bekman authored
* deepspeed checkpoint loading code plus tests
* style
-
Stas Bekman authored
-
Sylvain Gugger authored
-
- 16 Mar, 2021 7 commits
-
-
Cheng Li authored
* pass hf optimizer and scheduler to deepspeed if not specified in ds config
* update
* make init_deepspeed support config dict
* fix docstring formatting
* clean up trainer's comments
* add new tests
* fix type
* composite argparse doesn't work
* style
* add a new test, rename others
* document new functionality
* complete tests, add docs
* correct level
* Apply suggestions from code review
* add new methods to the doc
* must tell DS we are using a non-native optimizer
* add protection against cpu_offload + HF optimizer combo
* fix the cli overrides
* sync docs + tests
* restore AdamW
* better docs
* need new version
* no longer needed
* remove outdated information
* refactor duplicated code

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
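A condensed sketch of the behavior (the call shape is illustrative, not the trainer's exact code): only build a client-side optimizer when the DeepSpeed config doesn't define one, and guard against the cpu_offload combination the bullets mention.

```python
import deepspeed
import torch

def init_deepspeed(model, ds_config, learning_rate=5e-5):
    optimizer = None
    if "optimizer" not in ds_config:
        if ds_config.get("zero_optimization", {}).get("cpu_offload"):
            # ZeRO-Offload needs DeepSpeed's own CPU-capable optimizer.
            raise ValueError("cpu_offload requires an optimizer in the DS config")
        optimizer = torch.optim.AdamW(model.parameters(), lr=learning_rate)
    engine, optimizer, _, lr_scheduler = deepspeed.initialize(
        model=model,
        model_parameters=model.parameters(),
        optimizer=optimizer,
        config_params=ds_config,
    )
    return engine, optimizer, lr_scheduler
```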
-
Lysandre Debut authored
* Patches full import failure when sentencepiece is not installed
* Dummies :)
-
Lysandre Debut authored
* Patches the full import failure and adds a test
* Add comment
-
Lysandre authored
-
Lysandre authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Add DistributedSamplerWithLoop
* Fix typo
* Test and small fix
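A simplified sketch of the looping idea (the real implementation also accounts for samples DistributedSampler already duplicated across ranks): after the usual distributed split, top each shard up with indices from its own start so its length divides evenly by the batch size.

```python
from torch.utils.data.distributed import DistributedSampler

class DistributedSamplerWithLoopSketch(DistributedSampler):
    def __init__(self, dataset, batch_size, **kwargs):
        super().__init__(dataset, **kwargs)
        self.batch_size = batch_size

    def __iter__(self):
        indices = list(super().__iter__())
        remainder = (-len(indices)) % self.batch_size
        indices += indices[:remainder]  # loop back to the beginning
        return iter(indices)
```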
-