Commits · ec15da2445a4fe2f72774e382e39468e4aaa2111 · chenpangpang / transformers

14 Feb, 2022 3 commits

Report only the failed imports in `requires_backends` (#15636) · ec15da24
Toni Kukurin authored Feb 14, 2022

Fix a bug that ignores max_seq_len in preprocess (#15238) · 2b8599b2
Zhen Wang authored Feb 14, 2022


[Fix doc example] FlaxVisionEncoderDecoder (#15626) · f52746d0

Yih-Dar authored Feb 14, 2022



* Fix wrong checkpoint name: vit

* Fix missing import

* Fix more missing import

* make style

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>


11 Feb, 2022 12 commits
- Add push to hub to feature extractor (#15632) · 52d2e6f6
  Sylvain Gugger authored Feb 11, 2022
```
* Add push to hub to feature extractor

* Quality

* Clean up
```
- Fix grammar in tokenizer_summary (#15614) · 4f403ea8
  Daniel Erenrich authored Feb 11, 2022
```
"to make ensure" is redundant.
```
- Custom feature extractor (#15630) · 7a32e472
  Sylvain Gugger authored Feb 11, 2022
```
* Rework AutoFeatureExtractor.from_pretrained internal

* Custom feature extractor

* Add more tests

* Add support for custom feature extractor code

* Clean up
```
- [research_projects] deal with security alerts (#15594) · fcb0f743
  Stas Bekman authored Feb 11, 2022
```
* [research_projects] deal with security alerts

* add a note of the original PL ver and warning
```
- [deepspeed docs] misc additions (#15585) · f15c99fa
  Stas Bekman authored Feb 11, 2022
```
* [deepspeed docs] round_robin_gradients

* training and/or eval/predict loss is

* Update docs/source/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
- Fix _configuration_file argument getting passed to model (#15629) · 2dce350b
  Sylvain Gugger authored Feb 11, 2022
  
- 🖍 remove broken link (#15615) · 85aee09e
  Steven Liu authored Feb 11, 2022
  
- TF MT5 embeddings resize (#15567) · 2f40c728
  Joao Gante authored Feb 11, 2022
```
* Fix TF MT5 vocab resize

* more assertive testing
```
- Rebase (#15606) · 8c03df10
  Mishig Davaadorj authored Feb 11, 2022
  
- TF: Add informative warning for inexistent CPU backprop ops (#15612) · 3fae83d2
  Joao Gante authored Feb 11, 2022
```
* Add informative warning
```
- Enable ONNX export when PyTorch and TensorFlow installed in the same environment (#15625) · 7e4844fc
  lewtun authored Feb 11, 2022
  
- Mark "code in the Hub" API as experimental (#15624) · 6cf06d19
  Sylvain Gugger authored Feb 11, 2022
  
10 Feb, 2022 9 commits

[Generate] Small refactor (#15611) · 45c7b5b1
Patrick von Platen authored Feb 10, 2022

Correct JSON format (#15600) · c0864d98
Ngo Quang Huy authored Feb 11, 2022

Add local and TensorFlow ONNX export examples to docs (#15604) · 2e8b85f7
lewtun authored Feb 10, 2022
```
* Add local and TensorFlow ONNX export examples to docs

* Use PyTorch - TensorFlow split
```
Fix Seq2SeqTrainer (#15603) · 3a2ed967
NielsRogge authored Feb 10, 2022
```
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
```

Compute loss independent from decoder for TF EncDec models (as #14139) (#15175) · 724e51c6

Yih-Dar authored Feb 10, 2022



* Compute loss independent from decoder (as 14139)

* fix expected seq_len + style

* Apply the same change to TFVisionEncoderDecoderModel

* fix style

* Add case with labels in equivalence test

* uncomment

* Add case with labels in equivalence test

* add decoder_token_labels

* use hf_compute_loss

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add copied from
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>


Add example batch size to all commands (#15596) · 3d5dea9b
Patrick von Platen authored Feb 10, 2022


Add Tensorflow handling of ONNX conversion (#13831) · cb7ed6e0

Alberto Bégué authored Feb 10, 2022



* Add TensorFlow support for ONNX export

* Change documentation to mention conversion with Tensorflow

* Refactor export into export_pytorch and export_tensorflow

* Check model's type instead of framework installation to choose between TF and Pytorch
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Alberto Bégué <alberto.begue@della.ai>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>


Reformat tokenization_fnet · e923917c
Lysandre authored Feb 09, 2022

Make slow tests slow · 644ec052
Sylvain Gugger authored Feb 09, 2022


09 Feb, 2022 16 commits

Expand tutorial for custom models (#15587) · c722753a

Sylvain Gugger authored Feb 09, 2022



* Expand tutorial for custom models

* Style

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>


Add link (#15588) · a86ee226

NielsRogge authored Feb 09, 2022


Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>


[trainer docs] document how to select specific gpus (#15551) · dee17d56
Stas Bekman authored Feb 09, 2022
```
* [trainer docs] document how to select specific gpus

* expand

* add urls

* add accelerate launcher
```
update serving_output for some TF models (#15568) · 25848086
Yih-Dar authored Feb 09, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
Fix tests hub failure (#15580) · 315e6740
Sylvain Gugger authored Feb 09, 2022
```
* Expose hub test problem

* Fix tests
```
Fix quality · b1ba03e0
Sylvain Gugger authored Feb 09, 2022

Trigger doc build · eed3186b
Sylvain Gugger authored Feb 09, 2022


Constrained Beam Search [without disjunctive decoding] (#15416) · 2b5603f6

Chan Woo Kim authored Feb 10, 2022



* added classes to get started with constrained beam search

* in progress, think i can directly force tokens now but not yet with the round robin

* think now i have total control, now need to code the bank selection

* technically works as desired, need to optimize and fix design choices leading to undesirable outputs

* complete PR #1 without disjunctive decoding

* removed incorrect tests

* Delete k.txt

* Delete test.py

* Delete test.sh

* revert changes to test scripts

* genutils

* full implementation with testing, no disjunctive yet

* shifted docs

* passing all tests realistically ran locally

* removing accidentally included print statements

* fixed source of error in initial PR test

* fixing the get_device() vs device trap

* fixed documentation docstrings about constrained_beam_search

* fixed tests failing for Speech2TextModel's floating point inputs

* fix cuda long tensor

* added examples and testing for them and found & fixed a bug in beam_search and constrained_beam_search

* deleted accidentally added test halting code with assert False

* code reformat

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py

* fixing based on comments on PR

* took out the testing code that should work but fails without the beam search modification; style changes

* fixing comments issues

* docstrings for ConstraintListState

* typo in PhrsalConstraint docstring

* docstrings improvements
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


Add implementation of typical sampling (#15504) · 0113aae5

Clara Meister authored Feb 09, 2022

* typical decoding

* changing arg name

* add test config params

* forgotten arg rename

* fix edge case where scores are same

* test for typical logits warper

* code quality fixes


[Flax tests/FlaxBert] make from_pretrained test faster (#15561) · f588cf40
Suraj Patil authored Feb 09, 2022

Upgrade click version (#15579) · 70292409
Lysandre Debut authored Feb 09, 2022

Add Wav2Vec2 Adapter Weights to Flax (#15566) · 9e00566b
Sanchit Gandhi authored Feb 09, 2022
```
* Add Wav2Vec2 Adapter Weights to Flax

* Suggested changes
```
Make sure custom configs work with Transformers (#15569) · 1f60bc46
Sylvain Gugger authored Feb 09, 2022
```
* Make sure custom configs work with Transformers

* Apply code review suggestions
```
Upgrade black to version ~=22.0 (#15565) · 7732d0fe
Lysandre Debut authored Feb 09, 2022
```
* Upgrade black to version ~=22.0

* Check copies

* Fix code
```

add model scaling section (#15119) · d923f762

Leandro von Werra authored Feb 09, 2022



* add model scaling section

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* integrate reviewer feedback

* initialize GPU properly

* add note about BnB optimizer

* move doc from `scaling.mdx` to `performance.mdx`

* integrate reviewer feedback

* revert section levels
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


PoC for a ProcessorMixin class (#15549) · b5c6fdec

Sylvain Gugger authored Feb 09, 2022



* PoC for a ProcessorMixin class

* Documentation

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Roll out to other processors

* Add base feature extractor class in init

* Use args and kwargs
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
