Commits · f394a2a50d8729cd1ca9b368e330ec50664c3292 · chenpangpang / transformers

31 May, 2022 10 commits

[Json configs] Make json prettier for all saved tokenizer files & ensure same... · f394a2a5

Patrick von Platen authored May 31, 2022

[Json configs] Make json prettier for all saved tokenizer files & ensure same json format for all processors (tok + feat_extract) (#17457)

* [Json dump] Make json prettier

* correct more tokenizers

* more patterns

* add aggressive test

* the aggressive test was actually useful :-)

* more tests

* Apply suggestions from code review
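
As an illustration of the kind of uniform, human-readable JSON layout this commit aims for, here is a minimal sketch. The two-space indentation, sorted keys, and trailing newline are assumptions, as are the helper name `to_pretty_json` and the sample configs; none of them are taken from the commit itself.

```python
import json


def to_pretty_json(config: dict) -> str:
    """Serialize a config dict in one uniform, readable layout.

    The indent/sort_keys/ensure_ascii choices are illustrative assumptions.
    """
    return json.dumps(config, indent=2, sort_keys=True, ensure_ascii=False) + "\n"


# Hypothetical configs for a tokenizer and a feature extractor.
tokenizer_config = {"model_max_length": 512, "do_lower_case": True}
feature_extractor_config = {"sampling_rate": 16000, "do_normalize": True}

# Both processor types go through the same helper, so the files share one format.
pretty_tok = to_pretty_json(tokenizer_config)
pretty_feat = to_pretty_json(feature_extractor_config)
```

Routing every processor through a single helper is what keeps the saved files consistent with each other, and sorted keys keep diffs between saved files small.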

f394a2a5

Accumulate tokens into batches in `PreTrainedTokenizerBase.add_tokens()` (#17119) · 6ee1474b

Vít Novotný authored May 31, 2022

* Accumulate tokens into batches in PreTrainedTokenizerBase.add_tokens()

For tokenizers with a small number of special tokens or special tokens
with consecutive token IDs, this reduces the time complexity of creating
the trie from quadratic to linear, see also #16936.

* Extend explanation of batching added tokens
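
The batching idea described above might be sketched as follows. This is not the library's code: `add_tokens_in_batches`, the `(token, is_special)` input shape, and the grouping rule (consecutive tokens with the same flag share a batch) are assumptions made for illustration. The point is that the expensive step, here the `add_tokens` callback, runs once per batch instead of once per token.

```python
from typing import Callable, Iterable, List, Optional, Tuple


def add_tokens_in_batches(
    tokens: Iterable[Tuple[str, bool]],
    add_tokens: Callable[[List[str], bool], None],
) -> int:
    """Call `add_tokens` once per run of consecutive tokens sharing a flag.

    Returns the number of calls made, to make the saving visible.
    """
    calls = 0
    batch: List[str] = []
    batch_is_special: Optional[bool] = None
    for token, is_special in tokens:
        # A change of flag closes the current batch.
        if batch and is_special != batch_is_special:
            add_tokens(batch, batch_is_special)
            calls += 1
            batch = []
        batch.append(token)
        batch_is_special = is_special
    # Flush whatever is left after the loop.
    if batch:
        add_tokens(batch, batch_is_special)
        calls += 1
    return calls


recorded = []
n_calls = add_tokens_in_batches(
    [("<a>", True), ("<b>", True), ("foo", False), ("<c>", True)],
    lambda batch, special: recorded.append((list(batch), special)),
)
```

With four tokens the callback runs three times here. When all tokens share one flag it runs once, which is the case where the per-token cost disappears.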

6ee1474b

Fix checkpoint name (#17484) · 8f8b3cbc
Yih-Dar authored May 31, 2022

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

8f8b3cbc

Added XLM onnx config (#17030) · 5af38953

Ritik Nandwal authored May 31, 2022

* Add onnx configuration for xlm

* Add supported features for xlm

* Add xlm to models exportable with onnx

* Add xlm architecture to test file

* Modify docs

* Make code quality fixes

5af38953

Disk offload fix (#17428) · 567d9c06
Sylvain Gugger authored May 31, 2022

* Fix offload to disk for big models

* Add test

* Fix test for other models

567d9c06

TF: GPT-2 generation supports left-padding (#17426) · 975dd2bb

Joao Gante authored May 31, 2022

* TF GPT-2 now properly works with left padding

* throw a warning when eos token == pad token and there is no attention mask
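
The situation behind the warning can be sketched in plain Python. This is a hypothetical helper, not the TensorFlow code from the commit: `infer_attention_mask` and its fallback of an all-ones mask are assumptions. It shows why a missing attention mask is a problem when the padding token and the end-of-sequence token share an ID: padding can no longer be told apart from real tokens.

```python
import warnings
from typing import List, Optional


def infer_attention_mask(
    input_ids: List[List[int]],
    pad_token_id: Optional[int],
    eos_token_id: Optional[int],
    attention_mask: Optional[List[List[int]]] = None,
) -> List[List[int]]:
    """Return the given mask, or infer one from the padding token if possible."""
    if attention_mask is not None:
        return attention_mask
    if pad_token_id is None:
        # No padding token defined: treat every position as a real token.
        return [[1] * len(row) for row in input_ids]
    if pad_token_id == eos_token_id:
        # Padding cannot be distinguished from EOS, so the mask is a guess.
        warnings.warn(
            "pad_token_id equals eos_token_id and no attention mask was given; "
            "padding cannot be detected reliably."
        )
        return [[1] * len(row) for row in input_ids]
    # Mask out padding positions wherever they occur, including on the left.
    return [[0 if tok == pad_token_id else 1 for tok in row] for row in input_ids]


left_padded = [[0, 0, 5, 6], [7, 8, 9, 4]]
mask = infer_attention_mask(left_padded, pad_token_id=0, eos_token_id=2)
```

Passing an explicit attention mask avoids the ambiguity altogether.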

975dd2bb

[Generate] Fix output scores greedy search (#17442) · b0e0ac8a
Patrick von Platen authored May 31, 2022

b0e0ac8a

Fx support for multiple model architectures (#17393) · 28d00482

Michael Benayoun authored May 31, 2022

* Support for Bart and LayoutLM, and partial support for XLNet

* Support for mbart

* A lot of new models supported

* Support for other models

* LayoutLM fix

* Use strings instead of classes

28d00482

typo IBERT in __repr__ quant_mode (#17398) · 04681c1d
Ivan Gonzalez authored May 31, 2022

fix #17397

04681c1d
Fix typo (remove parenthesis) (#17415) · 13fd6734
Michele Conti authored May 31, 2022

13fd6734

26 May, 2022 2 commits
- [OPT] Fix bos token id default (#17441) · 7999ec12
  Patrick von Platen authored May 26, 2022
  
  7999ec12
- Pin protobuf that breaks TensorBoard in PyTorch (#17440) · 7535d92e
  Sylvain Gugger authored May 26, 2022
  
  7535d92e
25 May, 2022 4 commits
- Upd AutoTokenizer.from_pretrained doc examples (#17416) · 35e2d13f
  Cookie_thief authored May 25, 2022
  
  35e2d13f
- Support compilation via Torchdynamo, AOT Autograd, NVFuser (#17308) · 897a8dd8
  Animesh Jain authored May 25, 2022

  * Support compilation via Torchdynamo, AOT Autograd, NVFuser

  * Address comments

  * Lint

  * Stas comments - missing quality test

  * Lintere

  * Quality test

  * Doc lint

  * Reset CUDA peak mem

  * Add CustomTrainer

  * require a single gpu
  Co-authored-by: Stas Bekman <stas@stason.org>

  897a8dd8
- Add test for new model parallelism features (#17401) · 31484afb
  Sylvain Gugger authored May 25, 2022
  
  31484afb
- Make check_init script more robust and clean inits (#17408) · 56b35ce3
  Sylvain Gugger authored May 25, 2022
  
  56b35ce3
24 May, 2022 3 commits

[WIP] Adding GPT-NeoX-20B (#16659) · 71e60272

Jason Phang authored May 24, 2022



* initial

* first try

* working 20B

* 20B tokenizers

* Docs

* Import fixes for missing classes

* Update docs, fixup

* black formatting

* isort

* flake

* dummy objects

* documentation

* Documentation yml

* more docs

* tweaks for tests

* tokenization auto

* fix neox tests

* test

* test

* einsum

* address PR feedback

* Documentation

* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_neox/__init__.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_neox/configuration_gpt_neox.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Remove undefined LaTeX syntax

* Update to full url to avoid confusion about if that's supposed to refer to the Hub

* fix auto

* move tests

* documentation fix

* more doc fixes

* test refactor

* fix import

* fix import

* fix import

* fix import

* fix import

* style fixes

* More modeling fixes
Co-authored-by: Jason Phang <zp489@gr057.hpc.nyu.edu>
Co-authored-by: Stella Biderman <stellabiderman@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

71e60272

Enabling `imageGPT` auto feature extractor. (#16871) · d9809298

Nicolas Patry authored May 24, 2022



* Enabling `imageGPT` auto feature extractor.
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* Small updates.

* Update after rebase to use `input_ids` instead of `pixel_values`.
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d9809298

Add LayoutLMv3 (#17060) · 31ee80d5

NielsRogge authored May 24, 2022



* Make forward pass work

* More improvements

* Remove unused imports

* Remove timm dependency

* Improve loss calculation of token classifier

* Fix most tests

* Add docs

* Add model integration test

* Make all tests pass

* Add LayoutLMv3FeatureExtractor

* Improve integration test + make fixup

* Add example script

* Fix style

* Add LayoutLMv3Processor

* Fix style

* Add option to add visual labels

* Make more tokenizer tests pass

* Fix more tests

* Make more tests pass

* Fix bug and improve docs

* Fix import of processors

* Improve docstrings

* Fix toctree and improve docs

* Fix auto tokenizer

* Move tests to model folder

* Move tests to model folder

* change default behavior add_prefix_space

* add prefix space for fast

* add_prefix_space set to True for Fast

* no space before `unique_no_split` token

* add test to highlight special treatment of added tokens

* fix `test_batch_encode_dynamic_overflowing` by building a long enough example

* fix `test_full_tokenizer` with add_prefix_token

* Fix tokenizer integration test

* Make the code more readable

* Add tests for LayoutLMv3Processor

* Fix style

* Add model to README and update init

* Apply suggestions from code review

* Replace asserts by value errors

* Add suggestion by @ducviet00

* Add model to doc tests

* Simplify script

* Improve README

* a step ahead to fix

* Update pair_input_test

* Make all tokenizer tests pass - phew

* Make style

* Add LayoutLMv3 to CI job

* Fix auto mapping

* Fix CI job name

* Make all processor tests pass

* Make tests of LayoutLMv2 and LayoutXLM consistent

* Add copied from statements to fast tokenizer

* Add copied from statements to slow tokenizer

* Remove add_visual_labels attribute

* Fix tests

* Add link to notebooks

* Improve docs of LayoutLMv3Processor

* Fix reference to section
Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

31ee80d5

23 May, 2022 7 commits

Add support for `device_map="auto"` to OPT (#17382) · 13541b4a
Sylvain Gugger authored May 23, 2022

13541b4a
OPTForCausalLM lm_head input size should be config.word_embed_proj_dim (#17225) · 71cced8a
vfbd authored May 23, 2022

71cced8a

Use Accelerate in `from_pretrained` for big model inference (#17341) · 56f50590

Sylvain Gugger authored May 23, 2022



* Initial work

* More or less finished with first draft

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Fix randomly initialized weights

* Update src/transformers/modeling_utils.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Address review comments

* Rename DeepSpeed folder to temporarily fix the test issue?

* Revert to try if Accelerate fix works

* Use latest Accelerate release

* Quality and fixes

* Style

* Quality

* Add doc

* Test + fix

* More blocks
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

56f50590

Traced models serialization and torchscripting fix (#17206) · 2e7e4280

Michael Benayoun authored May 23, 2022

* Fix torch.jit.script and pickling issues

* Fix get_attr issues

* Fix import in function

* Fix GPT-J and T5 tracing for torch=1.11

* Gate graph surgery on torch version

* Modeling minor changes to enable TorchScripting

* Model serialization / deserialization test

* Remove _assert_is_none users

2e7e4280

Fix Comet ML integration (#17381) · 1cd01b0a

Maximilian Schmidt authored May 23, 2022

Callback function `on_train_end` crashed if the Comet ML integration was
used but `COMET_MODE` was set to `DISABLE`
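
A minimal sketch of the kind of guard that avoids such a crash, under the assumption that a disabled integration leaves no experiment object to close. `StubExperiment` and this `on_train_end` are hypothetical stand-ins written for illustration; they are not the Comet ML API or the callback's real signature.

```python
from typing import Optional


class StubExperiment:
    """Hypothetical stand-in for an experiment-tracking object."""

    def __init__(self) -> None:
        self.ended = False

    def end(self) -> None:
        self.ended = True


def on_train_end(experiment: Optional[StubExperiment]) -> bool:
    """Close the experiment if one exists; report whether anything was closed."""
    if experiment is None:
        # Integration disabled: nothing to finalize, and nothing to crash on.
        return False
    experiment.end()
    return True


active = StubExperiment()
closed_active = on_train_end(active)
closed_disabled = on_train_end(None)
```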

1cd01b0a

Fix cvt docstrings (#17367) · c86aad61
Anugunj Naman authored May 23, 2022

c86aad61

Correct & Improve Doctests for LayoutLMv2 (#17168) · 7b8cb269

ghlai9665 authored May 23, 2022



* add inference example to LayoutLMv2ForQuestionAnswering, passing doctest

* add loss example to LayoutLMv2ForQuestionAnswering, passing doctest

* Add correct doctest for LayoutLMv2ForTokenClassification, passing doctest

* add correct doctest for LayoutLMv2ForSequenceClassification, passing test

* add correct doctest for LayoutLMv2Model, passing test

* make fixup

* fix to address review comments

* make style

* fix doctest line break issue, add to documentation_tests.txt, address review comments

* move comment about layoutlmv2 dependencies to the doc page

* format doc page as suggested
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* delete extraneous backtick
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

7b8cb269

20 May, 2022 2 commits
- Fix a typo relative_postion_if_large -> relative_position_if_large (#17366) · b9bb4173
  Daniel Stancl authored May 20, 2022
  
  b9bb4173
- Pin dill to fix examples (#17368) · 3fd7de49
  Sylvain Gugger authored May 20, 2022

  * Pin dill for now

  * Try this version?

  * force install

  * Actually use dep in testing

  * Try a larger pin

  3fd7de49
19 May, 2022 4 commits

fix for 17292 (#17293) · 5d6feecf
Nathan Dahlberg authored May 19, 2022

5d6feecf
[Generation] Fix Transition probs (#17311) · 518bd02c
Patrick von Platen authored May 19, 2022

* [Draft] fix transition probs

* up

* up

* up

* make it work

* fix

* finish

* update

518bd02c

[OPT] Run test in lower precision on GPU (#17353) · e8714c03

Patrick von Platen authored May 19, 2022

* [OPT] Run test only in half precision

* up

* up

* up

* up

* finish

* fix on GPU

* Update tests/models/opt/test_modeling_opt.py

e8714c03

[BC] Fixing usage of text pairs (#17324) · a4386d7e

Nicolas Patry authored May 19, 2022



* [BC] Fixing usage of text pairs

The breaking change (BC) actually prevents users from misusing the
pipeline: users who intended to send a text pair would have had it
interpreted as a batch, returning bogus results.

The correct usage of text pairs is preserved in this PR, even when that
makes the code clunky.

Adds support for `{"text": ..., "text_pair": ...}` inputs, both for dataset
iteration and for more explicit usage of pairs.

* Updating the doc.

* Update src/transformers/pipelines/text_classification.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/pipelines/text_classification.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_text_classification.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* quality.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
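
The input handling described in this commit message can be sketched as a small normalizer. This is an illustration only, not the pipeline's code: `normalize_inputs` and the rule that a list is always a batch of separate items are assumptions drawn from the description above, and any legacy pair forms the PR keeps are left out.

```python
from typing import Dict, List, Optional, Tuple, Union

Item = Union[str, Dict[str, str]]


def normalize_one(item: Item) -> Tuple[str, Optional[str]]:
    """Turn one input into a (text, text_pair) tuple."""
    if isinstance(item, dict):
        # Explicit pair form: {"text": ..., "text_pair": ...}
        return item["text"], item.get("text_pair")
    if isinstance(item, str):
        return item, None
    raise ValueError(f"Unsupported input type: {type(item).__name__}")


def normalize_inputs(inputs: Union[Item, List[Item]]) -> List[Tuple[str, Optional[str]]]:
    """Treat a list as a batch of independent items, never as one pair."""
    if isinstance(inputs, (str, dict)):
        inputs = [inputs]
    return [normalize_one(item) for item in inputs]


pair = normalize_inputs({"text": "A premise", "text_pair": "A hypothesis"})
batch = normalize_inputs(["first text", "second text"])
```

Here `pair` holds a single item with both halves, while `batch` holds two unrelated items, which is the distinction the commit message says was previously blurred.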

a4386d7e

18 May, 2022 8 commits

fix (#17337) · 6aad3872

Yih-Dar authored May 18, 2022


Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6aad3872

docs for typical decoding (#17186) · 6e195eb9
Jader Martins authored May 18, 2022

Co-authored-by: Jader Martins <jadermcs94@gmail.com>

6e195eb9

Add onnx export cuda support (#17183) · 6da76b9c

Jingya HUANG authored May 18, 2022


Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

6da76b9c

Add CvT (#17299) · adc0ff25

NielsRogge authored May 18, 2022



* Adding cvt files

* Adding cvt files

* changes in init file

* Adding cvt files

* changes in init file

* Style fixes

* Address comments from code review

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Format lists in docstring

* Fix copies

* Apply suggestion from code review
Co-authored-by: AnugunjNaman <anugunjjha@gmail.com>
Co-authored-by: Ayushman Singh <singhayushman13@protonmail.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

adc0ff25

Accepting real pytorch device as arguments. (#17318) · 2cb2ea3f
Nicolas Patry authored May 18, 2022

* Accepting real pytorch device as arguments.

* is_torch_available.

2cb2ea3f
Updating the docs for `max_seq_len` in QA pipeline (#17316) · 1c9d1f4c
Nicolas Patry authored May 18, 2022

1c9d1f4c

[T5] Fix init in TF and Flax for pretraining (#17294) · 60ad7344

Patrick von Platen authored May 18, 2022



* fix init

* Apply suggestions from code review

* fix

* finish

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

60ad7344

Add type hints for ProphetNet (Pytorch) (#17223) · 7ba1d4e5

Joaq authored May 18, 2022



* added type hints to prophetnet

* reformatted with black

* fix bc black misformatted some parts

* fix imports

* fix imports

* Update src/transformers/models/prophetnet/configuration_prophetnet.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* update OPTIONAL type hint and docstring
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

7ba1d4e5