Commits · ba3f9a71a1d0ac30b2f469f426f350fe6f374de7 · chenpangpang / transformers

"test/srt/vscode:/vscode.git/clone" did not exist on "1083e7e3df1e637f3f9fe246f0e87162fa7199a1"

09 Feb, 2022 1 commit

logger.warn --> logger.warning (#15572) · ba3f9a71

Yih-Dar authored Feb 09, 2022



* change logger.warn to logger.warning

* make style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ba3f9a71

07 Feb, 2022 2 commits

FX tracing improvement (#14321) · 0fe17f37

Michael Benayoun authored Feb 07, 2022

* Change the way tracing happens, enabling dynamic axes out of the box

* Update the tests and modeling xlnet

* Add the non recoding of leaf modules to avoid recording more values for the methods to record than what will be seen at tracing time (which would otherwise desynchronize the recorded values and the values that need to be given to the proxies during tracing, causing errors).

* Comments and making tracing work for gpt-j and xlnet

* Refactore things related to num_choices (and batch_size, sequence_length)

* Update fx to work on PyTorch 1.10

* Postpone autowrap_function feature usage for later

* Add copyrights

* Remove unnecessary file

* Fix issue with add_new_model_like

* Apply suggestions

0fe17f37

[torch_int_div] Correct true division in generation (#15498) · c47d2592
Patrick von Platen authored Feb 07, 2022
```
* [torch_int_div] Correct true division in generation

* up

* up
```
c47d2592

02 Feb, 2022 1 commit

Save code of registered custom models (#15379) · 44b21f11

Sylvain Gugger authored Feb 02, 2022



* Allow dynamic modules to use relative imports

* Work for configs

* Fix last merge conflict

* Save code of registered custom objects

* Map strings to strings

* Fix test

* Add tokenizer

* Rework tests

* Tests

* Ignore fixtures py files for tests

* Tokenizer test + fix collection

* With full path

* Rework integration

* Fix typo

* Remove changes in conftest

* Test for tokenizers

* Add documentation

* Update docs/source/custom_models.mdx
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Add file structure and file content

* Add more doc

* Style

* Update docs/source/custom_models.mdx
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Address review comments
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

44b21f11

21 Jan, 2022 1 commit

Refine errors for pretrained objects (#15261) · 6ac77534

Sylvain Gugger authored Jan 21, 2022

* Refine errors for pretrained objects

* PoC to avoid using get_list_of_files

* Adapt tests to use new errors

* Quality + Fix PoC

* Revert "PoC to avoid using get_list_of_files"

This reverts commit cb93b7cae8504ef837c2a7663cb7955e714f323e.

* Revert "Quality + Fix PoC"

This reverts commit 3ba6d0d4ca546708b31d355baa9e68ba9736508f.

* Fix doc

* Revert PoC

* Add feature extractors

* More tests and PT model

* Adapt error message

* Feature extractor tests

* TF model

* Flax model and test

* Merge flax auto tests

* Add tokenization

* Fix test

6ac77534

20 Jan, 2022 1 commit
- Make sure to raise NotImplementedError with correct method name (#15253) · 1fc0fa46
  kumapo authored Jan 21, 2022
  
  1fc0fa46
18 Jan, 2022 1 commit

Fix deprecation warnings for int div (#15180) · 531336bb

Sylvain Gugger authored Jan 18, 2022



* Fix deprecation warnings for int div
Co-authored-by: mgoldey <matthew.goldey@gmail.com>

* Fix import

* ensure that tensor output is python scalar

* make backward compatible

* make code more readable

* adapt test functions
Co-authored-by: mgoldey <matthew.goldey@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

531336bb

28 Dec, 2021 1 commit

Doc styler examples (#14953) · b5e2b183

Sylvain Gugger authored Dec 27, 2021

* Fix bad examples

* Add black formatting to style_doc

* Use first nonempty line

* Put it at the right place

* Don't add spaces to empty lines

* Better templates

* Deal with triple quotes in docstrings

* Result of style_doc

* Enable mdx treatment and fix code examples in MDXs

* Result of doc styler on doc source files

* Last fixes

* Break copy from

b5e2b183

27 Dec, 2021 3 commits

[doc] :obj: hunt (#14954) · e13f72fb
Stas Bekman authored Dec 27, 2021
```
* redo sans examples

* style
```
e13f72fb

[doc] consistent True/False/None default format (#14951) · 133c5e40

Stas Bekman authored Dec 27, 2021



* [doc] consistent True/False/None default format

* Update src/transformers/models/xlnet/modeling_xlnet.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

133c5e40

Doc styler v2 (#14950) · 87e6e4fe

Sylvain Gugger authored Dec 27, 2021

* New doc styler

* Fix issue with args at the start

* Code sample fixes

* Style code examples in MDX

* Fix more patterns

* Typo

* Typo

* More patterns

* Do without black for now

* Get more info in error

* Docstring style

* Re-enable check

* Quality

* Fix add_end_docstring decorator

* Fix docstring

87e6e4fe

21 Dec, 2021 1 commit

Mass conversion of documentation from rst to Markdown (#14866) · 27b3031d

Sylvain Gugger authored Dec 21, 2021

* Convert docstrings of all configurations and tokenizers

* Processors and fixes

* Last modeling files and fixes to models

* Pipeline modules

* Utils files

* Data submodule

* All the other files

* Style

* Missing examples

* Style again

* Fix copies

* Say bye bye to rst docstrings forever

27b3031d

20 Dec, 2021 1 commit

Add a main_input_name attribute to all models (#14803) · 33f36c86

Sylvain Gugger authored Dec 20, 2021



* Add a main_input_name attribute to all models

* Fix tests

* Wtf Vs Code?

* Update src/transformers/models/imagegpt/modeling_imagegpt.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Style

* Fix copies
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

33f36c86

01 Dec, 2021 1 commit

WIP: Support for Training with BF16 (#13207) · 70996a54

Jamie DeAntonis authored Nov 30, 2021



* started bf16 integration

* minor changes

* code now runs

* style

* lay foundation for bf16 testing

* lay foundation for bf16 testing

* start the tests

* better bf16 check

* style

* 2 separate checkers - one for bf16 support, another for bf16+autocast

* Update src/transformers/training_args.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* a couple of comment resolutions

* more comment resolutions

* resolved a small bug

* just some print statemtns

* added todo marking

* added a todo

* adjust for API change s/fast_dtype/dtype/

* fix style

* merge 2 bf16 util functions

* bf16 now does scaling too

* Add support for bfloat16

* Revert T5 layernorm to float32

This is based on the comment at https://github.com/huggingface/transformers/pull/14448/files#r752660929 and the PyTorch PR https://github.com/pytorch/pytorch/pull/66920

 .

* Add comment about conversion to float32 before returning the numpy data

* Add comment about AMP-bfloat16 incompatibility

* Fix formatting

* typo

* reformer / bf16

* cleanup

* require at least pt-1.10

* fix

* will deal with deepspeed separately

* cleanup

* revert

* cleanup

* fp16_full_eval and bf16_full_eval are separate modes

* proper deprecation

* cleanup

* test and fixes

* spelling

* cleanup

* add a note that this API is experimental
Co-authored-by: jamie <jamie@cortx.com>
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: suriya <suriya@cortx.com>
Co-authored-by: Manuel R. Ciosici <manuelrciosici@gmail.com>

70996a54

18 Nov, 2021 1 commit

Add a post init method to all models (#14431) · d83b0e0c

Sylvain Gugger authored Nov 18, 2021

* Add a post init method to all models

* Fix tests

* Fix last tests

* Fix templates

* Add comment

* Forgot to save

d83b0e0c

16 Nov, 2021 1 commit

Fix gradient_checkpointing backward compatibility (#14408) · 040fd471

Sylvain Gugger authored Nov 16, 2021



* Fix gradient_checkpointing backward compatibility

* Remove needless line

* make sure mask prob is big enough and length small enough

* Fix tests
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

040fd471

10 Nov, 2021 1 commit
- enhance rewrite state_dict missing _metadata (#14348) · bec02ff2
  Chang Wang authored Nov 10, 2021
  
  bec02ff2
04 Nov, 2021 1 commit
- improve rewrite state_dict missing _metadata (#14276) · fd8136fa
  Chang Wang authored Nov 04, 2021
  
  fd8136fa
28 Oct, 2021 1 commit

[modeling_utils] respect original dtype in _get_resized_lm_head (#14181) · 123cce6f

Stas Bekman authored Oct 27, 2021



* respect dtype in _get_resized_lm_head

* Update src/transformers/modeling_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* consistency
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

123cce6f

22 Oct, 2021 1 commit
- Rename variables with unclear naming (#14122) · 62ccbe09
  Li-Huai (Allan) Lin authored Oct 23, 2021
```
* Rename var

* Add comments
```
  62ccbe09
21 Oct, 2021 1 commit

Fix ignore_mismatched_sizes (#14085) · 234cfefb

Li-Huai (Allan) Lin authored Oct 22, 2021

* Fix

* Style

* Name

* Fix tests

* Style

* Remove embed sizes checking

* Disable some tests

* Fix

* Apply suggestion

234cfefb

14 Oct, 2021 1 commit
- Remove wrong model_args supplied (#13937) · 51ee20fc
  Li-Huai (Allan) Lin authored Oct 14, 2021
```
* Remove wrong model_args of config.from_pretrained

* Fix tf & flax
```
  51ee20fc
11 Oct, 2021 1 commit

[Gradient checkpoining] Correct disabling `find_unused_parameters` in Trainer... · dca67968

Patrick von Platen authored Oct 11, 2021

[Gradient checkpoining] Correct disabling `find_unused_parameters` in Trainer when gradient checkpointing is enabled (#13961)

* up

* correct test

dca67968

08 Oct, 2021 1 commit

Adds `PreTrainedModel.framework` attribute (#13817) · de344815

Stella Biderman authored Oct 08, 2021



* Added `framework` attribute

* Update modeling_utils.py

* Update modeling_flax_utils.py

* Update modeling_tf_utils.py

* Update modeling_utils.py

* Update modeling_tf_utils.py

* Update modeling_tf_utils.py

* Update modeling_flax_utils.py

* Update modeling_tf_utils.py

* Update modeling_utils.py

* Update modeling_utils.py

* Update modeling_tf_utils.py

* Update modeling_flax_utils.py

* string -> str

* Update modeling_tf_utils.py

* string -> str

* fixup

* make flake happy
Co-authored-by: patil-suraj <surajp815@gmail.com>

de344815

07 Oct, 2021 2 commits
- Add missing character (#13922) · 5f34163b
  Mishig Davaadorj authored Oct 07, 2021
  
  5f34163b
- Add missing whitespace to multiline strings (#13916) · 57420b10
  Alex Hedges authored Oct 07, 2021
  
  57420b10
05 Oct, 2021 1 commit
- Improve error message when loading models from Hub (#13836) · 46efc580
  Alex Hedges authored Oct 05, 2021
```
* Improve error message when loading models from Hub

* Adjust error message wording
```
  46efc580
24 Sep, 2021 1 commit

Make assertions only if actually chunking forward (#13598) · 678bb248

Josh Devins authored Sep 24, 2021

This moves the assertion on checking input dimensions into a block that will only be called if the function is actually going to do chunking forward. This is often not the case at inference time and PyTorch tracing a model with this assertion in it leads to a tracing warning.

TracerWarning: Converting a tensor to a Python boolean might cause the trace to be incorrect. We can't record the data flow of Python values, so this value will be treated as a constant in the future. This means that the trace might not generalize to other inputs!
input_tensor.shape[chunk_dim] == tensor_shape for input_tensor in input_tensors

678bb248

23 Sep, 2021 1 commit

1x model size CPU memory usage for `from_pretrained` (#13466) · 62832c96

Stas Bekman authored Sep 22, 2021

* one possible solution

* low mem from_pretrained

* edge cases

* solve the persistent buffers

* style

* parametrize

* for later

* proper solution

* cleanup

* refactor; rework based on suggestions

* revert splitting into 2 parts, move checks into main func

62832c96

22 Sep, 2021 1 commit

Make gradient_checkpointing a training argument (#13657) · 27d46397

Sylvain Gugger authored Sep 22, 2021



* Make gradient_checkpointing a training argument

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/configuration_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Fix tests

* Style

* document Gradient Checkpointing as a performance feature

* Small rename

* PoC for not using the config

* Adapt BC to new PoC

* Forgot to save

* Rollout changes to all other models

* Fix typo
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>

27d46397

17 Sep, 2021 1 commit
- Use `config_dict_or_path` for deepspeed.zero.Init (#13614) · ce32c69c
  Alex Hedges authored Sep 17, 2021
  
  ce32c69c
16 Sep, 2021 1 commit
- [deepspeed] replaced deprecated init arg (#13587) · bec2e3f5
  Stas Bekman authored Sep 16, 2021
```
* [deepspeed] replaced deprecated init arg

* Trigger CI
```
  bec2e3f5
15 Sep, 2021 1 commit
- [Pretrained Model] Add resize_position_embeddings (#13559) · 95f933ea
  Patrick von Platen authored Sep 15, 2021
```
* finish

* delete bogus file

* correct some stuff

* finish

* finish
```
  95f933ea
08 Sep, 2021 1 commit
- Better error raised when cloned without lfs (#13401) · 99029ab6
  Lysandre Debut authored Sep 08, 2021
```
* Better error raised when cloned without lfs

* add from e
```
  99029ab6
30 Aug, 2021 1 commit
- fix: typo spelling grammar (#13212) · 01977466
  arfy slowy authored Aug 30, 2021
```
* fix: typo spelling grammar

* fix: make fixup
```
  01977466
26 Aug, 2021 1 commit

Add error message concerning revision (#13266) · 401377e6

Bram Vanroy authored Aug 26, 2021



* add error message concerning revision

* Update src/transformers/configuration_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* re-add double line endings

* is not None instead of implicit bool casting
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

401377e6

06 Aug, 2021 1 commit

Tpu tie weights (#13030) · 7fcee113

Sylvain Gugger authored Aug 06, 2021

* Fix tied weights on TPU

* Manually tie weights in no trainer examples

* Fix for test

* One last missing

* Gettning owned by my scripts

* Address review comments

* Fix test

* Fix tests

* Fix reformer tests

7fcee113

04 Aug, 2021 1 commit

Fix from_pretrained with corrupted state_dict (#12939) · d4c834d2

Sylvain Gugger authored Aug 04, 2021

* Fix from_pretrained with corrupted state_dict

* Adapt test

* Use better checkpoint

* Style

* Clean up

d4c834d2

17 Jul, 2021 1 commit
- Fix push_to_hub docstring and make it appear in doc (#12770) · da72ac6e
  Sylvain Gugger authored Jul 17, 2021
  
  da72ac6e
13 Jul, 2021 1 commit

[Deepspeed] adapt multiple models, add zero_to_fp32 tests (#12477) · 78f5fe14

Stas Bekman authored Jul 13, 2021



* zero_to_fp32 tests

* args change

* remove unnecessary work

* use transformers.trainer_utils.get_last_checkpoint

* document the new features

* cleanup

* wip

* fix fsmt

* add bert

* cleanup

* add xlm-roberta

* electra works

* cleanup

* sync

* split off the model zoo tests

* cleanup

* cleanup

* cleanup

* cleanup

* reformat

* cleanup

* casing

* deepspeed>=0.4.3

* adjust distilbert

* Update docs/source/main_classes/deepspeed.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

78f5fe14