Commits · 17503b00ea2dd96bc67b2cf1689d1fb6a4ca67c8 · chenpangpang / transformers

12 Apr, 2023 1 commit
- Added parallel device usage for GPT-J (#22713) · 17503b00
  jprivera44 authored Apr 12, 2023
  
  17503b00
27 Mar, 2023 1 commit
- Generate: support for left-padding on GPTNeoX and Llama (#22382) · 7dcd8703
  Joao Gante authored Mar 27, 2023
  
  7dcd8703
23 Mar, 2023 1 commit

[gptj] support older pytorch version (#22325) · 61f79b29

Stas Bekman authored Mar 22, 2023



* [gptj] support older pytorch version

* contributor

* contributor

* make copies

---------
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Co-authored-by: Nick Hill <nickhill@us.ibm.com>

61f79b29

22 Mar, 2023 1 commit

Fix position embeddings for GPT-J and CodeGen (#22069) · 4e94c6c0

Nick Hill authored Mar 22, 2023

* Revert "[GPT-J] add deprecation warning (#21869)"

This reverts commit fb76994c.

* Fix position embeddings for GPT-J and CodeGen

* Address review comments from @gante

* Fix "Copied from" comment referencing wrong function

* Fix copy/paste mistake

* Fix training path

* Hopefully make torch.fx happy

* Move position_ids long cast

* Revert "Hopefully make torch.fx happy"

This reverts commit e41a6f4cad3ff441124c7457b19cfb630d4ca025.

* Changes to help with torch.fx tracing

* Linter fix

* Correct position_ids tensor type hint

* Work-around torch.fx tracing issue

* Get the changes to work with torch.fx

* Address review comment from @michaelbenayoun

* Another small adjustment

* Add explanatory comment; small code tidyup

4e94c6c0

02 Mar, 2023 1 commit
- [GPT-J] add deprecation warning (#21869) · fb76994c
  Arthur authored Mar 02, 2023
```
* add deprecation warning

* remove pos ids from args docstirng

* fix failing test
```
  fb76994c
28 Feb, 2023 1 commit

[GPTJ] Fix gradient checkpointing bug (#21794) · 31fa2b6c

Herumb Shandilya authored Feb 28, 2023



* If applied, this commit fixes generate bug in gptj

* Remove extra same code block

* formatting and test fix

* Conflict fix and declaration error fix

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

31fa2b6c

27 Feb, 2023 2 commits

introduce `logger.warning_once` and use it for grad checkpointing code (#21804) · c7f3abc2
Stas Bekman authored Feb 27, 2023
```
* logger.warning_once

* style
```
c7f3abc2

[torch] remove deprecated uint8 in favor of bool (#21384) · c51dc4f9

Arthur authored Feb 27, 2023



* uint8 -> bool

* fix copies

* style

* update test modeling commen when checking attention buffers

* style

* use logical not on random mask instead of subtraction with 1

* remove torch uint8

* quality

* remove modified modeling utils

* Update based on review
Co-authored-by: sgugger <sylvain.gugger@gmail.com>

---------
Co-authored-by: sgugger <sylvain.gugger@gmail.com>

c51dc4f9

22 Feb, 2023 1 commit
- Apply ruff flake8-comprehensions (#21694) · 5e8c8eb5
  Aaron Gokaslan authored Feb 22, 2023
  
  5e8c8eb5
13 Feb, 2023 1 commit
- Add `inputs_embeds` support when generating with GPT-J (#21575) · 93ed89bf
  Dzmitry Pletnikau authored Feb 13, 2023
  
  93ed89bf
07 Feb, 2023 2 commits

[CI ] Remove `past` in favor of `pat_key_values` (#21443) · 12eb528b

Arthur authored Feb 07, 2023

* fix past renamed to past_key_value

* update more `past`that were ski^êd

* fixup

* remove changes made to rag

* refactor `_reorder_cache` to use `past_key_values`

* fix git `prepare_inputs_for_generation` to pass tests when false is needed in use_cache

12eb528b

Deprecate parallelize API (#21448) · 5b493762
Sylvain Gugger authored Feb 06, 2023
```
* Deprecate parallelize API

* Add documentation

* Fix copies
```
5b493762

06 Feb, 2023 1 commit

Update quality tooling for formatting (#21480) · 6f79d264

Sylvain Gugger authored Feb 06, 2023

* Result of black 23.1

* Update target to Python 3.7

* Switch flake8 to ruff

* Configure isort

* Configure isort

* Apply isort with line limit

* Put the right black version

* adapt black in check copies

* Fix copies

6f79d264

23 Jan, 2023 1 commit

Models docstring (#21225) · fd5cdaee

Sylvain Gugger authored Jan 23, 2023

* Clean all models

* Style

* Last to remove

* address review comments

* Address review comments

fd5cdaee

20 Jan, 2023 1 commit

Fix `GPTJ` doctest (#21213) · ef530175

Yih-Dar authored Jan 20, 2023



Replace the checkpoint - the current one has shape issue
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ef530175

19 Jan, 2023 1 commit
- Add disclaimer for necessary fake models (#21178) · 862888a3
  Sylvain Gugger authored Jan 19, 2023
```
* Add disclaimer for necessary fake models

* Address review comments

* Use for GPT-NeoX as well
```
  862888a3
08 Jan, 2023 1 commit

Replace `past` with `past_key_values` (#20944) · f0577df6

Arthur authored Jan 08, 2023

* start cleanup

* more updates

* more models are affected

* more updates

* update generation utils

* style

* revert change that removed reorder cachce

* update generation utils

* style

* style

* remove reorder cache

f0577df6

08 Dec, 2022 1 commit

Fix CIs for PyTorch 1.13 (#20686) · e3cc4487

Yih-Dar authored Dec 08, 2022



* fix 1

* fix 2

* fix 3

* fix 4
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e3cc4487

23 Sep, 2022 1 commit

Fix incorrect comments about atten mask for pytorch backend (#18728) · ece76244

Tianqi Zhang (张天启) authored Sep 24, 2022



* fix incorrect comments about atten mask

* typo

* Update for CodeGen
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ece76244

20 Jun, 2022 1 commit

Not use -1e4 as attn mask (#17306) · d3cb2888

Yih-Dar authored Jun 20, 2022



* Use torch.finfo(self.dtype).min

* for GPTNeoX

* for Albert

* For Splinter

* Update src/transformers/models/data2vec/modeling_data2vec_audio.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix -inf used in Bart-like models

* Fix a few remaining -inf

* more fix

* clean up

* For CLIP

* For FSMT

* clean up

* fix test

* Add dtype argument and use it for LayoutLMv3

* update FlaxLongT5Attention
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d3cb2888

15 Jun, 2022 1 commit
- normalize keys_to_ignore (#17722) · 66f89332
  Stas Bekman authored Jun 15, 2022
  
  66f89332
23 May, 2022 2 commits

Use Accelerate in `from_pretrained` for big model inference (#17341) · 56f50590

Sylvain Gugger authored May 23, 2022



* Initial work

* More or less finished with first draft

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Fix randomly initialized weights

* Update src/transformers/modeling_utils.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Address review comments

* Rename DeepSpeed folder to temporarily fix the test issue?

* Revert to try if Accelerate fix works

* Use latest Accelerate release

* Quality and fixes

* Style

* Quality

* Add doc

* Test + fix

* More blocks
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

56f50590

Traced models serialization and torchscripting fix (#17206) · 2e7e4280

Michael Benayoun authored May 23, 2022

* Fix torch.jit.script and pickling issues

* Fix get_attr issues

* Fix import in function

* Fix GPT-J and T5 tracing for torch=1.11

* Gate graph surgery on torch version

* Modeling minor changes to enable TorchScripting

* Model serialization / deserialization test

* Remove _assert_is_none users

2e7e4280

12 May, 2022 1 commit

Black preview (#17217) · afe5d42d

Sylvain Gugger authored May 12, 2022

* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black

afe5d42d

05 May, 2022 1 commit
- type hints for pytorch models (#17064) · 45360e1a
  Robot Jelly authored May 05, 2022
```
* type hints for pytorch models

* fixed import error

* fixed some errors
```
  45360e1a
21 Apr, 2022 1 commit

Fix GPT-J onnx conversion (#16780) · 0b1e0fcf

Thomas Chaigneau authored Apr 21, 2022



* add gptj to TOKENIZER_MAPPING_NAMES

* fix int32 to float to avoid problem in onnx

* Update src/transformers/models/gptj/modeling_gptj.py
Co-authored-by: ChainYo <t.chaigneau.tc@gmail.com>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

0b1e0fcf

13 Apr, 2022 1 commit

Add Doc Test for GPT-J (#16507) · 06b4aac9

Michael Chung authored Apr 13, 2022



* Required the values GPTJ unfortunately cannot run the model =)

* Added the file to the doc tests

* Run Fixup and Style

* Fixed with the test versions of gptj. Ran Style and Fixup.

* Trigger ci

* A Minor Change to License

* Fixed spacing added to the benchmark_utils. Then refactored tests to const variables.

* Removed strings that were included as default parameters anyways.
Co-authored-by: ArEnSc <xx.mike.chung.xx@gmail.com>

06b4aac9

12 Apr, 2022 1 commit

Replace assertion with exception (#16720) · cc034f72

Anmol Joshi authored Apr 12, 2022



* Updated assertions to exceptions

* updated assertions to exceptions

* bug fixes

* fix-copies

* Update modeling_ctrl.py

* Update src/transformers/models/ctrl/modeling_tf_ctrl.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_neo/modeling_gpt_neo.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gptj/modeling_gptj.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gptj/modeling_tf_gptj.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update modeling_led.py

* Update modeling_led.py

* Update modeling_led.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

cc034f72

30 Mar, 2022 1 commit
- Add support for exporting GPT-J to ONNX-TRT (#16492) · ae189ef9
  tomerip authored Mar 30, 2022
```
Add support for exporting GPT-J to ONNX-TRT
Co-authored-by: Tomer Stav <stavt@amazon.com>
```
  ae189ef9
25 Mar, 2022 1 commit
- Big file_utils cleanup (#16396) · 088c1880
  Sylvain Gugger authored Mar 25, 2022
```
* Big file_utils cleanup

* This one still needs to be treated separately
```
  088c1880
23 Mar, 2022 1 commit

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wront move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit

4975002d

08 Feb, 2022 1 commit
- [GPTJ] fix docs (#15558) · 0acd84f7
  Suraj Patil authored Feb 08, 2022
  
  0acd84f7
07 Feb, 2022 1 commit

FX tracing improvement (#14321) · 0fe17f37

Michael Benayoun authored Feb 07, 2022

* Change the way tracing happens, enabling dynamic axes out of the box

* Update the tests and modeling xlnet

* Add the non recoding of leaf modules to avoid recording more values for the methods to record than what will be seen at tracing time (which would otherwise desynchronize the recorded values and the values that need to be given to the proxies during tracing, causing errors).

* Comments and making tracing work for gpt-j and xlnet

* Refactore things related to num_choices (and batch_size, sequence_length)

* Update fx to work on PyTorch 1.10

* Postpone autowrap_function feature usage for later

* Add copyrights

* Remove unnecessary file

* Fix issue with add_new_model_like

* Apply suggestions

0fe17f37

28 Dec, 2021 1 commit

Doc styler examples (#14953) · b5e2b183

Sylvain Gugger authored Dec 27, 2021

* Fix bad examples

* Add black formatting to style_doc

* Use first nonempty line

* Put it at the right place

* Don't add spaces to empty lines

* Better templates

* Deal with triple quotes in docstrings

* Result of style_doc

* Enable mdx treatment and fix code examples in MDXs

* Result of doc styler on doc source files

* Last fixes

* Break copy from

b5e2b183

27 Dec, 2021 1 commit

Doc styler v2 (#14950) · 87e6e4fe

Sylvain Gugger authored Dec 27, 2021

* New doc styler

* Fix issue with args at the start

* Code sample fixes

* Style code examples in MDX

* Fix more patterns

* Typo

* Typo

* More patterns

* Do without black for now

* Get more info in error

* Docstring style

* Re-enable check

* Quality

* Fix add_end_docstring decorator

* Fix docstring

87e6e4fe

21 Dec, 2021 2 commits

Mass conversion of documentation from rst to Markdown (#14866) · 27b3031d

Sylvain Gugger authored Dec 21, 2021

* Convert docstrings of all configurations and tokenizers

* Processors and fixes

* Last modeling files and fixes to models

* Pipeline modules

* Utils files

* Data submodule

* All the other files

* Style

* Missing examples

* Style again

* Fix copies

* Say bye bye to rst docstrings forever

27b3031d

Convert docstrings of modeling files (#14850) · 7af80f66

Sylvain Gugger authored Dec 21, 2021

* Convert file_utils docstrings to Markdown

* Test on BERT

* Return block indent

* Temporarily disable doc styler

* Remove from quality checks as well

* Remove doc styler mess

* Remove check from circleCI

* Fix typo

* Convert file_utils docstrings to Markdown

* Test on BERT

* Return block indent

* Temporarily disable doc styler

* Remove from quality checks as well

* Remove doc styler mess

* Remove check from circleCI

* Fix typo

* Let's go on all other model files

* Add templates too

* Styling and quality

7af80f66

06 Dec, 2021 1 commit

Add GPTJForQuestionAnswering (#14503) · 0f3f045e

tucan9389 authored Dec 07, 2021



* Add GPTJForQuestionAnswering

* Reformat for GPTJForQuestionAnswering

* Fix isort error

* make style for GPTJForQA

* Add _keys_to_ignore_on_load_missing

* Change the sequence of qa and classification
Co-authored-by: Suraj Patil <surajp815@gmail.com>

0f3f045e

30 Nov, 2021 1 commit

use functional interface for softmax in attention (#14198) · 6ed9882d

Thomas Viehmann authored Nov 30, 2021

* use functional interface instead of instantiating module and immediately calling it

* fix torch.nn.functional to nn.functional. Thank you Stas!

6ed9882d

18 Nov, 2021 1 commit

Add a post init method to all models (#14431) · d83b0e0c

Sylvain Gugger authored Nov 18, 2021

* Add a post init method to all models

* Fix tests

* Fix last tests

* Fix templates

* Add comment

* Forgot to save

d83b0e0c