Commits · eb849f6604c7dcc0e96d68f4851e52e253b9f0e5 · chenpangpang / transformers

20 Jun, 2023 1 commit

Migrate doc files to Markdown. (#24376) · eb849f66

Sylvain Gugger authored Jun 20, 2023



* Rename index.mdx to index.md

* With saved modifs

* Address review comment

* Treat all files

* .mdx -> .md

* Remove special char

* Update utils/tests_fetcher.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

---------
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

eb849f66

16 Jun, 2023 1 commit

Big TF test cleanup (#24282) · 34037129

Matt authored Jun 16, 2023

* Fix one BLIP arg not being optional, remove misspelled arg

* Remove the lxmert test overrides and just use the base test_saved_model_creation

* saved_model_creation fixes and re-enabling tests across the board

* Remove unnecessary skip

* Stop caching sinusoidal embeddings in speech_to_text

* Fix transfo_xl compilation

* Fix transfo_xl compilation

* Fix the conditionals in xglm

* Set the save spec only when building

* Clarify comment

* Move comment correctly

* Correct embeddings generation for speech2text

* Mark RAG generation tests as @slow

* Remove redundant else:

* Add comment to clarify the save_spec line in build()

* Fix size tests for XGLM at last!

* make fixup

* Remove one band_part operation

* Mark test_keras_fit as @slow

34037129

13 Jun, 2023 2 commits

Stop storing references to bound methods via tf.function (#24146) · 3bd1fe43

Matt authored Jun 13, 2023

* Stop storing references to bound methods in tf.functions

* Remove the gc.collect calls now that we resolved the underlying problem

* Remove the default signature from model.serving entirely, big cleanup

* Remove _prune_signature as self.input_signature can prune itself

* Restore serving docstring

* Update int support test to check the input signature

* Make sure other tests also use model.input_signature and not serving.input_signature

* Restore _prune_signature

* Remove the doctest GC now it's no longer needed

* Correct core tests to use the pruned sig

* order lines correctly in core tests

* Add eager_serving back with a deprecation warning

3bd1fe43

TF: standardize `test_model_common_attributes` for language models (#23457) · 7bb6933b
Joao Gante authored Jun 13, 2023

7bb6933b

24 May, 2023 3 commits

Remove the last few TF serving sigs (#23738) · e45e756d
Matt authored May 24, 2023
```
Remove some more serving methods that (I think?) turned up while this PR was open
```
e45e756d

Overhaul TF serving signatures + dummy inputs (#23234) · 814de8fa

Matt authored May 24, 2023

* Let's try autodetecting serving sigs

* Don't clobber existing sigs

* Change shapes for multiplechoice models

* Make default dummy inputs smarter too

* Fix missing f-string

* Let's YOLO a serving output too

* Read __class__.__name__ properly

* Don't just pass naked lists in there and expect it to be okay

* Code cleanup

* Update default serving sig

* Clearer error messages

* Further updates to the default serving output

* make fixup

* Update the serving output a bit more

* Cleanups and renames, raise errors appropriately when we can't infer inputs

* More renames

* we're building in a functional context again, yolo

* import DUMMY_INPUTS from the right place

* import DUMMY_INPUTS from the right place

* Support cross-attention in the dummies

* Support cross-attention in the dummies

* Complete removal of dummy/serving overrides in BERT

* Complete removal of dummy/serving overrides in RoBERTa

* Obliterate lots and lots of serving sig and dummy overrides

* merge type hint changes

* Fix for token_type_ids with vocab_size 1

* Add missing property decorator

* Fix T5 and hopefully some models that take conv inputs

* More signature pruning

* Fix T5's signature

* Fix Wav2Vec2 signature

* Fix LongformerForMultipleChoice input signature

* Fix BLIP and LED

* Better default serving output error handling

* Fix BART dummies

* Fix dummies for cross-attention, esp encoder-decoder models

* Fix visionencoderdecoder signature

* Fix BLIP serving output

* Small tweak to BART dummies

* Cleanup the ugly parameter inspection line that I used in a few places

* committed a breakpoint again

* Move the text_dims check

* Remove blip_text serving_output

* Add decoder_input_ids to the default input sig

* Remove all the manual overrides for encoder-decoder model signatures

* Tweak longformer/led input sigs

* Tweak default serving output

* output.keys() -> output

* make fixup

814de8fa

Better TF docstring types (#23477) · f8b25744

Matt authored May 24, 2023

* Rework TF type hints to use | None instead of Optional[] for tf.Tensor

* Rework TF type hints to use | None instead of Optional[] for tf.Tensor

* Don't forget the imports

* Add the imports to tests too

* make fixup

* Refactor tests that depended on get_type_hints

* Better test refactor

* Fix an old hidden bug in the test_keras_fit input creation code

* Fix for the Deit tests

f8b25744

23 May, 2023 1 commit

Fix some docs on what layerdrop does (#23691) · 003a0cf8

zspo authored May 24, 2023



* Fix some docs on what layerdrop does

* Update src/transformers/models/data2vec/configuration_data2vec_audio.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Fix more docs

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

003a0cf8

17 May, 2023 1 commit
- TF: embeddings out of bounds check factored into function (#23427) · cf9e7cb0
  Joao Gante authored May 17, 2023
  
  cf9e7cb0
08 May, 2023 1 commit
- docs: Fix broken link in 'How to add a model...' (#23216) · 188a8bfc
  Connor Henderson authored May 08, 2023
```
fix link
```
  188a8bfc
11 Apr, 2023 1 commit
- Make it easier to develop without a dev install (#22697) · 28c19ab5
  Sylvain Gugger authored Apr 11, 2023
```
* Make it easier to develop without a dev install

* Remove ugly hack that doesn't work anyway
```
  28c19ab5
24 Feb, 2023 1 commit
- Generate - update cookie cutters to not initialize cache with training and... · 440f3975
  Joao Gante authored Feb 24, 2023
```
Generate - update cookie cutters to not initialize cache with training and gradient checkpointing (#21759)
```
  440f3975
14 Feb, 2023 1 commit
- Final cleanup of TOKENIZER_FOR_DOC (#21565) · 68b21b37
  Sylvain Gugger authored Feb 14, 2023
```
Final cleanup of TOKENIZER_FOR_DOC
```
  68b21b37
07 Feb, 2023 2 commits

Cleanup quality (#21493) · 67d07487

Sylvain Gugger authored Feb 07, 2023

* Remove mentions of flake8/isort

* Clean up inits

* Deal with all other inits

* Last special rule for dummy files

67d07487

[CI] Remove `past` in favor of `past_key_values` (#21443) · 12eb528b

Arthur authored Feb 07, 2023

* fix past renamed to past_key_value

* update more `past` that were skipped

* fixup

* remove changes made to rag

* refactor `_reorder_cache` to use `past_key_values`

* fix GIT `prepare_inputs_for_generation` to pass tests when `use_cache` needs to be False

12eb528b

19 Jan, 2023 1 commit
- Flax dtype-dependent numerical masking (#21197) · cbaaa2f6
  Joao Gante authored Jan 19, 2023
  
  cbaaa2f6
09 Jan, 2023 1 commit
- Patch-past-refactor (#21050) · e3ecbaa4
  Arthur authored Jan 09, 2023
```
* small patches, forgot a line

* refactor PT

* the actual fix
```
  e3ecbaa4
08 Jan, 2023 1 commit

Replace `past` with `past_key_values` (#20944) · f0577df6

Arthur authored Jan 08, 2023

* start cleanup

* more updates

* more models are affected

* more updates

* update generation utils

* style

* revert change that removed reorder cache

* update generation utils

* style

* style

* remove reorder cache

f0577df6

03 Jan, 2023 1 commit
- Generate: delete unused TF `_reorder_cache` (#20964) · 4fd89e49
  Joao Gante authored Jan 03, 2023
  
  4fd89e49
27 Dec, 2022 1 commit
- fix docs typos in "add_new_model" (#20900) · e35bc46a
  Eli Simhayev authored Dec 27, 2022
```
fix Jupyter typos
```
  e35bc46a
08 Dec, 2022 1 commit

Fix CIs for PyTorch 1.13 (#20686) · e3cc4487

Yih-Dar authored Dec 08, 2022



* fix 1

* fix 2

* fix 3

* fix 4
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e3cc4487

05 Dec, 2022 1 commit

Cleanup some config attributes (#20554) · 9ffbed26

Yih-Dar authored Dec 05, 2022



* Remove is_encoder_decoder from some vision models

* cleanup more

* cleanup more
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

9ffbed26

30 Nov, 2022 1 commit
- Update doc examples feature extractor -> image processor (#20501) · 17a7b49b
  amyeroberts authored Nov 30, 2022
```
* Update doc example feature extractor -> image processor

* Apply suggestions from code review
```
  17a7b49b
28 Nov, 2022 1 commit

More TF int dtype fixes (#20384) · de4159a3

Matt authored Nov 28, 2022

* Add a test to ensure int dummy inputs are int64

* Move the test into the existing int64 test and update a lot of existing dummies

* Fix remaining dummies

* Fix remaining dummies

* Test for int64 serving sigs as well

* Update core tests to use tf.int64

* Add better messages to the assertions

* Update all serving sigs to int64

* More sneaky hiding tf.int32s

* Add an optional int32 signature in save_pretrained

* make fixup

* Add Amy's suggestions

* Switch all serving sigs back to tf.int32

* Switch all dummies to tf.int32

* Adjust tests to check for tf.int32 instead of tf.int64

* Fix base dummy_inputs dtype

* Start casting to tf.int32 in input_processing

* Change dtype for unpack_inputs test

* Add proper tf.int32 test

* Make the alternate serving signature int64

de4159a3

09 Nov, 2022 1 commit

Generate: move generation_*.py src files into generation/*.py (#20096) · f270b960

Joao Gante authored Nov 09, 2022

* move generation_*.py src files into generation/*.py

* populate generation.__init__ with lazy loading

* move imports and references from generation.xxx.object to generation.object

f270b960

18 Oct, 2022 1 commit
- check decoder_inputs_embeds is None before shifting labels (#19671) · 3e07196f
  Arthur authored Oct 18, 2022
  
  3e07196f
11 Oct, 2022 1 commit

🚨 TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization... · 462cd641

Joao Gante authored Oct 11, 2022

🚨🚨🚨  TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization updated for encoder-decoder models) (#19263)

* added test

* correct embedding init

* some changes in blenderbot (incomplete)

* update blenderbot (diff to be used as reference)

* update blenderbot_small

* update LED

* update marian

* update T5 and remove TFWrappedEmbeddings

* nullcontext() -> ContextManagers()

* fix embedding init

462cd641

22 Sep, 2022 1 commit
- TF: check embeddings range (#19102) · 1b5ab39c
  Joao Gante authored Sep 22, 2022
  
  1b5ab39c
15 Sep, 2022 1 commit

Update serving signatures and make sure we actually use them (#19034) · 2322eb8e

Matt authored Sep 15, 2022

* Override save() to use the serving signature as the default

* Replace int32 with int64 in all our serving signatures

* Remember one very important line so as not to break every test at once

* Dtype fix for TFLED

* dtype fix for shift_tokens_right in general

* Dtype fixes in mBART and RAG

* Fix dtypes for test_unpack_inputs

* More dtype fixes

* Yet more mBART + RAG dtype fixes

* Yet more mBART + RAG dtype fixes

* Add a check that the model actually has a serving method

2322eb8e

14 Sep, 2022 2 commits
- TF: tf.debugging assertions without tf.running_eagerly() protection (#19030) · 31be02f1
  Joao Gante authored Sep 14, 2022
  
  31be02f1
- PyTorch >= 1.7.0 and TensorFlow >= 2.4.0 (#19016) · a2a3afbc
  Sylvain Gugger authored Sep 14, 2022
  
  a2a3afbc
12 Sep, 2022 1 commit
- Fix TF start docstrings (#18991) · cf450b77
  Matt authored Sep 12, 2022
```
* Update our TF 2.0 input format tip across all models

* make style
```
  cf450b77
07 Sep, 2022 1 commit
- TF: final bias as a layer in seq2seq models (replicate TFMarian fix) (#18903) · 0eabab09
  Joao Gante authored Sep 07, 2022
  
  0eabab09
03 Aug, 2022 1 commit

Fix torch version comparisons (#18460) · 02b176c4

LSinev authored Aug 03, 2022

Comparisons like
version.parse(torch.__version__) > version.parse("1.6")
are True for torch==1.6.0+cu101 or torch==1.6.0+cpu, because a local version suffix sorts after the plain release.

Comparisons against version.parse(version.parse(torch.__version__).base_version) are preferred (and available in pytorch_utils.py).
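
The pitfall described in this message can be reproduced without installing torch. The following is a minimal sketch, assuming the `packaging` library is available; the string `installed` is a hypothetical stand-in for `torch.__version__` on a CUDA build, and the variable names are illustrative rather than taken from the repository.

```python
from packaging import version

# Hypothetical stand-in for torch.__version__ on a CUDA build.
installed = "1.6.0+cu101"

# Naive comparison: the local segment "+cu101" makes 1.6.0+cu101 sort
# strictly after 1.6, so this is True even though the release is exactly 1.6.
naive = version.parse(installed) > version.parse("1.6")

# Comparison on the base version: the local segment is dropped first,
# leaving 1.6.0, which is equal to (not greater than) 1.6.
base = version.parse(version.parse(installed).base_version)
robust = base > version.parse("1.6")

print(naive, robust)
```

Under these assumptions the naive check reports "newer than 1.6" while the base-version check does not, which is the discrepancy the commit addresses.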

02b176c4

01 Aug, 2022 1 commit
- Add a check regarding the number of occurrences of ``` (#18389) · bd6d1b43
  Yih-Dar authored Aug 01, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  bd6d1b43
11 Jul, 2022 1 commit

Fix some typos. (#17560) · 95113d13

Yulv-git authored Jul 11, 2022



* Fix some typos.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* Fix typo.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* make fixup.

95113d13

01 Jul, 2022 1 commit

[Flax] Add remat (gradient checkpointing) (#17843) · 485bbe79

Sanchit Gandhi authored Jul 01, 2022

* [Flax] Add remat (gradient checkpointing)

* fix variable naming in test

* flip: checkpoint using a method

* fix naming

* fix class naming

* apply PVP's suggestions from code review

* make fix-copies

* fix big-bird, electra, roberta

* cookie-cutter

* fix flax big-bird

* move test to common

485bbe79

29 Jun, 2022 1 commit
- Add missing comment quotes (#17379) · b8142753
  Leon Derczynski authored Jun 29, 2022
  
  b8142753
20 Jun, 2022 2 commits

Do not use -1e4 as attn mask (#17306) · d3cb2888

Yih-Dar authored Jun 20, 2022



* Use torch.finfo(self.dtype).min

* for GPTNeoX

* for Albert

* For Splinter

* Update src/transformers/models/data2vec/modeling_data2vec_audio.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix -inf used in Bart-like models

* Fix a few remaining -inf

* more fix

* clean up

* For CLIP

* For FSMT

* clean up

* fix test

* Add dtype argument and use it for LayoutLMv3

* update FlaxLongT5Attention
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
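
One way to see why a fixed constant such as -1e4 can be too weak as an additive attention mask is sketched below in plain Python. This is an illustration under stated assumptions, not the code from the commit: `masked_softmax` is a hypothetical helper, the scores are deliberately extreme, and `-sys.float_info.max` stands in for the float64 value that `torch.finfo(dtype).min` would return.

```python
import math
import sys


def masked_softmax(scores, keep, mask_value):
    """Softmax over scores, adding mask_value to positions where keep is False."""
    masked = [s if k else s + mask_value for s, k in zip(scores, keep)]
    top = max(masked)  # subtract the maximum for numerical stability
    exps = [math.exp(m - top) for m in masked]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical scores: the first position is masked out but has a very large raw score.
scores = [20000.0, 1.0]
keep = [False, True]

# With -1e4 the masked score is still 10000.0 and dominates the softmax.
weak = masked_softmax(scores, keep, -1e4)

# With the most negative finite float the masked position receives zero weight.
strong = masked_softmax(scores, keep, -sys.float_info.max)

print(weak, strong)
```

With these inputs the -1e4 mask leaves all of the attention weight on the masked position, while the dtype-minimum mask moves it entirely to the kept position.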

d3cb2888

TF: BART compatible with XLA generation (#17479) · 132402d7
Joao Gante authored Jun 20, 2022
```
* Also propagate changes to blenderbot, blenderbot_small, marian, mbart, and pegasus
```
132402d7