Commits · c28d04e9e252a1a099944e325685f14d242ecdcd · chenpangpang / transformers

03 Oct, 2022 2 commits

Update no_trainer script for summarization (#19277) · c28d04e9

Divyanshu Kumar authored Oct 03, 2022

* Update no_trainer script for summarization

* removed unnecessary import

* fixes notation mistake

* removed: unused variable

c28d04e9

Restructure DETR post-processing, return prediction scores (#19262) · 36f52e95

Alara Dirik authored Oct 03, 2022

* Restructure DetrFeatureExtractor post-processing methods
* Update post_process_instance_segmentation and post_process_panoptic_segmentation methods to return prediction scores
* Update DETR models docs

36f52e95

30 Sep, 2022 12 commits

time series forecasting model (#17965) · 5cd16f01

Kashif Rasul authored Sep 30, 2022



* initial files

* initial model via cli

* typos

* make a start on the model config

* ready with configuation

* remove tokenizer ref.

* init the transformer

* added initial model forward to return dec_output

* require gluonts

* update dep. ver table and add as extra

* fixed typo

* add type for prediction_length

* use num_time_features

* use config

* more config

* typos

* opps another typo

* freq can be none

* default via transformation is 1

* initial transformations

* fix imports

* added transform_start_field

* add helper to create pytorch dataloader

* added inital val and test data loader

* added initial distr head and loss

* training working

* remove TimeSeriesTransformerTokenizer
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/__init__.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/__init__.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* fixed copyright

* removed docs

* remove time series tokenizer

* fixed docs

* fix text

* fix second

* fix default

* fix order

* use config directly

* undo change

* fix comment

* fix year

* fix import

* add additional arguments for training vs. test

* initial greedy inference loop

* fix inference

* comment out token inputs to enc dec

* Use HF encoder/decoder

* fix inference

* Use Seq2SeqTSModelOutput output

* return Seq2SeqTSPredictionOutput

* added default arguments

* fix return_dict true

* scale is a tensor

* output static_features for inference

* clean up some unused bits

* fixed typo

* set return_dict if none

* call model once for both train/predict

* use cache if future_target is none

* initial generate func

* generate arguments

* future_time_feat is required

* return SampleTSPredictionOutput

* removed unneeded classes

* fix when params is none

* fix return dict

* fix num_attention_heads

* fix arguments

* remove unused shift_tokens_right

* add different dropout configs

* implement FeatureEmbedder, Scaler and weighted_average

* remove gluonts dependency

* fix class names

* avoid _variable names

* remove gluonts dependency

* fix imports

* remove gluonts from configuration

* fix docs

* fixed typo

* move utils to examples

* add example requirements

* config has no freq

* initial run_ts_no_trainer

* remove from ignore

* fix output_attentions and removed unsued getters/setters

* removed unsed tests

* add dec seq len

* add test_attention_outputs

* set has_text_modality=False

* add config attribute_map

* make style

* make fix-copies

* add encoder_outputs to TimeSeriesTransformerForPrediction forward

* Improve docs, add model to README

* added test_forward_signature

* More improvements

* Add more copied from

* Fix README

* Fix remaining quality issues

* updated encoder and decoder

* fix generate

* output_hidden_states and use_cache are optional

* past key_values returned too

* initialize weights of distribution_output module

* fixed more tests

* update test_forward_signature

* fix return_dict outputs

* Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* removed commented out tests

* added neg. bin and normal output

* Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* move to one line

* Add docstrings

* Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* add try except for assert and raise

* try and raise exception

* fix the documentation formatting

* fix assert call

* fix docstring formatting

* removed input_ids from DOCSTRING

* Update input docstring

* Improve variable names

* Update order of inputs

* Improve configuration

* Improve variable names

* Improve docs

* Remove key_length from tests

* Add extra docs

* initial unittests

* added test_inference_no_head test

* added test_inference_head

* add test_seq_to_seq_generation

* make style

* one line

* assert mean prediction

* removed comments

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* fix order of args

* make past_observed_mask optional as well

* added Amazon license header

* updated utils with new fieldnames

* make style

* cleanup

* undo position of past_observed_mask

* fix import

* typo

* more typo

* rename example files

* remove example for now

* Update docs/source/en/_toctree.yml
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update modeling_time_series_transformer.py

fix style

* fixed typo

* fix typo and grammer

* fix style
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: NielsRogge <niels.rogge1@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

5cd16f01

Docs - Guide to add a new TensorFlow model (#19256) · cfb777f2

Joao Gante authored Sep 30, 2022


Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

cfb777f2

Fix cached lookup filepath on windows for hub (#19178) · 6a08162a

Keith Kjer authored Sep 30, 2022



* Update hub.py commit_hash extraction

Add safety mechanism for windows systems to unify logic (replace double backslashes with /)

* Fix string quotetype

* Aaaa circleci is messing with me.

* Switch to using as_posix() method from pathlib

* Update src/transformers/utils/hub.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/utils/hub.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

6a08162a

Fix Encoder-Decoder testing issue about repo. names (#19250) · f33858d1
Yih-Dar authored Sep 30, 2022
```
* Change "../gpt2" to "gpt2"
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
f33858d1

Add `beautifulsoup4` to the dependency list (#19253) · 2fba98e5

Yih-Dar authored Sep 30, 2022



* Add `beautifulsoup4` to extras["testing"]
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2fba98e5

Poc to use safetensors (#19175) · 3e2dd7f9

Sylvain Gugger authored Sep 30, 2022



* Poc to use safetensors

* Typo

* Final version

* Add tests

* Save with the right name!

* Update tests/test_modeling_common.py
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Support for sharded checkpoints

* Test from Hub part 1

* Test from hub part 2

* Fix regular checkpoint sharding

* Bump for fixes
Co-authored-by: Julien Chaumond <julien@huggingface.co>

3e2dd7f9

Add notebooks (#19259) · dad578e4
Jingya HUANG authored Sep 30, 2022

dad578e4
Add stop sequence to text generation pipeline (#18444) · e3963581
Karim Foda authored Sep 30, 2022

e3963581

Add expected output to the sample code for `ViTMSNForImageClassification` (#19183) · 582d085b

Sayak Paul authored Sep 30, 2022

* chore: add expected output to the sample code.

* add: imagenet-1k labels to the model config.

* chore: apply code formatting.

* chore: change the expected output.

582d085b

Rebase ESM PR and update all file formats (#19055) · 368b649a

Matt authored Sep 30, 2022



* Rebase ESM PR and update all file formats

* Fix test relative imports

* Add __init__.py to the test dir

* Disable gradient checkpointing

* Remove references to TFESM... FOR NOW >:|

* Remove completed TODOs from tests

* Convert docstrings to mdx, fix-copies from BERT

* fix-copies for the README and index

* Update ESM's __init__.py to the modern format

* Add to _toctree.yml

* Ensure we correctly copy the pad_token_id from the original ESM model

* Ensure we correctly copy the pad_token_id from the original ESM model

* Tiny grammar nitpicks

* Make the layer norm after embeddings an optional flag

* Make the layer norm after embeddings an optional flag

* Update the conversion script to handle other model classes

* Remove token_type_ids entirely, fix attention_masking and add checks to convert_esm.py

* Break the copied from link from BertModel.forward to remove token_type_ids

* Remove debug array saves

* Begin ESM-2 porting

* Add a hacky workaround for the precision issue in original repo

* Code cleanup

* Remove unused checkpoint conversion code

* Remove unused checkpoint conversion code

* Fix copyright notices

* Get rid of all references to the TF weights conversion

* Remove token_type_ids from the tests

* Fix test code

* Update src/transformers/__init__.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/__init__.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add credit

* Remove _ args and __ kwargs in rotary embedding

* Assertively remove asserts

* Replace einsum with torch.outer()

* Fix docstring formatting

* Remove assertions in tokenization

* Add paper citation to ESMModel docstring

* Move vocab list to single line

* Remove ESMLayer from init

* Add Facebook copyrights

* Clean up RotaryEmbedding docstring

* Fix docstring formatting

* Fix docstring for config object

* Add explanation for new config methods

* make fix-copies

* Rename all the ESM- classes to Esm-

* Update conversion script to allow pushing to hub

* Update tests to point at my repo for now

* Set config properly for tests

* Remove the gross hack that forced loss of precision in inv_freq and instead copy the data from the model being converted

* make fixup

* Update expected values for slow tests

* make fixup

* Remove EsmForCausalLM for now

* Remove EsmForCausalLM for now

* Fix padding idx test

* Updated README and docs with ESM-1b and ESM-2 separately (#19221)

* Updated README and docs with ESM-1b and ESM-2 separately

* Update READMEs, longer entry with 3 citations

* make fix-copies
Co-authored-by: Your Name <you@example.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Tom Sercu <tsercu@fb.com>
Co-authored-by: Your Name <you@example.com>

368b649a

Catch `HFValidationError` in `TrainingSummary` (#19252) · 4fd32a1f

Yih-Dar authored Sep 30, 2022



* Catch HfValidationError in TrainingSummary
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

4fd32a1f

Add MarkupLM (#19198) · f3d2f7a6

NielsRogge authored Sep 30, 2022



* First draft

* Make basic test work

* Fix most tokenizer tests

* More improvements

* Make more tests pass

* Fix more tests

* Fix some code quality

* Improve truncation

* Implement feature extractor

* Improve feature extractor and add tests

* Improve feature extractor tests

* Fix pair_input test partly

* Add fast tokenizer

* Improve implementation

* Fix rebase

* Fix rebase

* Fix most of the tokenizer tests.

* propose solution for fast

* add: integration test for fasttokenizer, warning for decode, fix template in slow tokenizer

* add: modify markuplmconverter

* add: some modify on converter and tokenizerfast

* Fix style, copies

* Make fixup

* Update tokenization_markuplm.py

* Update test_tokenization_markuplm.py

* Update markuplm related

* Improve processor, add integration test

* Add processor test file

* Improve processor

* Improve processor tests

* Fix more processor tests

* Fix processor tests

* Update docstrings

* Add Copied from statements

* Add more Copied from statements

* Add code examples

* Improve code examples

* Add model to doc tests

* Adding dependency check

* Add dummy file

* Add requires_backends

* Add model to toctree

* Fix more things, disable dependency check for now

* Apply more suggestions

* Add soft dependency

* Add annotators to tests

* Fix style

* Remove from_slow=True

* Remove print statements

* Add sanity check

* Fix processor test

* Fix processor tests, add more docs

* Add doc tests for mdx file

* Add more tips

* Apply suggestions
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: lockon-n <45759388+lockon-n@users.noreply.github.com>
Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
Co-authored-by: lockon-n <dd098309@126.com>

f3d2f7a6

29 Sep, 2022 17 commits

[Wav2Vec2] Fix None loss in doc examples (#19218) · 49d62b01

rbsteinm authored Sep 29, 2022

* pass sampled_negative_indices parameter to the model to avoid getting a None loss
* concerns doc examples for Wav2Vec2ForPreTraining and Wav2Vec2ConformerForPreTraining

49d62b01

Update Past CI report script (#19228) · 1a1893e5

Yih-Dar authored Sep 29, 2022



* Simplify the error report

* Add status placeholder

* Add job links
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

1a1893e5

Add job names in Past CI artifacts (#19235) · 163cd152
Yih-Dar authored Sep 29, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
163cd152
Skip pipeline tests (#19248) · f16bbf14
Sylvain Gugger authored Sep 29, 2022

f16bbf14

Cast TF generate() inputs (#19232) · cca6e6fe

Matt authored Sep 29, 2022



* Just stick a couple of casts into generate()

* Cast decoder_input_ids too

* Don't accidentally cast floats

* Move to _generate()

* Move to after input validation
Co-authored-by: Your Name <you@example.com>

cca6e6fe

Improve DETR post-processing methods (#19205) · 01eb34ab

Alara Dirik authored Sep 29, 2022

* Ensures consistent arguments and outputs with other post-processing methods
* Adds post_process_semantic_segmentation, post_process_instance_segmentation, post_process_panoptic_segmentation, post_process_object_detection methods to DetrFeatureExtractor
* Adds deprecation warnings to post_process, post_process_segmentation and post_process_panoptic

01eb34ab

Fix test fetching for examples (#19237) · 655f72a6

Sylvain Gugger authored Sep 29, 2022

* Fix test fetching for examples

* Fake example modif

* Debug statements

* Typo

* You need to persist the file...

* Revert change in example

* Remove debug statements

655f72a6

Fix TrainingArgs argument serialization (#19239) · b79028f0
atturaioe authored Sep 29, 2022

b79028f0
Use `hf_raise_for_status` instead of deprecated `_raise_for_status` (#19244) · 902d30b3
Lucain authored Sep 29, 2022
```
* Use  instead of  from huggingface_hub

* bump huggingface_hub to 0.10.0 + make deps_table_update
```
902d30b3

Fix opt softmax small nit (#19243) · 3a27ba3d

Younes Belkada authored Sep 29, 2022

* fix opt softmax nit

- Use the same logic as 1eb09537550734a783c194e416029cb9bc4cb119 for consistency

* Update src/transformers/models/opt/modeling_opt.py

3a27ba3d

Fix `m2m_100.mdx` doc example missing `labels` (#19149) · ba9e336f
mustapha ajeghrir authored Sep 29, 2022
```
The `labels` variable is not defined, the `model_inputs` already contain this information.
```
ba9e336f

[TensorFlow] Adding GroupViT (#18020) · 0dc7b3a7

Aritra Roy Gosthipaty authored Sep 29, 2022



* chore: initial commit

* chore: adding util methods

yet to work on the nn.functional.interpolate port with align_corener=True

* chore: refactor the utils

* used tf.compat.v1.image.resize to align the F.interpolate function
* added type hints to the method signatures
* added references to the gists where one 2 one alignment of torch and tf has been shown

* chore: adding the layers

* chore: porting all the layers from torch to tf

This is the initial draft, nothing is tested yet.

* chore: aligning the layers with reference to tf clip

* chore: aligning the modules

* added demaraction comments
* added copied and adapted from comments

* chore: aligning with CLIP

* chore: wrangling the layers to keep it tf compatible

* chore: aligning the names of the layers for porting

* chore: style changes

* chore: adding docs and inits

* chore: adding tfp dependencis

the code is taken from TAPAS

* chore: initial commit for testing

* chore: aligning the vision embeddings with the vit implementatino

* chore: changing model prefix

* chore: fixing the name of the model and the layer normalization test case

* chore: every test passes but the slow ones

* chore: fix style and integration test

* chore: moving comments below decorators

* chore: make fixup and fix-copies changes

* chore: adding the Vision and Text Model to check_repo

* chore: modifying the prefix name to align it with the torch implementation

* chore: fix typo in configuration

* choer: changing the name of the model variable

* chore: adding segmentation flag

* chore: gante's review

* chore: style refactor

* chore: amy review

* chore: adding shape_list to parts that have been copied from other snippets

* chore: init batchnorm with torch defaults

* chore: adding shape_list to pass the tests

* test fix: adding seed as 0

* set seed

* chore: changing the straight through trick to fix -ve dimensinos

* chore: adding a dimension to the loss

* chore: adding reviewers and contributors names to the docs

* chore: added changes after review

* chore: code quality fixup

* chore: fixing the segmentation snippet

* chore: adding  to the layer calls

* chore: changing int32 to int64 for inputs of serving

* chore: review changes

* chore: style changes

* chore: remove from_pt=True

* fix: repo consistency
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

0dc7b3a7

Add a getattr method, which replaces _module_getattr in torch.fx.Tracer from PyTorch 1.13+ (#19233) · bb6fa06f
Michael Benayoun authored Sep 29, 2022

bb6fa06f

XGLM - Fix Softmax NaNs when using FP16 (#18057) · 9d732fd2

Gabriele Sarti authored Sep 29, 2022



* fix fp16 for xglm

* Removed misleading comment

* Fix undefined variable
Co-authored-by: Gabriele Sarti <gsarti@amazon.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

9d732fd2

Fix confusing working directory in Push CI (#19234) · 99c32493
Yih-Dar authored Sep 29, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
99c32493

Focus doc around preprocessing classes (#18768) · 6957350c

Steven Liu authored Sep 28, 2022

* 📝 reframe docs around preprocessing classes

* small edits

* edits and review

* fix typo

* apply review

* clarify processor

6957350c

Move AutoClasses under Main Classes (#19163) · 990936a8
Steven Liu authored Sep 28, 2022
```
* move autoclasses to main classes

* keep auto.mdx in model_doc
```
990936a8

28 Sep, 2022 8 commits
- Fix seq2seq QA example · 0fc68a7e
  Sylvain Gugger authored Sep 28, 2022
  
  0fc68a7e
- Fix cache names in CircleCI jobs (#19223) · 64998a57
  Yih-Dar authored Sep 28, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  64998a57
- Fix trainer seq2seq qa.py evaluate log and ft script (#19208) · 4a0b958d
  Tatsuki Okada authored Sep 28, 2022
```
* fix args option

* fix trainer eval log

* fix out of memory qa script

* do isort, black, flake

* fix tokenize target

* take it back.

* fix: comment
```
  4a0b958d
- Document and validate typical_p in generation (#19128) · 9c6aeba3
  Nick Doiron authored Sep 28, 2022
```
* Document and validate typical_p in generation
```
  9c6aeba3
- Fix doctest for `TFDeiTForImageClassification` (#19173) · de359c45
  Yih-Dar authored Sep 28, 2022
```
* Fix doctest for TFDeiTForImageClassification

* Remove unnecessary tf.random.set_seed
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  de359c45
- Fix deprecation warning for return_all_scores (#19217) · 22d37a9d
  Gabriel Luiz Freitas Almeida authored Sep 28, 2022
```
* Improve deprecation warning for return_all_scores

* Fix formatting
```
  22d37a9d
- Generate: add warning when left padding should be used (#19067) · a357ed50
  Joao Gante authored Sep 28, 2022
```
* add warning when left padding should be used

* PT: check for pad token; FLAX: can only check while not tracing
```
  a357ed50
- Fix small use_cache typo in the docs (#19191) · 942fa8ce
  Ankur Goyal authored Sep 28, 2022
  
  942fa8ce
27 Sep, 2022 1 commit
- Added tests for yaml and json parser (#19219) · 2df60287
  IMvision12 authored Sep 28, 2022
```
* Added tests for yaml and json

* Added tests for yaml and json
```
  2df60287