• Francesco Saverio Zuppichini's avatar
    Maskformer (#15682) · d83d22f5
    Francesco Saverio Zuppichini authored
    
    
    * maskformer
    
    * conflicts
    
    * conflicts
    
    * minor fixes
    
    * feature extractor test fix
    
    refactor MaskFormerLoss following conversation
    
    MaskFormer related types should not trigger a module time import error
    
    missed one
    
    removed all the types that are not used
    
    update config mapping
    
    minor updates in the doc
    
    resolved conversation that doesn't need a discussion
    
    minor changes
    
    resolved conversations
    
    fixed DetrDecoder
    
    * minor changes
    
    minor changes
    
    fixed mdx file
    
    test feature_extractor return types
    
    functional losses -> classes
    
    removed the return type test for the feature extractor
    
    minor changes + style + quality
    
    * conflicts?
    
    * rebase master
    
    * readme
    
    * added missing files
    
    * deleded poolformers test that where in the wrong palce
    
    * CI
    
    * minor changes
    
    * Apply suggestions from code review
    Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
    
    * resolved conversations
    
    * minor changes
    
    * conversations
    
    [Unispeech] Fix slow tests (#15818)
    
    * remove soundfile old way of loading audio
    
    * Adapt slow test
    
    [Barthez Tokenizer] Fix saving (#15815)
    
    [TFXLNet] Correct tf xlnet generate (#15822)
    
    * [TFXLNet] Correct tf xlnet
    
    * adapt test comment
    
    Fix the push run (#15807)
    
    Fix semantic segmentation pipeline test (#15826)
    
    Fix dummy_inputs() to dummy_inputs in symbolic_trace doc (#15776)
    
    Add model specific output classes to PoolFormer model docs (#15746)
    
    * Added model specific output classes to poolformer docs
    
    * Fixed Segformer typo in Poolformer docs
    
    Adding the option to return_timestamps on pure CTC ASR models. (#15792)
    
    * Adding the option to return_timestamps on pure CTC ASR models.
    
    * Remove `math.prod` which was introduced in Python 3.8
    
    * int are not floats.
    
    * Reworking the PR to support "char" vs "word" output.
    
    * Fixup!
    
    * Update src/transformers/pipelines/automatic_speech_recognition.py
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Update src/transformers/pipelines/automatic_speech_recognition.py
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Update src/transformers/pipelines/automatic_speech_recognition.py
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Update src/transformers/pipelines/automatic_speech_recognition.py
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Update src/transformers/pipelines/automatic_speech_recognition.py
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Update src/transformers/pipelines/automatic_speech_recognition.py
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Update src/transformers/pipelines/automatic_speech_recognition.py
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Update src/transformers/pipelines/automatic_speech_recognition.py
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Update src/transformers/pipelines/automatic_speech_recognition.py
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Quality.
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    HFTracer.trace should use/return self.graph to be compatible with torch.fx.Tracer (#15824)
    
    Fix tf.concatenate + test past_key_values for TF models (#15774)
    
    * fix wrong method name tf.concatenate
    
    * add tests related to causal LM / decoder
    
    * make style and quality
    
    * clean-up
    
    * Fix TFBertModel's extended_attention_mask when past_key_values is provided
    
    * Fix tests
    
    * fix copies
    
    * More tf.int8 -> tf.int32 in TF test template
    
    * clean-up
    
    * Update TF test template
    
    * revert the previous commit + update the TF test template
    
    * Fix TF template extended_attention_mask when past_key_values is provided
    
    * Fix some styles manually
    
    * clean-up
    
    * Fix ValueError: too many values to unpack in the test
    
    * Fix more: too many values to unpack in the test
    
    * Add a comment for extended_attention_mask when there is past_key_values
    
    * Fix TFElectra extended_attention_mask when past_key_values is provided
    
    * Add tests to other TF models
    
    * Fix for TF Electra test: add prepare_config_and_inputs_for_decoder
    
    * Fix not passing training arg to lm_head in TFRobertaForCausalLM
    
    * Fix tests (with past) for TF Roberta
    
    * add testing for pask_key_values for TFElectra model
    Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
    
    [examples/summarization and translation] fix readme (#15833)
    
    Add ONNX Runtime quantization for text classification notebook (#15817)
    
    Re-enable doctests for the quicktour (#15828)
    
    * Re-enable doctests for the quicktour
    
    * Re-enable doctests for task_summary (#15830)
    
    * Remove &
    
    Framework split model report (#15825)
    
    Add TFConvNextModel (#15750)
    
    * feat: initial implementation of convnext in tensorflow.
    
    * fix: sample code for the classification model.
    
    * chore: added checked for  from the classification model.
    
    * chore: set bias initializer in the classification head.
    
    * chore: updated license terms.
    
    * chore: removed ununsed imports
    
    * feat: enabled  argument during using drop_path.
    
    * chore: replaced tf.identity with layers.Activation(linear).
    
    * chore: edited default checkpoint.
    
    * fix: minor bugs in the initializations.
    
    * partial-fix: tf model errors for loading pretrained pt weights.
    
    * partial-fix: call method updated
    
    * partial-fix: cross loading of weights (4x3 variables to be matched)
    
    * chore: removed unneeded comment.
    
    * removed playground.py
    
    * rebasing
    
    * rebasing and removing playground.py.
    
    * fix: renaming TFConvNextStage conv and layer norm layers
    
    * chore: added initializers and other minor additions.
    
    * chore: added initializers and other minor additions.
    
    * add: tests for convnext.
    
    * fix: integration tester class.
    
    * fix: issues mentioned in pr feedback (round 1).
    
    * fix: how output_hidden_states arg is propoagated inside the network.
    
    * feat: handling of  arg for pure cnn models.
    
    * chore: added a note on equal contribution in model docs.
    
    * rebasing
    
    * rebasing and removing playground.py.
    
    * feat: encapsulation for the convnext trunk.
    
    * Fix variable naming; Test-related corrections; Run make fixup
    
    * chore: added Joao as a contributor to convnext.
    
    * rebasing
    
    * rebasing and removing playground.py.
    
    * rebasing
    
    * rebasing and removing playground.py.
    
    * chore: corrected copyright year and added comment on NHWC.
    
    * chore: fixed the black version and ran formatting.
    
    * chore: ran make style.
    
    * chore: removed from_pt argument from test, ran make style.
    
    * rebasing
    
    * rebasing and removing playground.py.
    
    * rebasing
    
    * rebasing and removing playground.py.
    
    * fix: tests in the convnext subclass, ran make style.
    
    * rebasing
    
    * rebasing and removing playground.py.
    
    * rebasing
    
    * rebasing and removing playground.py.
    
    * chore: moved convnext test to the correct location
    
    * fix: locations for the test file of convnext.
    
    * fix: convnext tests.
    
    * chore: applied  sgugger's suggestion for dealing w/ output_attentions.
    
    * chore: added comments.
    
    * chore: applied updated quality enviornment style.
    
    * chore: applied formatting with quality enviornment.
    
    * chore: revert to the previous tests/test_modeling_common.py.
    
    * chore: revert to the original test_modeling_common.py
    
    * chore: revert to previous states for test_modeling_tf_common.py and modeling_tf_utils.py
    
    * fix: tests for convnext.
    
    * chore: removed output_attentions argument from convnext config.
    
    * chore: revert to the earlier tf utils.
    
    * fix: output shapes of the hidden states
    
    * chore: removed unnecessary comment
    
    * chore: reverting to the right test_modeling_tf_common.py.
    
    * Styling nits
    Co-authored-by: default avatarariG23498 <aritra.born2fly@gmail.com>
    Co-authored-by: default avatarJoao Gante <joao@huggingface.co>
    Co-authored-by: default avatarSylvain Gugger <Sylvain.gugger@gmail.com>
    
    * minor changes
    
    * doc fix in feature extractor
    
    * doc
    
    * typose
    
    * removed detr logic from config
    
    * removed detr logic from config
    
    * removed num_labels
    
    * small fix in the config
    
    * auxilary -> auxiliary
    
    * make style
    
    * some test is failing
    
    * fix a weird char in config prevending doc-builder
    
    * retry to fix the doc-builder issue
    
    * make style
    
    * new try to fix the doc builder
    
    * CI
    
    * change weights to facebook
    Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
    Co-authored-by: default avatarariG23498 <aritra.born2fly@gmail.com>
    Co-authored-by: default avatarJoao Gante <joao@huggingface.co>
    Co-authored-by: default avatarSylvain Gugger <Sylvain.gugger@gmail.com>
    d83d22f5
README.md 51.6 KB