1. 09 Mar, 2022 2 commits
  2. 08 Mar, 2022 4 commits
  3. 07 Mar, 2022 6 commits
  4. 04 Mar, 2022 6 commits
    • Move dependency to call method (#15941) · e8efaecb
      NielsRogge authored
      e8efaecb
    • Constrained Beam Search [*With* Disjunctive Decoding] (#15761) · 5c6f57ee
      Chan Woo Kim authored
      
      
      * added classes to get started with constrained beam search
      
      * in progress, think i can directly force tokens now but not yet with the round robin
      
      * think now i have total control, now need to code the bank selection
      
      * technically works as desired, need to optimize and fix design choices leading to undesirable outputs
      
      * complete PR #1 without disjunctive decoding
      
      * removed incorrect tests
      
      * Delete k.txt
      
      * Delete test.py
      
      * Delete test.sh
      
      * revert changes to test scripts
      
      * genutils
      
      * full implementation with testing, no disjunctive yet
      
      * shifted docs
      
      * passing all tests realistically ran locally
      
      * removing accidentally included print statements
      
      * fixed source of error in initial PR test
      
      * fixing the get_device() vs device trap
      
      * fixed documentation docstrings about constrained_beam_search
      
      * fixed tests failing for Speech2TextModel's floating point inputs
      
      * fix cuda long tensor
      
      * added examples and testing for them and found & fixed a bug in beam_search and constrained_beam_search
      
      * deleted accidentally added test halting code with assert False
      
      * code reformat
      
      * Update tests/test_generation_utils.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update tests/test_generation_utils.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update tests/test_generation_utils.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update tests/test_generation_utils.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update tests/test_generation_utils.py
      
      * fixing based on comments on PR
      
      * took out the testing code that should work but fails without the beam search modification; style changes
      
      * fixing comments issues
      
      * docstrings for ConstraintListState
      
      * typo in PhrasalConstraint docstring
      
      * docstrings improvements
      
      * finished adding what is sort of an opinionated implementation of disjunctive generation, but it revealed errors in inner beam search logic during testing.
      
      * fixed bug found in constrained beam search that used beam_idx that were not global across all the batches
      
      * disjunctive constraint working 100% correctly
      
      * passing all tests
      
      * Accidentally included mlruns
      
      * Update src/transformers/generation_beam_constraints.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/generation_beam_constraints.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * complete overhaul of type complexities and other nits
      
      * strict type checks in generate()
      
      * fixing second round of feedback by narsil
      
      * fixed failing generation test because of type check overhaul
      
      * generation test fail fix
      
      * fixing test fails
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      5c6f57ee
    • Do not change the output from tuple to list - to match PT's version (#15918) · f0aacc14
      Yih-Dar authored
      
      
      * Do not change the output from tuple to list - to match PT's version
      
      * Fix the same issues for 5 other models and the template
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      f0aacc14
    • Add missing support for Flax XLM-RoBERTa (#15900) · 01485cee
      Javier de la Rosa authored
      
      
      * Adding Flax XLM-RoBERTa
      
      * Add Flax to __init__
      
      * Adding doc and dummy objects
      
      * Add tests
      
      * Add Flax XLM-R models autodoc
      
      * Fix tests
      
      * Add Flax XLM-RoBERTa to TEST_FILES_WITH_NO_COMMON_TESTS
      
      * Update src/transformers/models/xlm_roberta/modeling_flax_xlm_roberta.py
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
      
      * Update tests/xlm_roberta/test_modeling_flax_xlm_roberta.py
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
      
      * Update tests/xlm_roberta/test_modeling_flax_xlm_roberta.py
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
      
      * Remove test on large Flax XLM-RoBERTa
      
      * Add tokenizer to the test
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
      01485cee
    • Making MaskFormerForInstanceSegmentation. (#15934) · 89c7d9cf
      Nicolas Patry authored
      Small adjustments.
      
      Adding in type hint.
      
      Last fix ?
      
      Only include the default dict thing, not the pipelines.
      89c7d9cf
    • Support CLIPTokenizerFast for CLIPProcessor (#15913) · 6b104c5b
      ParkSangJun authored
      * Fix to support fast tokenizer with `CLIPProcessor`
      
      * Update CLIPProcessor test for fast tokenizer
      
      * Fix Docstring Style
      
      * Rename into meaningful Variable name in test code
      6b104c5b
  5. 03 Mar, 2022 7 commits
  6. 02 Mar, 2022 7 commits
    • Fix SegformerForImageClassification (#15895) · 89be34c3
      NielsRogge authored
      
      
      * Fix reshape
      
      * Apply suggestion from code review
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      89be34c3
    • TF generate refactor - Sample (#15793) · baab5e7c
      Joao Gante authored
      
      
      * Add TF logits wrappers 
      
      * Add sample method
      
      * add tests for TF logit wrappers
      
      * TF generate sample tests now run on CPU
      Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
      baab5e7c
    • [SegFormer] Add deprecation warning (#15889) · 96ae92be
      NielsRogge authored
      
      
      * Add deprecation warning
      
      * Remove from docs and hide in kwargs
      
      * Improve implementation
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      96ae92be
    • Maskformer (#15682) · d83d22f5
      Francesco Saverio Zuppichini authored
      
      
      * maskformer
      
      * conflicts
      
      * conflicts
      
      * minor fixes
      
      * feature extractor test fix
      
      refactor MaskFormerLoss following conversation
      
      MaskFormer related types should not trigger a module time import error
      
      missed one
      
      removed all the types that are not used
      
      update config mapping
      
      minor updates in the doc
      
      resolved conversation that doesn't need a discussion
      
      minor changes
      
      resolved conversations
      
      fixed DetrDecoder
      
      * minor changes
      
      minor changes
      
      fixed mdx file
      
      test feature_extractor return types
      
      functional losses -> classes
      
      removed the return type test for the feature extractor
      
      minor changes + style + quality
      
      * conflicts?
      
      * rebase master
      
      * readme
      
      * added missing files
      
      * deleted poolformer tests that were in the wrong place
      
      * CI
      
      * minor changes
      
      * Apply suggestions from code review
      Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
      
      * resolved conversations
      
      * minor changes
      
      * conversations
      
      [Unispeech] Fix slow tests (#15818)
      
      * remove soundfile old way of loading audio
      
      * Adapt slow test
      
      [Barthez Tokenizer] Fix saving (#15815)
      
      [TFXLNet] Correct tf xlnet generate (#15822)
      
      * [TFXLNet] Correct tf xlnet
      
      * adapt test comment
      
      Fix the push run (#15807)
      
      Fix semantic segmentation pipeline test (#15826)
      
      Fix dummy_inputs() to dummy_inputs in symbolic_trace doc (#15776)
      
      Add model specific output classes to PoolFormer model docs (#15746)
      
      * Added model specific output classes to poolformer docs
      
      * Fixed Segformer typo in Poolformer docs
      
      Adding the option to return_timestamps on pure CTC ASR models. (#15792)
      
      * Adding the option to return_timestamps on pure CTC ASR models.
      
      * Remove `math.prod` which was introduced in Python 3.8
      
      * int are not floats.
      
      * Reworking the PR to support "char" vs "word" output.
      
      * Fixup!
      
      * Update src/transformers/pipelines/automatic_speech_recognition.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/pipelines/automatic_speech_recognition.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/pipelines/automatic_speech_recognition.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/pipelines/automatic_speech_recognition.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/pipelines/automatic_speech_recognition.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/pipelines/automatic_speech_recognition.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/pipelines/automatic_speech_recognition.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/pipelines/automatic_speech_recognition.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/pipelines/automatic_speech_recognition.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Quality.
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      HFTracer.trace should use/return self.graph to be compatible with torch.fx.Tracer (#15824)
      
      Fix tf.concatenate + test past_key_values for TF models (#15774)
      
      * fix wrong method name tf.concatenate
      
      * add tests related to causal LM / decoder
      
      * make style and quality
      
      * clean-up
      
      * Fix TFBertModel's extended_attention_mask when past_key_values is provided
      
      * Fix tests
      
      * fix copies
      
      * More tf.int8 -> tf.int32 in TF test template
      
      * clean-up
      
      * Update TF test template
      
      * revert the previous commit + update the TF test template
      
      * Fix TF template extended_attention_mask when past_key_values is provided
      
      * Fix some styles manually
      
      * clean-up
      
      * Fix ValueError: too many values to unpack in the test
      
      * Fix more: too many values to unpack in the test
      
      * Add a comment for extended_attention_mask when there is past_key_values
      
      * Fix TFElectra extended_attention_mask when past_key_values is provided
      
      * Add tests to other TF models
      
      * Fix for TF Electra test: add prepare_config_and_inputs_for_decoder
      
      * Fix not passing training arg to lm_head in TFRobertaForCausalLM
      
      * Fix tests (with past) for TF Roberta
      
      * add testing for past_key_values for TFElectra model
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      
      [examples/summarization and translation] fix readme (#15833)
      
      Add ONNX Runtime quantization for text classification notebook (#15817)
      
      Re-enable doctests for the quicktour (#15828)
      
      * Re-enable doctests for the quicktour
      
      * Re-enable doctests for task_summary (#15830)
      
      * Remove &
      
      Framework split model report (#15825)
      
      Add TFConvNextModel (#15750)
      
      * feat: initial implementation of convnext in tensorflow.
      
      * fix: sample code for the classification model.
      
      * chore: added checked for  from the classification model.
      
      * chore: set bias initializer in the classification head.
      
      * chore: updated license terms.
      
      * chore: removed unused imports
      
      * feat: enabled  argument during using drop_path.
      
      * chore: replaced tf.identity with layers.Activation(linear).
      
      * chore: edited default checkpoint.
      
      * fix: minor bugs in the initializations.
      
      * partial-fix: tf model errors for loading pretrained pt weights.
      
      * partial-fix: call method updated
      
      * partial-fix: cross loading of weights (4x3 variables to be matched)
      
      * chore: removed unneeded comment.
      
      * removed playground.py
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * fix: renaming TFConvNextStage conv and layer norm layers
      
      * chore: added initializers and other minor additions.
      
      * chore: added initializers and other minor additions.
      
      * add: tests for convnext.
      
      * fix: integration tester class.
      
      * fix: issues mentioned in pr feedback (round 1).
      
      * fix: how output_hidden_states arg is propagated inside the network.
      
      * feat: handling of  arg for pure cnn models.
      
      * chore: added a note on equal contribution in model docs.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * feat: encapsulation for the convnext trunk.
      
      * Fix variable naming; Test-related corrections; Run make fixup
      
      * chore: added Joao as a contributor to convnext.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * chore: corrected copyright year and added comment on NHWC.
      
      * chore: fixed the black version and ran formatting.
      
      * chore: ran make style.
      
      * chore: removed from_pt argument from test, ran make style.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * fix: tests in the convnext subclass, ran make style.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * chore: moved convnext test to the correct location
      
      * fix: locations for the test file of convnext.
      
      * fix: convnext tests.
      
      * chore: applied  sgugger's suggestion for dealing w/ output_attentions.
      
      * chore: added comments.
      
      * chore: applied updated quality environment style.
      
      * chore: applied formatting with quality environment.
      
      * chore: revert to the previous tests/test_modeling_common.py.
      
      * chore: revert to the original test_modeling_common.py
      
      * chore: revert to previous states for test_modeling_tf_common.py and modeling_tf_utils.py
      
      * fix: tests for convnext.
      
      * chore: removed output_attentions argument from convnext config.
      
      * chore: revert to the earlier tf utils.
      
      * fix: output shapes of the hidden states
      
      * chore: removed unnecessary comment
      
      * chore: reverting to the right test_modeling_tf_common.py.
      
      * Styling nits
      Co-authored-by: ariG23498 <aritra.born2fly@gmail.com>
      Co-authored-by: Joao Gante <joao@huggingface.co>
      Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>
      
      * minor changes
      
      * doc fix in feature extractor
      
      * doc
      
      * typos
      
      * removed detr logic from config
      
      * removed detr logic from config
      
      * removed num_labels
      
      * small fix in the config
      
      * auxilary -> auxiliary
      
      * make style
      
      * some test is failing
      
      * fix a weird char in config preventing doc-builder
      
      * retry to fix the doc-builder issue
      
      * make style
      
      * new try to fix the doc builder
      
      * CI
      
      * change weights to facebook
      Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
      Co-authored-by: ariG23498 <aritra.born2fly@gmail.com>
      Co-authored-by: Joao Gante <joao@huggingface.co>
      Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>
      d83d22f5
    • Updates in Trainer to support new features in SM Model Parallel library (#15877) · 2eb7bb15
      Rahul Huilgol authored
      
      
      * Create optimizer after model creation for SMP
      
      * update dp_rank to rdp_rank for opt_state_dict
      
      * update world_size and process_index for smp
      
      * Address comments
      
      * Lint fix
      Co-authored-by: Cavdar <dcavdar@a07817b12d7e.ant.amazon.com>
      2eb7bb15
    • Adding timestamps for CTC with LM in ASR pipeline. (#15863) · 6e57a569
      Nicolas Patry authored
      * Adding timestamps for CTC with LM in ASR pipeline.
      
      * Remove print.
      
      * Nit change.
      6e57a569
    • M2M100 support for ONNX export (#15193) · 4bfe75bd
      Michael Benayoun authored
      * Add M2M100 support for ONNX export
      
      * Delete useless imports
      
      * Add M2M100 to tests
      
      * Fix protobuf issue
      4bfe75bd
  7. 01 Mar, 2022 4 commits
  8. 28 Feb, 2022 3 commits
  9. 25 Feb, 2022 1 commit
    • Add TFConvNextModel (#15750) · 84eaa6ac
      Sayak Paul authored
      
      
      * feat: initial implementation of convnext in tensorflow.
      
      * fix: sample code for the classification model.
      
      * chore: added checked for  from the classification model.
      
      * chore: set bias initializer in the classification head.
      
      * chore: updated license terms.
      
      * chore: removed unused imports
      
      * feat: enabled  argument during using drop_path.
      
      * chore: replaced tf.identity with layers.Activation(linear).
      
      * chore: edited default checkpoint.
      
      * fix: minor bugs in the initializations.
      
      * partial-fix: tf model errors for loading pretrained pt weights.
      
      * partial-fix: call method updated
      
      * partial-fix: cross loading of weights (4x3 variables to be matched)
      
      * chore: removed unneeded comment.
      
      * removed playground.py
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * fix: renaming TFConvNextStage conv and layer norm layers
      
      * chore: added initializers and other minor additions.
      
      * chore: added initializers and other minor additions.
      
      * add: tests for convnext.
      
      * fix: integration tester class.
      
      * fix: issues mentioned in pr feedback (round 1).
      
      * fix: how output_hidden_states arg is propagated inside the network.
      
      * feat: handling of  arg for pure cnn models.
      
      * chore: added a note on equal contribution in model docs.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * feat: encapsulation for the convnext trunk.
      
      * Fix variable naming; Test-related corrections; Run make fixup
      
      * chore: added Joao as a contributor to convnext.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * chore: corrected copyright year and added comment on NHWC.
      
      * chore: fixed the black version and ran formatting.
      
      * chore: ran make style.
      
      * chore: removed from_pt argument from test, ran make style.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * fix: tests in the convnext subclass, ran make style.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * rebasing
      
      * rebasing and removing playground.py.
      
      * chore: moved convnext test to the correct location
      
      * fix: locations for the test file of convnext.
      
      * fix: convnext tests.
      
      * chore: applied  sgugger's suggestion for dealing w/ output_attentions.
      
      * chore: added comments.
      
      * chore: applied updated quality environment style.
      
      * chore: applied formatting with quality environment.
      
      * chore: revert to the previous tests/test_modeling_common.py.
      
      * chore: revert to the original test_modeling_common.py
      
      * chore: revert to previous states for test_modeling_tf_common.py and modeling_tf_utils.py
      
      * fix: tests for convnext.
      
      * chore: removed output_attentions argument from convnext config.
      
      * chore: revert to the earlier tf utils.
      
      * fix: output shapes of the hidden states
      
      * chore: removed unnecessary comment
      
      * chore: reverting to the right test_modeling_tf_common.py.
      
      * Styling nits
      Co-authored-by: ariG23498 <aritra.born2fly@gmail.com>
      Co-authored-by: Joao Gante <joao@huggingface.co>
      Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>
      84eaa6ac