1. 01 Mar, 2023 5 commits
  2. 28 Feb, 2023 9 commits
    • Sylvain Gugger's avatar
      Fix flaky test for log level (#21776) · b29e2dca
      Sylvain Gugger authored
      * Fix flaky test for log level
      
      * Fix other flaky test
      b29e2dca
    • Matt's avatar
      Improve TF weight loading, especially PT crossloading (#21792) · acfb714b
      Matt authored
      * First commit for the improved PT-TF weight loading
      
      * Remove workarounds from TFEncoderDecoder tests
      
      * Allow a custom weight renaming function in from_pretrained and use that to clean up EncoderDecoder
      
      * make fixup
      
      * First attempt at visionencoderdecoder
      
      * Disable tensorfloat32 in tests to get consistent outputs
      
      * Quick fix to tf_vision_encoder_decoder tests
      
      * make fixup
      
      * Update Blenderbot tests
      
      * Remove unused arg in modeling_tf_opt
      
      * load_tf_sharded_weights had strict=True! This meant transfer learning was impossible, so I'm setting it to False.
      
      * Support prefixes when loading sharded TF checkpoints
      
      * make fixup
      
      * Add test to load sharded models with a weight prefix
      
      * Fix sharded weight loading test
      
      * Add a test for transfer from a sharded checkpoint
      
      * make fixup
      
      * Add test to check that crossloading from PT with a prefix works
      
      * Refactor from_pretrained in the encoderdecoder classes
      
      * Refactor from_pretrained in the encoderdecoder classes
      
      * missmatched -> mismatched
      
      * Explicitly check for None
      
      * No comments showing my very impressive and attractive knowledge of Py3.9+
      
      * Disable TF32 across all TF tests
      acfb714b
    • Yih-Dar's avatar
      🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516) · 871c31a6
      Yih-Dar authored
      
      
      * Add PipelineTesterMixin
      
      * remove class PipelineTestCaseMeta
      
      * move validate_test_components
      
      * Add for ViT
      
      * Add to SPECIAL_MODULE_TO_TEST_MAP
      
      * style and quality
      
      * Add feature-extraction
      
      * update
      
      * raise instead of skip
      
      * add tiny_model_summary.json
      
      * more explicit
      
      * skip tasks not in mapping
      
      * add availability check
      
      * Add Copyright
      
      * A way to diable irrelevant tests
      
      * update with main
      
      * remove disable_irrelevant_tests
      
      * skip tests
      
      * better skip message
      
      * better skip message
      
      * Add all pipeline task tests
      
      * revert
      
      * Import PipelineTesterMixin
      
      * subclass test classes with PipelineTesterMixin
      
      * Add pipieline_model_mapping
      
      * Fix import after adding pipieline_model_mapping
      
      * Fix style and quality after adding pipieline_model_mapping
      
      * Fix one more import after adding pipieline_model_mapping
      
      * Fix style and quality after adding pipieline_model_mapping
      
      * Fix test issues
      
      * Fix import requirements
      
      * Fix mapping for MobileViTModelTest
      
      * Update
      
      * Better skip message
      
      * pipieline_model_mapping could not be None
      
      * Remove some PipelineTesterMixin
      
      * Fix typo
      
      * revert tests_fetcher.py
      
      * update
      
      * rename
      
      * revert
      
      * Remove PipelineTestCaseMeta from ZeroShotAudioClassificationPipelineTests
      
      * style and quality
      
      * test fetcher for all pipeline/model tests
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      871c31a6
    • Anahita Bhiwandiwalla's avatar
      Add loss for BridgeTowerForMaskedLM and BridgeTowerForImageAndTextRetrieval (#21684) · 4cb5ffa9
      Anahita Bhiwandiwalla authored
      
      
      * Add loss for BridgeTowerForMaskedLM and BridgeTowerForImageAndTextRetrieval
      
      * minor fix return_dict
      
      * implement test for loss computation
      
      ---------
      Co-authored-by: default avatarTiep Le <97980157+tileintel@users.noreply.github.com>
      Co-authored-by: default avatarTiep Le <tiep.le@intel.com>
      4cb5ffa9
    • Younes Belkada's avatar
      [`Blip2`] Fix Blip-2 multi gpu (#21707) · 7f4f8b97
      Younes Belkada authored
      
      
      * fix blip multi gpu
      
      * fix
      
      * final changes
      
      * adapt suggestions
      
      * fix failing slow test
      
      * forward contrib credits from testing and suggestions
      
      * reformat
      
      ---------
      Co-authored-by: default avatarakkikiki <akkikiki@users.noreply.github.com>
      7f4f8b97
    • raghavanone's avatar
      Fix the issue of blip model returning loss even when the label is not provided. (#21811) · eec76042
      raghavanone authored
      * Fix the issue of blip model returning loss even when the label is not provoided
      
      * Fix ruff failure
      
      * Incorporate PR feedbacks
      
      * Incorporate PR feedbacks
      
      * Incorporate PR feedbacks
      
      * Incorporate PR feedbacks
      eec76042
    • Younes Belkada's avatar
      [`Blip2`] Add `Blip2Model` (#21817) · b8de7e44
      Younes Belkada authored
      * add v1
      
      * add `Blip2Model`
      
      - add relevant functions
      - add tests
      - add on automapping
      
      * fix docs
      
      * fix doctest
      b8de7e44
    • Younes Belkada's avatar
      [`T5`] Fix torchquant issue (#21843) · ae9230af
      Younes Belkada authored
      * fix torchquant issue
      
      * add tests
      ae9230af
    • Yih-Dar's avatar
      Rename `MobileViTModelTest` to `TFMobileViTModelTest` (#21825) · a9dd1243
      Yih-Dar authored
      Let's give TF a bit more love 🙏
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      a9dd1243
  3. 27 Feb, 2023 4 commits
  4. 24 Feb, 2023 6 commits
  5. 23 Feb, 2023 4 commits
  6. 22 Feb, 2023 5 commits
  7. 21 Feb, 2023 2 commits
  8. 20 Feb, 2023 5 commits
    • Alara Dirik's avatar
      Add EfficientNet (#21563) · 49ab1623
      Alara Dirik authored
      * Add EfficientNet to transformers
      49ab1623
    • Younes Belkada's avatar
      [`bnb`] fix `bnb` decoders bug (#21688) · c9a06714
      Younes Belkada authored
      * fix `bnb` decoders bug
      
      * make fixup
      c9a06714
    • tanreinama's avatar
      add GPTSAN model (reopen) (#21291) · f56174ac
      tanreinama authored
      * add GPTSAN-Japanese
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN (update for review)
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * fix typo in comment text
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * fix document and comments
      
      * fix class name GPTSAN->GPTSan
      
      * fix import and test for tokenizer
      f56174ac
    • Sylvain Gugger's avatar
      Fix quality · c87bbe1f
      Sylvain Gugger authored
      c87bbe1f
    • Andy Ehrenberg's avatar
      add flax whisper implementation (#20479) · 2840272c
      Andy Ehrenberg authored
      
      
      * add flax whisper implementation
      
      * rever change to setup
      
      * remove unused imports
      
      * revert generation changes
      
      * flax whisper docs
      
      * docs
      
      * import order
      
      * import sorting
      
      * isort
      
      * add dummy objects
      
      * doc formatting
      
      * formatting
      
      * remove trailing whitespaces
      
      * fix flax whisper docs
      
      * add generation logic to unlock flax whisper
      
      * remove scans
      
      * give credits to Flax Bart implementation
      
      * remove unused imports
      
      * add license
      
      * remove assert
      
      * more credits to Bart
      
      * fix style
      
      * formatting
      
      * support left padding
      
      * add flax whisper generation test
      
      * remove copied from comments whenever not a full copy
      
      * fix docstrings for logits processors
      
      * revert change to FlaxForceTokensLogitsProcessor
      
      * revert doc changes
      
      * improve generation docs
      
      * reorganize
      
      * formatting
      
      * cleanup docs
      
      * add tests
      
      * handle empty list case
      
      * fix forced decoder ids in flax tests
      
      * add flax whisper to inits
      
      * upate dummy objects
      
      * docs for FlaxAutoModelForSpeechSeq2Seq
      
      * fix decoder_position_ids computation in pretrained model decode/__call__ fns
      
      * add Copied from statements as necessary
      
      * compute position_ids only in __call__ and decode methods of pretrained model subclasses
      
      * improve readabilityof compute positional embeddings
      
      * check dimensionality of input_features instead of hidden_states
      
      * copied from statement for init_cache
      
      * formatting
      
      * fix copies
      
      * fix copies
      
      * pass attention mask to encoder layers
      
      * fix decoder module outputs
      
      * set dtype
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * smaller flax model for whisper test
      
      * Update src/transformers/generation/flax_utils.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/whisper/modeling_flax_whisper.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update tests/models/whisper/test_modeling_flax_whisper.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * cleanup
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/whisper/modeling_flax_whisper.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * bias cleanup
      
      * doc fix
      
      * align style for force tokens processor
      
      * readability
      
      * fix input shape in tests
      
      * revert FlaxGenerationMixin docstring
      
      * formatting
      
      * fix tests
      
      * fix imports
      
      * consistent encoder hidden states
      
      * consistent hidden states
      
      * input shapes
      
      * typo
      
      * partial class trick
      
      * partial class for input shape
      
      * base_class with correct input shape
      
      * partial base classes
      
      * match by name
      
      * set main_input_name
      
      * compare on names
      
      * formatting
      
      * remove unused import
      
      * safer position ids computation
      
      * safer position id computation
      
      * Update src/transformers/models/whisper/modeling_flax_whisper.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * Update src/transformers/models/whisper/modeling_flax_whisper.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * remove identical inherited tests
      
      * fix prompt ids in tests
      
      * use generation config
      
      * use jnp array
      
      * better var names
      
      * more explicit bias use
      
      * import transformers
      
      * formatting
      
      * test formatting
      
      * remove unused imports
      
      * remove unused imports
      
      * formatting
      
      * isort
      
      * docs
      
      * fix ln orders for encoder hidden states
      
      * whisper unique generation stuff
      
      * flake
      
      * use finfo for attention bias
      
      * docs
      
      * Update src/transformers/generation/flax_utils.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * docs
      
      * add timestamp flax test
      
      * jit for timestamps
      
      * formatting
      
      * clean up timestamps processor
      
      * formatting
      
      * remove if_true
      
      * cleanup
      
      ---------
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      2840272c