1. 24 Feb, 2022 4 commits
  2. 23 Feb, 2022 3 commits
    • Lysandre Debut's avatar
      [Test refactor 1/5] Per-folder tests reorganization (#15725) · 29c10a41
      Lysandre Debut authored
      
      
      * Per-folder tests reorganization
      Co-authored-by: default avatarsgugger <sylvain.gugger@gmail.com>
      Co-authored-by: default avatarStas Bekman <stas@stason.org>
      29c10a41
    • Nicolas Patry's avatar
      Enable `image-segmentation` on `AutoModelForSemanticSegmentation` (#15647) · 9e71d464
      Nicolas Patry authored
      * Enabling Beit SegFormer to `image-segmentation`.
      
      * Fixing the score.
      
      * Fix import ?
      
      * Missing in type hint.
      
      * Multiple test fixes:
      
      - Add `raw_image` support. It should be the default IMHO since in Python
        world it doesn't make any sense to base64 encode the image (Sorry
        @mishig, didn't catch that in my review). I really think we should
        consider breaking BC here.
      - Add support for Segformer tiny test (needed
        `SegformerModelTester.get_config` to enable TinyConfig
        @NielsRogge)
      - Add the check that `batch_size` works correctly on that pipeline.
        Uncovered that it doesn't for Detr, which IMO is OK since images
        after `feature_extractor` don't have the same size. Comment should
        explain.
      
      * Type hint as a string.
      
      * Make fixup + update black.
      
      * torch+vision protections.
      
      * Don't use torchvision, use F.interpolate instead (no new dep).
      
      * Last fixes for Segformer.
      
      * Update test to reflect new image (which was broken)
      
      * Update tests.
      
      * Major BC modification:
      
      - Removed the string compressed PNG string, that's a job for users
      `transformers` stays in python land.
      - Removed the `score` for semantic segmentation. It has hardly a meaning
        on its own in this context.
      - Don't include the grayscale with logits for now (which could enable
        users to get a sense of confidence). Might be done later.
      - Don't include the surface of the mask (could be used for sorting by
        users, to filter out small masks). It's already calculable, and
        it's easier to add later, than to add now and break later if we need.
      
      * `make fixup`.
      
      * Small changes.
      
      * Rebase + doc fixup.
      9e71d464
    • Nicolas Patry's avatar
      Adding ZeroShotImageClassificationPipeline (#12119) · f9582c20
      Nicolas Patry authored
      
      
      * [Proposal] Adding ZeroShotImageClassificationPipeline
      
      - Based on CLIP
      
      * WIP, Resurection in progress.
      
      * Resurrection... achieved.
      
      * Reword handling different `padding_value` for `feature_extractor` and
      `tokenizer`.
      
      * Thanks doc-builder !
      
      * Adding docs + global namespace `ZeroShotImageClassificationPipeline`.
      
      * Fixing templates.
      
      * Make the test pass and be robust to floating error.
      
      * Adressing suraj's comments on docs mostly.
      
      * Tf support start.
      
      * TF support.
      
      * Update src/transformers/pipelines/zero_shot_image_classification.py
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      f9582c20
  3. 22 Feb, 2022 2 commits
    • Patrick von Platen's avatar
      Time stamps for CTC models (#15687) · c44d3675
      Patrick von Platen authored
      
      
      * [Wav2Vec2 Time Stamps]
      
      * Add first version
      
      * add word time stamps
      
      * Fix
      
      * save intermediate space
      
      * improve
      
      * [Finish CTC Tokenizer]
      
      * remove @
      
      * remove @
      
      * push
      
      * continue with phonemes
      
      * up
      
      * finish PR
      
      * up
      
      * add example
      
      * rename
      
      * finish
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * correct split
      
      * finalize
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      c44d3675
    • Funtowicz Morgan's avatar
      Gelu10 (#15676) · 32295b15
      Funtowicz Morgan authored
      * Add GeLU10 (clipped version of GeLU) to transformers to improve quantization performances.
      
      * Add unittests.
      
      * Import tensorflow after `is_tf_available` check.
      
      * Fix tensorflow wrong function `tf.tensor` to `tf.constant`
      
      * style.
      
      * use `tf.math.max`
      
      * Fix tf tests.
      
      * style.
      
      * style style style style style style
      
      * style style style style style style
      
      * Address @sgugger comments.
      
      * Fix wrong operator for raising ValueError for ClippedGELUActivation.
      32295b15
  4. 21 Feb, 2022 1 commit
  5. 18 Feb, 2022 6 commits
    • Sanchit Gandhi's avatar
      fix bug in PT speech-encoder-decoder (#15699) · 60ba4820
      Sanchit Gandhi authored
      
      
      * fix bug in PT speech-encoder-decoder
      
      * add pt test for `inputs is not None`
      
      * fix test
      
      * new pt test
      
      * Update tests/test_modeling_speech_encoder_decoder.py
      
      * make fixup
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      60ba4820
    • Lysandre Debut's avatar
      Fix auto (#15706) · 83f45cd6
      Lysandre Debut authored
      83f45cd6
    • Gunjan Chhablani's avatar
      Add PLBart (#13269) · ae1f8350
      Gunjan Chhablani authored
      * Init PLBART
      
      * Add missing configuration file
      
      * Add conversion script and configurationf ile
      
      * Fix style
      
      * Update modeling and conversion scripts
      
      * Fix scale embedding in config
      
      * Add comment
      
      * Fix conversion script
      
      * Add classification option to conversion script
      
      * Fix vocab size in config doc
      
      * Add tokenizer files from MBart50
      
      * Allow no lang code in regular tokenizer
      
      * Add PLBart Tokenizer Converters
      
      * Remove mask from multi tokenizer
      
      * Remove mask from multi tokenizer
      
      * Change from MBart-50 to MBart tokenizer
      
      * Fix names and modify src/tgt behavior
      
      * Fix imports for tokenizer
      
      * Remove <mask> from multi tokenizer
      
      * Fix style
      
      * Change tokenizer_class to processor_class
      
      * Add attribute map to config class
      
      * Update modeling file to modified MBart code
      
      * Update configuration file to MBart style configuration
      
      * Fix tokenizer
      
      * Separate tokenizers
      
      * Fix error in tokenization auto
      
      * Copy MBart tests
      
      * Replace with MBart tokenization tests
      
      * Fix style
      
      * Fix language code in multi tokenizer
      
      * Fix configuration docs
      
      * Add entry for plbart_multi in transformers init
      
      * Add dummy objects and fix imports
      
      * Fix modeling tests
      
      * Add TODO in config
      
      * Fix copyright year
      
      * Fix modeling docs and test
      
      * Fix some tokenization tests and style
      
      * Add changes from review
      
      * Fix copies
      
      * Fix docs
      
      * Fix docs
      
      * Fix style
      
      * Fix year
      
      * Add changes from review
      
      * Remove extra changes
      
      * Fix base tokenizer and doc
      
      * Fix style
      
      * Fix modeling and slow tokenizer tests
      
      * Remove Multi-tokenizer Converter and Tests
      
      * Delete QA model and Multi Tokenizer dummy objects
      
      * Fix repo consistency and code quality issues
      
      * Fix example documentation
      
      * Fix style
      
      * Remove PLBartTokenizer from type checking in init
      
      * Fix consistency issue
      
      * Add changes from review
      
      * Fix style
      
      * Remove PLBartTokenizerFast
      
      * Remove FastTokenizer converter
      
      * Fix AutoTokenzier mapping
      
      * Add plbart to toctree and fix consistency issues
      
      * Add language codes tokenizer test
      
      * Fix styling and doc issues
      
      * Add fixes for failing tests
      
      * Fix copies
      
      * Fix failing modeling test
      
      * Change assert to assertTrue in modeling tests
      ae1f8350
    • Yih-Dar's avatar
      Fix LongformerModel hidden states (#15537) · 2f2fefd6
      Yih-Dar authored
      
      
      * add undo padding
      
      * fix
      
      * fix tuple issue
      
      * make style and quality
      
      * move unpad logic to LongformerEncoder + unpad attentions + update tests
      
      * move unpad logic to TFLongformerEncoder
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      2f2fefd6
    • Yih-Dar's avatar
    • SaulLu's avatar
      fix CLIP fast tokenizer and change some properties of the slow version (#15067) · e93763d4
      SaulLu authored
      
      
      Very big changes concerning the tokenizer fast of CLIP which did not correspond to the tokenizer slow of CLIP
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      e93763d4
  6. 17 Feb, 2022 2 commits
    • NielsRogge's avatar
      Add SimMIM (#15586) · 57882177
      NielsRogge authored
      
      
      * Add first draft
      
      * Make model importable
      
      * Make SwinForMaskedImageModeling importable
      
      * Fix imports
      
      * Add missing inits
      
      * Add support for Swin
      
      * Fix bug
      
      * Fix bug
      
      * Fix another bug
      
      * Fix Swin MIM implementation
      
      * Fix default encoder stride
      
      * Fix Swin
      
      * Add print statements for debugging
      
      * Add image_size data argument
      
      * Fix Swin
      
      * Fix image_size
      
      * Add print statements for debugging
      
      * Fix print statement
      
      * Remove print statements
      
      * Improve reshaping of bool_masked_pos
      
      * Add support for DeiT, fix tests
      
      * Improve docstrings
      
      * Apply new black version
      
      * Improve script
      
      * Fix bug
      
      * Improve README
      
      * Apply suggestions from code review
      
      * Remove DS_Store and add to gitignore
      
      * Apply suggestions from code review + fix BEiT Flax
      
      * Revert BEiT changes
      
      * Improve README
      
      * Fix code quality
      
      * Improve README
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MBP.localdomain>
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      57882177
    • Tanay Mehta's avatar
      Add PoolFormer (#15531) · f84e0dbd
      Tanay Mehta authored
      
      
      * Added all files, PoolFormerFeatureExtractor still failing tests
      
      * Fixed PoolFormerFeatureExtractor not being able to import
      
      * Completed Poolformer doc
      
      * Applied Suggested fixes
      
      * Fixed errors in modeling_auto.py
      
      * Fix feature extractor, convert docs to Markdown, styling of code
      
      * Remove PoolFormer from check_repo and fix integration test
      
      * Remove Poolformer from check_repo
      
      * Fixed configuration_poolformer.py docs and removed inference.py from poolformer
      
      * Ran with black v22
      
      * Added PoolFormer to _toctree.yml
      
      * Updated poolformer doc
      
      * Applied suggested fixes and added on README.md
      
      * Did make fixup and make fix-copies, tests should pass now
      
      * Changed PoolFormer weights conversion script name and fixed README
      
      * Applied fixes in test_modeling_poolformer.py and modeling_poolformer.py
      
      * Added PoolFormerFeatureExtractor to AutoFeatureExtractor API
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MBP.localdomain>
      f84e0dbd
  7. 16 Feb, 2022 4 commits
  8. 15 Feb, 2022 8 commits
  9. 14 Feb, 2022 1 commit
    • Sylvain Gugger's avatar
      Register feature extractor (#15634) · 2e11a043
      Sylvain Gugger authored
      * Rework AutoFeatureExtractor.from_pretrained internal
      
      * Custom feature extractor
      
      * Add more tests
      
      * Add support for custom feature extractor code
      
      * Clean up
      
      * Add register API to AutoFeatureExtractor
      2e11a043
  10. 11 Feb, 2022 4 commits
  11. 10 Feb, 2022 4 commits
  12. 09 Feb, 2022 1 commit