  1. 19 Dec, 2022 1 commit
  2. 16 Dec, 2022 2 commits
    • NielsRogge's avatar
      Add Swin2SR (#19784) · 26dd041c
      NielsRogge authored
      
      
      * First draft
      
      * Add more improvements
      
      * Improve forward pass
      
      * Fix layernorm
      
      * Add upscaler
      
      * More improvements
      
      * More improvements
      
      * More improvements
      
      * Improve conversion script
      
      * Add preprocessing
      
      * Make output match original implementation
      
      * Add additional attributes
      
      * Add support for more models
      
      * Support more models
      
      * Add support for real world sr
      
      * Add initial Swin2SRFeatureExtractor
      
      * Add ImageSuperResolutionOutput
      
      * Make more tests pass
      
      * Use BaseModelOutput
      
      * Fix one more test
      
      * Fix more tests
      
      * Fix another test
      
      * Fix all tests
      
      * Rename to Swin2SRImageProcessor
      
      * Fix toctree
      
      * Fix toctree
      
      * Fix rebase
      
      * Improve Swin2SRImageProcessor
      
      * Remove feature extractor file
      
      * Improve model
      
      * Improve conversion script
      
      * Fix integration test
      
      * Fix init
      
      * Fix conversion script
      
      * Address comments
      
      * Improve upsampler
      
      * Add NearestConvUpsampler
      
      * Improve pixel shuffle upsampler
      
      * Improve auxiliary upsampler
      
      * Improve conversion script
      
      * Rename conv_last to final_convolution
      
      * Fix rebase
      
      * Improve upsample module
      
      * Add padding to image processor
      
      * Fix bug
      
      * Update padding
      
      * Remove print statement and fix integration test
      
      * Improve docs
      
      * Add image processor tests
      
      * Convert all checkpoints, fix tests
      
      * Remove print statements
      
      * Fix import
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      26dd041c
    • NielsRogge's avatar
      Add Universal Segmentation class + mapping (#20766) · 7f998612
      NielsRogge authored
      
      
      * Add mapping
      
      * Add mapping to pipeline
      
      * Apply suggestions
      
      * Fix feature extractor tests
      
      * Use ForInstance, add model to universal mapping
      
      * More fixes
      
      * Remove model from deprecated objects
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      7f998612
  3. 13 Dec, 2022 1 commit
    • Hazrul Akmal's avatar
      Add docs xlm roberta (#20742) · f28c918c
      Hazrul Akmal authored
      * added model resources for xlm-roberta
      
      * added model resources for xlm-roberta
      
      * resolve suggested changes
      
      * add resources to xlm-roberta
      f28c918c
  4. 12 Dec, 2022 2 commits
    • Ariel Ekgren's avatar
      Add gpt-sw3 model to transformers (#20209) · 5f94855d
      Ariel Ekgren authored
      
      
      * Add templates for gpt-sw3
      
      * Add templates for gpt-sw3
      
      * Added sentencepiece tokenizer
      
      * intermediate commit with many changes
      
      * fixed conflicts
      
      * Init commit for tokenization port
      
      * Tokenization progress
      
      * Remove fast tokenizer
      
      * Clean up and rename spm.model -> spiece.model
      
      * Remove TF -> PT conversion script template, Clean up Megatron -> PT script
      
      * Optimize encode & decode performance
      
      * added new attention
      
      * added new attention
      
      * attention for gpt-sw3 working
      
      * attention good
      
      * Cache is now working
      
      * fixed attention mask so that it works with causal attention
      
      * fixed baddbmm bug for cpu and caching
      
      * updated config with correct parameters
      
      * Refactor and leave optimizations as separate functions to avoid breaking expected functionality
      
      * Fix special tokens mapping for both tokenizers
      
      * cleaning up of code and comments
      
      * HF compatible attention outputs
      
      * Tokenizer now passing tests, add documentation
      
      * Update documentation
      
      * reverted back to base implementation after checking that it is identical to pretrained model
      
      * updated gpt-sw3 config
      
      * updated conversion script
      
      * aligned parameters with gpt-sw3 config
      
      * changed default scale_attn_by_inverse_layer_idx to true
      
      * removed flag from conversion script
      
      * added temporary model path
      
      * reverted back to functioning convert script
      
      * small changes to default config
      
      * updated tests for gpt-sw3
      
      * make style, make quality, minor cleanup
      
      * Change local paths to testing online repository
      
      * Change name: GptSw3 -> GPTSw3
      
      * Remove GPTSw3TokenizerFast references
      
      * Use official model repository and add more model sizes
      
      * Added reference to 6.7b model
      
      * Add GPTSw3DoubleHeadsModel to IGNORE_NON_AUTO_CONFIGURED, like GPT2DoubleHeadsModel
      
      * Remove pointers to non-existing TFGPTSw3
      
      * Add GPTSw3 to docs/_toctree.yml
      
      * Remove TF artifacts from GPTSw3 in __init__ files
      
      * Update README:s with 'make fix-copies'
      
      * Add 20b model to archive list
      
      * Add documentation for GPT-Sw3
      
      * Fix typo in documentation for GPT-Sw3
      
      * Do 'make fix-copies' again after having updated docs
      
      * Fix some typos in docs
      
      * Update src/transformers/models/gpt_sw3/configuration_gpt_sw3.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/configuration_gpt_sw3.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/__init__.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/__init__.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/convert_megatron_to_pytorch.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/modeling_gpt_sw3.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update tests/models/gpt_sw3/test_tokenization_gpt_sw3.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/modeling_gpt_sw3.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/modeling_gpt_sw3.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Resolve comments from PR feedback
      
      * Resolve more comments from PR feedback, also set use_cache=True in convert script
      
      * Add '# Copied from' comments for GPTSw3 modeling
      
      * Set 'is_parallelizable = False'
      
      * Remove '# Copied from' where code was modified and add 'with x->y' when appropriate
      
      * Remove parallelize in mdx
      
      * make style, make quality
      
      * Update GPTSw3Config default values and corresponding documentation
      
      * Update src/transformers/models/gpt_sw3/tokenization_gpt_sw3.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/__init__.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Clean up and protect GPTSw3Tokenizer imports with is_sentencepiece_available
      
      * Make style, make quality
      
      * Add dummy object for GPTSw3Tokenizer via 'make fix-copies'
      
      * make fix-copies
      
      * Remove GPTSw3 modeling classes
      
      * make style, make quality
      
      * Add GPTSw3 auto-mappings for other GPT2 heads
      
      * Update docs/source/en/model_doc/gpt-sw3.mdx
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/convert_megatron_to_pytorch.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/tokenization_gpt_sw3.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Remove old TODO-comment
      
      * Add example usage to GPTSw3Tokenizer docstring
      
      * make style, make quality
      
      * Add implementation details and example usage to gpt-sw3.mdx
      Co-authored-by: JoeyOhman <joeyoh@kth.se>
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      5f94855d
    • stanleycai95's avatar
      17c742bb
  5. 07 Dec, 2022 2 commits
    • Cole Howard's avatar
      Add TFBartForSequenceClassification (#20570) · fc95386e
      Cole Howard authored
      * read to load
      
      * base functionality
      
      * revert init
      
      * fix dummy data
      
      * moving right along
      
      * moving right along
      
      * finally
      
      * cleanup
      
      * pull out comment
      
      * add test
      
      * update docstring for main class
      
      * flake comments and rewriting copies from `make repo-consistency`
      
      * remove irrelevant differences/accidental spaces
      
      * put copies back after space removals
      
      * mid
      
      * final test pass
      
      * stray comment
      
      * update test file
      
      * update test file
      
      * fixup
      
      * black
      
      * missed
      
      * black missed one more
      
      * style
      
      * add doc update
      
      * fix order of output class
      
      * comment
      
      * Revert "comment"
      
      This reverts commit 03f86b6948808461939cc8ad4ad74305dfb67700.
      
      * remove redundant function, and redundant reshape
      
      * move change out of common
      
      * style
      
      * put common spaces back
      
      * reorder kwargs in output
      
      * doc style
      fc95386e
    • NielsRogge's avatar
      Add BiT + ViT hybrid (#20550) · d151a8c5
      NielsRogge authored
      
      
      * First draft
      
      * More improvements
      
      * Add backbone, first draft of ViT hybrid
      
      * Add AutoBackbone
      
      * More improvements
      
      * Fix bug
      
      * More improvements
      
      * More improvements
      
      * Convert ViT-hybrid
      
      * More improvements
      
      * add patch bit
      
      * Fix style
      
      * Improve code
      
      * cleaned v1
      
      * more cleaning
      
      * more refactoring
      
      * Improve models, add tests
      
      * Add docs and tests
      
      * Make more tests pass
      
      * Improve default backbone config
      
      * Update model_type
      
      * Fix more tests
      
      * Add more copied from statements
      
      * More improvements
      
      * Add push to hub to conversion scripts
      
      * clean
      
      * more cleanup
      
      * clean
      
      * replace to
      
      * fix
      
      * Update src/transformers/models/bit/configuration_bit.py
      Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
      
      * fix base model prefix
      
      * more cleaning
      
      * get rid of stem
      
      * clean
      
      * replace flag
      
      * Update src/transformers/models/bit/configuration_bit.py
      Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
      
      * Update src/transformers/models/bit/configuration_bit.py
      Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
      
      * add check
      
      * another check
      
      * fix for hybrid vit
      
      * final fix
      
      * update config
      
      * fix class name
      
      * fix `make fix-copies`
      
      * remove `use_activation`
      
      * Update src/transformers/models/bit/configuration_bit.py
      
      * rm unneeded file
      
      * Add BiT image processor
      
      * rm unneeded file
      
      * add doc
      
      * Add image processor to conversion script
      
      * Add ViTHybrid image processor
      
      * Add resources
      
      * Move bit to correct position
      
      * Fix auto mapping
      
      * Rename hybrid to Hybrid
      
      * Fix name in toctree
      
      * Fix READMEs
      
      * Improve config
      
      * Simplify GroupNormActivation layer
      
      * fix test + make style
      
      * Improve config
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * remove comment
      
      * remove comment
      
      * replace
      
      * replace
      
      * remove all conv_layer
      
      * refactor norm_layer
      
      * revert x
      
      * add copied from
      
      * last changes + integration tests
      
      * make fixup
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * fix name
      
      * fix message
      
      * remove assert and refactor
      
      * refactor + make fixup
      
      * refactor - add  + safety checker
      
      * fix docstring + checkpoint names
      
      * fix merge issues
      
      * fix function name
      
      * fix copies
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * fix model checkpoint
      
      * fix doctest output
      
      * vit name on doc
      
      * fix name on doc
      
      * fix small nits
      
      * fixed integration tests
      
      * final changes - slow tests pass
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      Co-authored-by: younesbelkada <younesbelkada@gmail.com>
      Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      d151a8c5
  6. 05 Dec, 2022 3 commits
    • Steven Liu's avatar
      Split autoclasses on modality (#20559) · 720e9599
      Steven Liu authored
      * split autoclasses on modality
      
      * apply review
      
      * auto classes
      720e9599
    • Francisco Kurucz's avatar
      ac3bccdc
    • Kamal Raj Kanakarajan's avatar
      Add BioGPT (#20420) · 13e73668
      Kamal Raj Kanakarajan authored
      * biogpt initial commit
      
      * updated init
      
      * fix faster decoding with use_cache
      
      * 1. fix input_ids and input_embeds with correct device
      2. added _keys_to_ignore_on_load_missing
      3. updated prepare_inputs_for_generation
      
      * add activation_dropout and scale_embedding
      
      * replace fsmt attention with bart attention
      
      * added test
      
      * run make fix-copies
      
      * doc init and fix build
      
      * updated README with proper information
      
      * 1. added tips to docs
      2. updated BioGptTokenizer func
      
      * 1. added tokenizer test
      2. refactor tokenizer
      
      * make fixup
      
      * add biogpt fairseq to hf converter
      
      * updated layer names more
      similar to original checkpoints
      
      * config update doc string and set defaults
      
      * added "#copied" from bart model and
      updated doc strings
      
      * enable model_input_names in tokenizer
      
      * 1.  positionalembedding depending on attention_mask
      2. added attention mask to prepare for generation
      
      * added test to verify past and generation
      
      * BioGptLMHeadModel -> BioGptForCausalLM
      
      * fix typo
      
      * tokenization and test
      Copyright and updated assertion
      
      * updated Copyright and
      one func at time in line
      
      * Copyright updates and
      minor doc fix
      
      * replace assertion with ValueError
      
      * rm extra space
      
      * added code syntax
      
      * revert cmnt position change
      
      * add tokenizer to auto
      
      * updated doc string
      
      * tokenizer doc string update
      
      * biogpt hub model update to microsoft/biogpt
      
      * make fixup
      
      * rm cmnt to fix flake8 5.0.4 vs 6 error
      13e73668
  7. 02 Dec, 2022 2 commits
    • szhublox's avatar
      flan-t5.mdx: fix link to large model (#20555) · 699e9043
      szhublox authored
      699e9043
    • fatih's avatar
      [New Model] Add TimeSformer model (#18908) · cc3d0e1b
      fatih authored
      * init timesformer
      
      * apply fix-copies
      
      * reformat style
      
      * revert back some incorrect style updates
      
      * init timesformer
      
      * apply fix-copies
      
      * reformat style
      
      * revert back some incorrect style updates
      
      * update timesformer doc
      
      * add some functions and classes
      
      * add new config params
      
      * implement multiple classes
      
      * update TimeSformerLayer
      
      * update TimeSformerModel, TimeSformerPreTrainedModel, TimeSformerEncoder
      
      * several fixes
      
      * reformat
      
      * temporary update
      
      * fix some typos
      
      * fix weight converter
      
      * more fixes
      
      * fix a typo
      
      * fix typo
      
      * remove redundant params
      
      * fix for latest hf-hub
      
      * merge fix
      
      * fix some checks
      
      * video classification works with einops
      
      * add paper info to docs
      
      * merge fix
      
      * remove redundant line
      
      * remove redundant docstring
      
      * update config
      
      * fix some typos
      
      * fix converter
      
      * update some test constants
      
      * refactor einops functions
      
      * reformat
      
      * fix a comment
      
      * remove redundant imports
      
      * reformat
      
      * fix a typo
      
      * remove comment
      
      * remove unused imports
      
      * remove redundant doc line
      
      * reformat
      
      * add missing line
      
      * fix docs
      
      * fix timesformer auto feat ext
      
      * add unittests
      
      * reformat
      
      * fix docs
      
      * some fixes and updates
      
      * fix readme
      
      * fix modeling
      
      * fix readme
      
      * update index
      
      * revert _toctree.yml changes
      
      * update timesformer.mdx
      
      * update drop_path_prob to drop_path_rate
      
      * add docstring for drop_path_rate
      
      * update TimeSformerPatchEmbed naming
      
      * remove to_2tuple
      
      * explicit use of nn.functional
      
      * reformat
      
      * many updates from review comments
      
      * fix a typo
      
      * reformat
      
      * remove assert, better variable name
      
      * make variable names more explicit
      
      * add some adapted from
      
      * more explicit variable names
      
      * remove redundant docstring
      
      * fix initialization
      
      * move permute inside embedding
      
      * update class names
      
      * remove unused imports
      
      * add test for video classification
      
      * update PretrainedModel with PreTrainedModel
      
      * remove double permute
      
      * update based on sylvain's review
      
      * apply auto fix
      
      * update image_processing_auto for timesformer
      
      * update hub urls
      
      * reformat
      
      * remove duplicate import
      
      * update doc link
      cc3d0e1b
  8. 30 Nov, 2022 3 commits
    • Yang An's avatar
      Add Chinese-CLIP implementation (#20368) · 72176402
      Yang An authored
      
      
      * init chinese-clip model from clip
      
      * init model tests and docs
      
      * implement chinese-clip into hf
      
      * implement chinese-clip into hf
      
      * implement chinese-clip into hf
      
      * implement chinese-clip into hf
      
      * implement chinese-clip into hf
      
      * update usecase example in model implementation
      
      * fix codestyle
      
      * fix model_type typo in readme
      
      * add placeholder in doc
      
      * add placeholder in doc
      
      * update the init script
      
      * update usecase
      
      * fix codestyle
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * forward the convert_rgb
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * merge the recent update from clip about model_input_name property
      
      * update the doc
      
      * update the doc
      
      * update the doc
      
      * update the doc
      
      * remove unused imports
      
      * reformat code style
      
      * update the doc
      
      * fix isort style
      
      * bypass a weird failed unit test which is unrelated with my PR
      
      * update the doc
      
      * implement independent vision config class
      
      * implement independent vision model class
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * make style
      
      * fix refactor bug
      
      * make style
      
      * fix refactor bug
      
      * fix refactor bug
      
      * make style
      
      * fix refactor bug
      
      * fix refactor bug
      
      * doc-build restyle
      
      * implement independent text config class
      
      * implement independent text model class
      
      * implement independent text model class
      
      * make style
      
      * make fix-copies
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * make style
      
      * update doc
      
      * black and isort
      
      * update doc
      
      * Update src/transformers/models/chinese_clip/configuration_chinese_clip.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/auto/tokenization_auto.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * modify the model type from chinese-clip to chinese_clip
      
      * format the example comment of ChineseCLIPVisionConfig
      
      * correct the copyright comment
      
      * fix the tokenizer specification
      
      * add copied from for loss function
      
      * remove unused class
      
      * update CHINESE_CLIP_TEXT_INPUTS_DOCSTRING
      
      * update CHINESE_CLIP_INPUTS_DOCSTRING
      
      * update doc
      
      * update doc
      
      * update code comment in config
      
      * update copied from statement
      
      * make style
      
      * rename the doc file
      
      * add copied statement
      
      * remove unused attention_mask, causal_attention_mask in ChineseCLIPVisionEncoder
      
      * remove ChineseCLIPTextPreTrainedModel
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update doc
      
      * make style
      
      * Update src/transformers/models/chinese_clip/configuration_chinese_clip.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/chinese_clip/configuration_chinese_clip.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * update ChineseCLIPImageProcessor in image_processing_auto
      
      * fix config_class of chinesecliptextmodel
      
      * fix the test case
      
      * update the docs
      
      * remove the copied from comment for ChineseCLIPTextModel, since it has diverged from BertModel with a custom config_class
      
      * update the testcase
      
      * final fix
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      72176402
    • amyeroberts's avatar
      Update doc examples feature extractor -> image processor (#20501) · 17a7b49b
      amyeroberts authored
      * Update doc example feature extractor -> image processor
      
      * Apply suggestions from code review
      17a7b49b
    • amyeroberts's avatar
      Add segmentation + object detection image processors (#20160) · de6d19ea
      amyeroberts authored
      * Add transforms for object detection
      
      * DETR models + Yolos
      
      * Scrappy additions
      
      * Maskformer image processor
      
      * Fix up; MaskFormer tests
      
      * Update owlvit processor
      
      * Add to docs
      
      * OwlViT tests
      
      * Update pad logic
      
      * Remove changes to transforms
      
      * Import fn directly
      
      * Update to include pad transformation
      
      * Remove unintended changes
      
      * Add new owlvit post processing function
      
      * Tidy up
      
      * Fix copies
      
      * Fix some copies
      
      * Include device fix
      
      * Fix scipy imports
      
      * Update _pad_image
      
      * Update padding functionality
      
      * Fix bug
      
      * Properly handle ignore index
      
      * Fix up
      
      * Remove defaults to None in docstrings
      
      * Fix docstrings & docs
      
      * Fix sizes bug
      
      * Resolve conflicts in init
      
      * Cast to float after resizing
      
      * Tidy & add size if missing
      
      * Allow kwargs when processing for owlvit
      
      * Update test values
      de6d19ea
  9. 29 Nov, 2022 3 commits
  10. 28 Nov, 2022 1 commit
  11. 21 Nov, 2022 3 commits
    • NielsRogge's avatar
      Add Audio Spectogram Transformer (#19981) · 4973d2a0
      NielsRogge authored
      
      
      * First draft
      
      * Make conversion script work
      
      * Add id2label mapping, run code quality
      
      * Fix copies
      
      * Add first draft of feature extractor
      
      * Update conversion script to use feature extractor
      
      * Make more tests pass
      
      * Add docs
      
      * update input_features to input_values + pad by default to max length
      
      * Fix doc tests
      
      * Add feature extractor tests
      
      * Add proper padding/truncation to feature extractor
      
      * Add support for conversion of all audioset checkpoints
      
      * Improve docs and extend conversion script
      
      * Fix README
      
      * Rename spectogram to spectrogram
      
      * Fix copies
      
      * Add integration test
      
      * Remove dummy conv
      
      * Update to ast
      
      * Update organization
      
      * Fix init
      
      * Rename model to AST
      
      * Add require_torchaudio annotator
      
      * Move import of ASTFeatureExtractor under a is_speech_available
      
      * Fix rebase
      
      * Add pipeline config
      
      * Update name of classifier head
      
      * Rename time_dimension and frequency_dimension for clarity
      
      * Remove print statement
      
      * Fix pipeline test
      
      * Fix pipeline test
      
      * Fix index table
      
      * Fix init
      
      * Fix conversion script
      
      * Rename to ForAudioClassification
      
      * Fix index table
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      4973d2a0
    • NielsRogge's avatar
      Add resources (#20296) · 96783e53
      NielsRogge authored
      
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      96783e53
    • Matthijs Hollemans's avatar
      add MobileNetV1 model (#17799) · d21c97cc
      Matthijs Hollemans authored
      * add model files etc for MobileNetV2
      
      rename files for MobileNetV1
      
      initial implementation of MobileNetV1
      
      fix conversion script
      
      cleanup
      
      write docs
      
      tweaks
      
      fix conversion script
      
      extract hidden states
      
      fix test cases
      
      make fixup
      
      fixup it all
      
      remove main from doc link
      
      fixes
      
      fix tests
      
      fix up
      
      use google org
      
      fix weird assert
      
      * fixup
      
      * use google organization for checkpoints
      d21c97cc
  12. 18 Nov, 2022 1 commit
    • Ali Hassani's avatar
      Add Neighborhood Attention Transformer (NAT) and Dilated NAT (DiNAT) models (#20219) · fc4a993e
      Ali Hassani authored
      * Add DiNAT
      
      * Adds DiNAT + tests
      
      * Minor fixes
      
      * Added HF model
      
      * Add natten to dependencies.
      
      * Cleanup
      
      * Minor fixup
      
      * Reformat
      
      * Optional NATTEN import.
      
      * Reformat & add doc to _toctree
      
      * Reformat (finally)
      
      * Dummy objects for DiNAT
      
      * Add NAT + minor changes
      
      Adds NAT as its own independent model + docs, tests
      Adds NATTEN to ext deps to ensure ci picks it up.
      
      * Remove natten from `all` and `dev-torch` deps, add manual pip install to ci tests
      
      * Minor fixes.
      
      * Fix READMEs.
      
      * Requested changes to docs + minor fixes.
      
      * Requested changes.
      
      * Add NAT/DiNAT tests to layoutlm_job
      
      * Correction to Dinat doc.
      
      * Requested changes.
      fc4a993e
  13. 16 Nov, 2022 2 commits
  14. 15 Nov, 2022 4 commits
  15. 14 Nov, 2022 2 commits
    • Bartosz Szmelczynski's avatar
      Fix tapas scatter (#20149) · 78a471ff
      Bartosz Szmelczynski authored
      
      
      * First draft
      
      * Remove scatter dependency
      
      * Add require_torch
      
      * update vectorized sum test, add clone call
      
      * remove artifacts
      
      * fix style
      
      * fix style v2
      
      * remove "scatter" mentions from the code base
      
      * fix isort error
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
    • add MobileNetV2 model (#17845) · f711d683
      Matthijs Hollemans authored
      * add model files etc for MobileNetV2
      
      * rename files for MobileNetV1
      
      * initial implementation of MobileNetV1
      
      * fix conversion script
      
      * cleanup
      
      * write docs
      
      * tweaks
      
      * fix conversion script
      
      * extract hidden states
      
      * fix test cases
      
      * make fixup
      
      * fixup it all
      
      * rename V1 to V2
      
      * fix checkpoints
      
      * fixup
      
      * implement first block + weight conversion
      
      * add remaining layers
      
      * add output stride and dilation
      
      * fixup
      
      * add tests
      
      * add deeplabv3+ head
      
      * a bit of fixup
      
      * finish deeplab conversion
      
      * add link to doc
      
      * fix issue with JIT trace
      
      in_height and in_width would be Tensor objects during JIT trace, which caused Core ML conversion to fail on the remainder op. By making them ints, the result of the padding calculation becomes a constant value.
      
      * cleanup
      
      * fix order of models
      
      * fix rebase error
      
      * remove main from doc link
      
      * add image processor
      
      * remove old feature extractor
      
      * fix converter + other issues
      
      * fixup
      
      * fix unit test
      
      * add to onnx tests (but these appear broken now)
      
      * add post_process_semantic_segmentation
      
      * use google org
      
      * remove unused imports
      
      * move args
      
      * replace weird assert
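
The JIT trace note above can be illustrated with a minimal sketch of a TensorFlow-style "SAME" padding calculation along one axis. The function name `same_padding` and its signature are hypothetical and are not taken from the commit; the sketch only assumes that the input size is converted to a plain Python `int` before the remainder is computed, so the resulting padding is an ordinary constant rather than a traced tensor operation.

```python
from typing import Tuple


def same_padding(in_size, kernel: int, stride: int, dilation: int = 1) -> Tuple[int, int]:
    """Return (pad_before, pad_after) for "SAME"-style padding along one axis.

    Hypothetical helper for illustration only.
    """
    # Convert to a plain int first: the remainder below is then evaluated
    # in Python and yields a constant, not an op recorded in a trace.
    in_size = int(in_size)

    remainder = in_size % stride
    if remainder == 0:
        total = max(kernel - stride, 0)
    else:
        total = max(kernel - remainder, 0)

    # Split the total padding, putting the extra element (if any) at the end.
    pad_before = total // 2
    pad_after = total - pad_before
    return pad_before * dilation, pad_after * dilation


if __name__ == "__main__":
    print(same_padding(224, kernel=3, stride=2))
    print(same_padding(225, kernel=3, stride=2))
```

In a real model the same calculation would run once for height and once for width, with the sizes read from the input tensor's shape.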
  16. 10 Nov, 2022 2 commits
  17. 09 Nov, 2022 2 commits
  18. 08 Nov, 2022 3 commits
    • AutoImageProcessor (#20111) · 4eb918e6
      amyeroberts authored
      * AutoImageProcessor skeleton
      
      * Update references
      
      * Add mapping in init
      
      * Add model image processors to __init__ for importing
      
      * Add AutoImageProcessor tests
      
      * Fix up
      
      * Image Processor documentation
      
      * Remove pdb
      
      * Update docs/source/en/model_doc/mobilevit.mdx
      
      * Update docs
      
      * Don't add whitespace on json files
      
      * Remove fixtures
      
      * Move checking model config down
      
      * Fix up
      
      * Add check for image processor
      
      * Remove FeatureExtractorMixin in docstrings
      
      * Rename model_tmpfile to config_tmpfile
      
      * Don't make None if not in image processor map
    • Add RocBert (#20013) · efa889d2
      Weiwe Shi authored
      
      
      * add roc_bert
      
      * update roc_bert readme
      
      * code style
      
      * change name and delete unused file
      
      * update model file
      
      * delete unused log file
      
      * delete tokenizer fast
      
      * reformat code and change model file path
      
      * add RocBertForPreTraining
      
      * update docs
      
      * delete wrong notes
      
      * fix copies
      
      * fix make repo-consistency error
      
      * fix "files are not present in the table of contents" error
      
      * change RocBert -> RoCBert
      
      * add doc, add detail test
      Co-authored-by: weiweishi <weiweishi@tencent.com>
    • Add CLIPSeg (#20066) · 25896306
      NielsRogge authored
      
      
      * Add first draft
      
      * Update conversion script
      
      * Improve conversion script
      
      * Improve conversion script some more
      
      * Add conditional embeddings
      
      * Add initial decoder
      
      * Fix activation function of decoder
      
      * Make decoder outputs match original implementation
      
      * Make decoder outputs match original implementation
      
      * Add more copied from statements
      
      * Improve model outputs
      
      * Fix auto tokenizer file
      
      * Fix more tests
      
      * Add test
      
      * Improve README and docs, improve conditional embeddings
      
      * Fix more tests
      
      * Remove print statements
      
      * Remove initial embeddings
      
      * Improve conversion script
      
      * Add interpolation of position embeddings
      
      * Finish addition of interpolation of position embeddings
      
      * Add support for refined checkpoint
      
      * Fix refined checkpoint
      
      * Remove unused parameter
      
      * Improve conversion script
      
      * Add support for training
      
      * Fix conversion script
      
      * Add CLIPSegFeatureExtractor
      
      * Fix processor
      
      * Fix CLIPSegProcessor
      
      * Fix conversion script
      
      * Fix most tests
      
      * Fix equivalence test
      
      * Fix README
      
      * Add model to doc tests
      
      * Use better variable name
      
      * Convert other checkpoint as well
      
      * Update config, add link to paper
      
      * Add docs
      
      * Update organization
      
      * Replace base_model_prefix with clip
      
      * Fix base_model_prefix
      
      * Fix checkpoint of config
      
      * Fix config checkpoint
      
      * Remove file
      
      * Use logits for output
      
      * Fix tests
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
  19. 07 Nov, 2022 1 commit