1. 25 Jan, 2023 3 commits
    • Anahita Bhiwandiwalla's avatar
      Add BridgeTower model (#20775) · 3a6e4a22
      Anahita Bhiwandiwalla authored
      
      
      * Commit with BTModel and latest HF code
      
      * Placeholder classes for BTForMLM and BTForITR
      
      * Importing Bert classes from transformers
      
      * Removed objectives.py and dist_utils.py
      
      * Removed swin_transformer.py
      
      * Add image normalization, BridgeTowerForImageAndTextRetrieval
      
      * Add center_crop
      
      * Removing bert tokenizer and LCI references
      
      * Tested config loading from HF transformers hub
      
      * Removed state_dict updates and added path to hub
      
      * Enable center crop
      
      * Getting image_size from config, renaming num_heads and num_layers
      
      * Handling max_length in BridgeTowerProcessor
      
      * Add BridgeTowerForMaskedLM
      
      * Add doc string for BridgeTowerConfig
      
      * Add doc strings for BT config, processor, image processor
      
      * Adding docs, removed swin
      
      * Removed convert_bridgetower_original_to_pytorch.py
      
      * Added doc files for bridgetower, removed is_vision
      
      * Add support attention_mask=None and BridgeTowerModelOutput
      
      * Fix formatting
      
      * Fixes with 'make style', 'make quality', 'make fixup'
      
      * Remove downstream tasks from BridgeTowerModel
      
      * Formatting fixes, add return_dict to BT models
      
      * Clean up after doc_test
      
      * Update BTModelOutput return type, fix todo in doc
      
      * Remove loss_names from init
      
      * implement tests and update tuples returned by models
      
      * Add image reference to bridgetower.mdx
      
      * after make fix-copies, make fixup, make style, make quality, make repo-consistency
      
      * Rename class names with BridgeTower prefix
      
      * Fix for image_size in BTImageProcessor
      
      * implement feature extraction bridgetower tests
      
      * Update image_mean and image_std to be list
      
      * remove unused import
      
      * Removed old comments
      
      * Rework CLIP
      
      * update config in tests followed config update
      
      * Formatting fixes
      
      * Add copied from for BridgeTowerPredictionHeadTransform
      
      * Update bridgetower.mdx
      
      * Update test_feature_extraction_bridgetower.py
      
      * Update bridgetower.mdx
      
      * BridgeTowerForMaskedLM is conditioned on image too
      
      * Add BridgeTowerForMaskedLM
      
      * Fixes
      
      * Call post_init to init weights
      
      * Move freeze layers into method
      
      * Remove BTFeatureExtractor, add BT under multimodal models
      
      * Remove BTFeatureExtractor, add BT under multimodal models
      
      * Code review feedback - cleanup
      
      * Rename variables
      
      * Formatting and style to PR review feedback
      
      * Move center crop after resize
      
      * Use named parameters
      
      * Style fix for modeling_bridgetower.py
      
      * Update docs/source/en/model_doc/bridgetower.mdx
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update docs/source/en/model_doc/bridgetower.mdx
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update docs/source/en/model_doc/bridgetower.mdx
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update src/transformers/models/bridgetower/modeling_bridgetower.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update src/transformers/models/bridgetower/modeling_bridgetower.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update docs/source/en/model_doc/bridgetower.mdx
      Co-authored-by: default avatarYounes Belkada <49240599+younesbelkada@users.noreply.github.com>
      
      * Update src/transformers/models/bridgetower/modeling_bridgetower.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Rename config params, copy BERT classes, clean comments
      
      * Cleanup irtr
      
      * Replace Roberta imports, add BTTextConfig and Model
      
      * Update docs, add visionconfig, consistent arg names
      
      * make fixup
      
      * Comments for forward in BTModel and make fixup
      
      * correct tests
      
      * Remove inconsistent roberta copied from
      
      * Add BridgeTowerTextModel to dummy_pt_objects.py
      
      * Add BridgeTowerTextModel to IGNORE_NON_TESTED
      
      * Update docs for BT Text and Vision Configs
      
      * Treat BridgeTowerTextModel as a private model
      
      * BridgeTowerTextModel as private
      
      * Run make fix-copies
      
      * Adding BTTextModel to PRIVATE_MODELS
      
      * Fix for issue with BT Text and Image configs
      
      * make style changes
      
      * Update README_ja.md
      
      Add 銇嬨倝 to BridgeTower's description
      
      * Clean up config, .mdx and arg names
      
      * Fix init_weights. Remove nn.Sequential
      
      * Formatting and style fixes
      
      * Re-add tie_word_embeddings in config
      
      * update test implementation
      
      * update style
      
      * remove commented out
      
      * fix style
      
      * Update README with abs for BridgeTower
      
      * fix style
      
      * fix mdx file
      
      * Update bridgetower.mdx
      
      * Update img src in bridgetower.mdx
      
      * Update README.md
      
      * Update README.md
      
      * resolve style failed
      
      * Update _toctree.yml
      
      * Update README_ja.md
      
      * Removed mlp_ratio, rename feats, rename BTCLIPModel
      
      * Replace BTCLIP with BTVisionModel,pass in vision_config to BTVisionModel
      
      * Add test_initialization support
      
      * Add support for output_hidden_states
      
      * Update support for output_hidden_states
      
      * Add support for output_attentions
      
      * Add docstring for output_hidden_states
      
      * update tests
      
      * add bridgetowervisionmodel as private model
      
      * rerun the PR test
      
      * Remove model_type, pass configs to classes, renames
      
      * Change self.device to use weight device
      
      * Remove image_size
      
      * Style check fixes
      
      * Add hidden_size and num_hidden_layers to BridgeTowerTransformer
      
      * Update device setting
      
      * cosmetic update
      
      * trigger test again
      
      * trigger tests again
      
      * Update test_modeling_bridgetower.py
      
      trigger tests again
      
      * Update test_modeling_bridgetower.py
      
      * minor update
      
      * re-trigger tests
      
      * Update docs/source/en/model_doc/bridgetower.mdx
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Remove pad, update max_text_len, doc cleanup, pass eps to LayerNorm
      
      * Added copied to, some more review feedback
      
      * make fixup
      
      * Use BridgeTowerVisionEmbeddings
      
      * Code cleanup
      
      * Fixes for BridgeTowerVisionEmbeddings
      
      * style checks
      
      * re-tests
      
      * fix embedding
      
      * address comment on init file
      
      * retrigger tests
      
      * update import prepare_image_inputs
      
      * update test_image_processing_bridgetower.py to reflect test_image_processing_common.py
      
      * retrigger tests
      Co-authored-by: default avatarShaoyen Tseng <shao-yen.tseng@intel.com>
      Co-authored-by: default avatarTiep Le <tiep.le@intel.com>
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      Co-authored-by: default avatarYounes Belkada <49240599+younesbelkada@users.noreply.github.com>
      Co-authored-by: default avatarTiep Le <97980157+tileintel@users.noreply.github.com>
      3a6e4a22
    • Maria Khalusova's avatar
      Documentation code sample fixes (#21302) · 23844941
      Maria Khalusova authored
      * Fixed the following:
      pipe -> pipeline
      out in pipe(data()) is a list of dict, not a dict
      
      * Fixed the TypeError: __init__() missing 1 required positional argument: 'key'
      
      * Added a tip: code sample requires additional libraries to run
      
      * Fixed custom config's name
      
      * added seqeval to the required libraries
      
      * fixed a missing dependency,
      fixed metric naming,
      added checkpoint to fix the datacollator
      
      * added checkpoint to fix the datacollator,
      added missing dependency
      23844941
    • NielsRogge's avatar
      [Mask2Former] Add doc tests (#21232) · f83135eb
      NielsRogge authored
      
      
      * Add doc tests
      
      * Add OneFormer resources茅
      
      * Fix merge
      
      * Fix style
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      f83135eb
  2. 24 Jan, 2023 3 commits
  3. 23 Jan, 2023 1 commit
  4. 20 Jan, 2023 4 commits
  5. 19 Jan, 2023 3 commits
    • Cl茅mentine Fourrier's avatar
      Graphormer model for Graph Classification (#20968) · 87208a05
      Cl茅mentine Fourrier authored
      
      
      * [FT] First commit for graphormer architecture.
      
      The model has no tokenizer, as it uses a collator and preprocessing function for its input management.
      Architecture to be tested against original one.
      The arch might need to be changed to fit the checkpoint, but a revert to the original arch will make the code less nice to read.
      TODO: doc
      
      * [FIX] removed test model
      
      * [FIX] import error
      
      * [FIX] black and flake
      
      * [DOC] added paper refs
      
      * [FIX] [DOC]
      
      * [FIX] black
      
      * [DOC] Updated READMEs
      
      * [FIX] Order of imports + rm Tokenizer calls
      
      * [FIX] Moved assert in class to prevent doc build failure
      
      * [FIX] make fix-copies
      
      * [Doc] update from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * [FIX] Removed Graphormer from Sequence classification model list
      
      * [DOC] Added HF copyright to Cython file
      
      * [DOC] Fixed comments
      
      * [FIX] typos in class doc + removed config classes.
      
      Todo: update doc from paper definitions
      
      * [FIX] Removed dependency to fairseq, and replaced all asserts with Exception management
      
      * [FIX] Homogeneized initialization of weights to pretrained constructor
      
      * [FIX] [CP] Updated multi_hop parameter to get same results as in original implementation
      
      * [DOC] Relevant parameter description in the configuration file
      
      * [DOC] Updated doc and comments in main graphormer file
      
      * [FIX] make style and quality checks
      
      * [DOC] Fix doc format
      
      * [FIX] [WIP] Updated part of the tests, though still a wip
      
      * [FIX] [WIP]
      
      * [FIX] repo consistency
      
      * [FIX] Changed input names for more understandability
      
      * [FIX] [BUG] updated num_classes params for propagation in the model
      
      * simplified collator
      
      * [FIX] Updated tests to follow new naming pattern
      
      * [TESTS] Updated test suite along with model
      
      * |FIX] rm tokenizer import
      
      * [DOC] add link to graphormerdoc
      
      * Changed section in doc from text model to graph model
      
      * Apply suggestions from code review
      
      Spacing, inits
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * [DOC] Explain algos_graphormer functions
      
      * Cython soft import protection
      
      * Rm call to Callable in configuration graphormer
      
      * [FIX] replaced asserts with Exceptions
      
      * Add org to graphormer checkpoints
      
      * Prefixed classes with Graphormer
      
      * Management of init functions
      
      * format
      
      * fixes
      
      * fix length file
      
      * update indent
      
      * relaunching ci
      
      * Errors for missing cython imports
      
      * fix style
      
      * fix style doc
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      87208a05
    • Maria Khalusova's avatar
      Updates to computer vision section of the Preprocess doc (#21181) · 0359e2e1
      Maria Khalusova authored
      
      
      * Extended the CV preprocessing section with more details and refactored the example
      
      * added padding to the CV section, though it is a special case
      
      * Added a tip about post processing methods
      
      * make style
      
      * link update
      
      * Apply suggestions from review
      Co-authored-by: default avatarSteven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * review feedback
      Co-authored-by: default avatarSteven Liu <59462357+stevhliu@users.noreply.github.com>
      0359e2e1
    • Jitesh Jain's avatar
      Add OneFormer Model (#20577) · 5b949623
      Jitesh Jain authored
      * Add Oneformer Model
      
      * Add OneFormer Tests
      
      * Add UNIVERSAL_SEGMENTATION_MAPPING
      
      * Fix config
      
      * 馃悰 Fix error encountered while writing tests
      
      * 馃敤 Fix instance segmentation post processing
      
      * Format Files and Add Documentation
      
      * Add Documentation mdx file
      
      * Run make fixup
      
      * Run make fix-copies
      
      * Remove unnecessary code
      
      * Format modeling_oneformer.py
      
      * Add OneFormer to ImageSegmentationPipeline
      
      * Format files
      
      * Add Demo link to Readme
      
      * Fix fomatting errors
      
      * Fix test failures
      
      * Update Table in index.mdx
      
      * Fix version
      
      * Fix style
      
      * Remove OneFormer from TF
      
      * Fix Imports
      
      * Fix dummy objects
      
      * Fix tests
      
      * Add newline
      
      * Remove OneFormerFeatureExtractor
      
      * Remove CUDA Kernels
      
      * Use AutoBackbone for Swin
      
      * Fix description
      
      * Use Image Processor
      
      * Fix copies
      
      * Fix formatting
      
      * Fix import order
      
      * Fix flake8 errors
      
      * Fix doc errors
      
      * Add Hindi Readme entry
      
      * Update supported backbones
      
      * Update supported backbones
      
      * Undo Changes
      
      * Fix type of config
      
      * Fix isort
      
      * Fix auto.mdx
      
      * Fix swin config
      
      * Replace DinatBackbone with AutoBackbone
      
      * Use SwinBackbone
      
      * Use SwinBackbone
      
      * Fix conversion script
      
      * Fix arguments
      
      * Add argument description
      
      * Fix style
      
      * Add OneFormerProcessor
      
      * Fix OneFormerProcessor Tests
      
      * Fix mapping
      
      * Fix imports
      
      * Fix inits
      
      * Fix style
      
      * Fix comment
      
      * Fix docstring
      
      * Move OneFormer to MultiModal
      
      * Fix Copies
      
      * Remove size divisor
      
      * Fix check_repo.py
      
      * Fix copies
      
      * Add Processor for Testing Pipeline
      
      * Fix padding for tokens
      
      * Fix variables
      
      * Fix formatting with correct black version
      
      * Add Image Processor Test
      
      * Apply suggestions
      
      * Revert common modeling
      
      * Add check for task
      
      * Fix conversion script
      
      * Fix initialization order
      
      * Fix tests
      
      * Undo Pipeline Changes
      
      * Fix layers in MLP
      
      * Fix copies
      
      * Update image paths
      
      * Fix copies
      
      * Apply suggestions
      5b949623
  6. 18 Jan, 2023 3 commits
    • Matt's avatar
      Rewrite a couple of lines in the TF XLA doc (#21177) · 00ba7cad
      Matt authored
      * Rewrite a couple of lines in the TF XLA doc to explain that jit_compile can be used in model.compile() too
      
      * Remove extra )
      00ba7cad
    • Samuel Xu's avatar
      Remove Roberta Dependencies from XLM Roberta Flax and Tensorflow models (#21047) · defdcd28
      Samuel Xu authored
      * Added flax model code
      
      * Added tf changes
      
      * missed some
      
      * Added copy comments
      
      * Added style hints
      
      * Fixed copy statements
      
      * Added suggested fixes
      
      * Made some fixes
      
      * Style fixup
      
      * Added necessary copy statements
      
      * Fixing copy statements
      
      * Added more copies
      
      * Final copy fix
      
      * Some bugfixes
      
      * Adding imports to init
      
      * Fixed up all make fixup errors
      
      * Fixed doc errors
      
      * Auto model changes
      defdcd28
    • Younes Belkada's avatar
      `blip` support for training (#21021) · 023f51fe
      Younes Belkada authored
      * `blip` support for training
      
      * remove labels creation
      
      * remove unneeded `decoder_input_ids` creation
      
      * final changes
      
      - add colab link to documentation
      - reduction = mean for loss
      
      * fix nits
      
      * update link
      
      * clearer error message
      023f51fe
  7. 17 Jan, 2023 7 commits
  8. 16 Jan, 2023 2 commits
    • Alara Dirik's avatar
    • NielsRogge's avatar
      Add UperNet (#20648) · 4ed89d48
      NielsRogge authored
      
      
      * First draft
      
      * More improvements
      
      * Add convnext backbone
      
      * Add conversion script
      
      * Add more improvements
      
      * Comment out to_dict
      
      * Add to_dict method
      
      * Add default config
      
      * Fix config
      
      * Fix backbone
      
      * Fix backbone some more
      
      * Add docs, auto mapping, tests
      
      * Fix some tests
      
      * Fix more tests
      
      * Fix more tests
      
      * Add conversion script
      
      * Improve conversion script
      
      * Add support for getting reshaped undownsampled hidden states
      
      * Fix forward pass
      
      * Add print statements
      
      * Comment out set_shift_and_window_size
      
      * More improvements
      
      * Correct downsampling layers conversion
      
      * Fix style
      
      * First draft
      
      * Fix conversion script
      
      * Remove config attribute
      
      * Fix more tests
      
      * Update READMEs
      
      * Update ConvNextBackbone
      
      * Fix ConvNext tests
      
      * Align ConvNext with Swin
      
      * Remove files
      
      * Fix index
      
      * Improve docs
      
      * Add output_attentions to model forward
      
      * Add backbone mixin, improve tests
      
      * More improvements
      
      * Update init_weights
      
      * Fix interpolation of logits
      
      * Add UperNetImageProcessor
      
      * Improve image processor
      
      * Fix image processor
      
      * Remove print statements
      
      * Remove script
      
      * Update import
      
      * Add image processor tests
      
      * Remove print statements
      
      * Fix test
      
      * Add integration test
      
      * Add convnext integration test
      
      * Update docstring
      
      * Fix README
      
      * Simplify config
      
      * Apply suggestions
      
      * Improve docs
      
      * Rename class
      
      * Fix test_initialization
      
      * Fix import
      
      * Address review
      
      * Fix confg
      
      * Convert all checkpoints
      
      * Fix default backbone
      
      * Usage same processor as segformer
      
      * Apply suggestions
      
      * Fix init_weights, update conversion scripts
      
      * Improve config
      
      * Use Auto API instead of creating a new image processor
      
      * Fix docs
      
      * Add doctests
      
      * Remove ResNetConfig dependency
      
      * Add always_partition argument
      
      * Fix rebase茅
      
      * Improve docs
      
      * Convert checkpoints
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MBP.localdomain>
      4ed89d48
  9. 13 Jan, 2023 1 commit
  10. 10 Jan, 2023 1 commit
  11. 04 Jan, 2023 3 commits
  12. 03 Jan, 2023 3 commits
    • Alara Dirik's avatar
      Improve OWL-ViT postprocessing (#20980) · cd245780
      Alara Dirik authored
      * add post_process_object_detection method
      
      * style changes
      cd245780
    • NielsRogge's avatar
      Add GIT (GenerativeImage2Text) (#20295) · 9c6f7485
      NielsRogge authored
      
      
      * First draft
      
      * Make model instantiation work
      
      * Fix copied from statement
      
      * More fixes
      
      * Add correct output head
      
      * Improve configuration
      
      * Add conversion script
      
      * Improve conversion script
      
      * Remove token_type_ids
      
      * Fix conversion of projection layers
      
      * Convert all weights
      
      * Use cats image
      
      * Make logits match
      
      * Generate caption on cats image
      
      * Add GITProcessor
      
      * Update conversion script
      
      * Add support for more checkpoints
      
      * Fix conversion script
      
      * Add initial tests
      
      * Remove cross-attention
      
      * More improvements
      
      * Remove is_decoder
      
      * Improve model tests
      
      * Improve tests
      
      * Improve model outputs
      
      * Fix model outputs equivalence
      
      * Fix more tests
      
      * Remove unused code
      
      * Use generate to generate text, no use of cache for now
      
      * Use generate more appropriately
      
      * Fix config tests
      
      * Fix style
      
      * Add support for use_cache
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Fix style
      
      * Fix GIT vision encoder
      
      * Update README
      
      * Fix integration test
      
      * Set bos and eos token ids
      
      * Improve docs
      
      * Improve code
      
      * Add support for provided attention_mask
      
      * Add copied from statement
      
      * Fix gradient checkpointing test
      
      * Set model_input_names
      
      * Investigate model_input_names
      
      * Remove script
      
      * Fix model inputs
      
      * Fix docstring
      
      * Rename GIT to Git
      
      * Support more models
      
      * Add support for textvqa model
      
      * Add video support
      
      * Extend conversion script for video
      
      * Add support for large variant
      
      * Add support for more models
      
      * Fix config archive map
      
      * Update integration test
      
      * Fix README
      
      * Fix CLIP mean and std
      
      * Update processor
      
      * Fix use_cache for video, thanks @gante
      
      * Remove print statements
      
      * Remove assertion
      
      * Add processor tests
      
      * Fix model_input_names
      
      * Use Auto API for processor
      
      * Fix processor tests
      
      * Fix integration test
      
      * Fix pipeline test
      
      * Make tests faster
      
      * Update conversion script
      
      * Update conversion script
      
      * Convert more checkpoints
      
      * Update conversion script
      
      * Fix typo
      
      * Update docstrings
      
      * Improve code snippets
      
      * Fix doc tests
      
      * Add more code examples茅
      
      * Fix doc tests
      
      * Add integration tests
      
      * Fix unused variable
      
      * revert
      
      * Add GIT to Japanese README
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      9c6f7485
    • Konstantin Kotik's avatar
      `MinNewTokensLengthLogitsProcessor` for `.generate` method #20814 (#20892) · 367fdf33
      Konstantin Kotik authored
      
      
      * feat: add min new length logit processor
      
      * test: add min new length logit processor
      
      * docs: add MinNewTokensLengthLogitsProcessor
      
      * feat: import MinNewTokensLengthLogitsProcessor
      
      * fix: update pytorch dummy objects
      
      * refactor & fix: rename attributes and var and get rid of dynamic attribute
      
      * tests: align test with new interface
      
      * docs: fix typo
      
      * docs: minor clarification
      
      * Empty-Commit
      
      * empty commit
      
      * run automated quality edits
      Co-authored-by: default avatarJoao Gante <joao@huggingface.co>
      367fdf33
  13. 27 Dec, 2022 3 commits
  14. 26 Dec, 2022 1 commit
  15. 23 Dec, 2022 1 commit
  16. 21 Dec, 2022 1 commit