1. 30 Nov, 2022 5 commits
    • Yang An's avatar
      Add Chinese-CLIP implementation (#20368) · 72176402
      Yang An authored
      
      
      * init chinese-clip model from clip
      
      * init model tests and docs
      
      * implement chinese-clip into hf
      
      * implement chinese-clip into hf
      
      * implement chinese-clip into hf
      
      * implement chinese-clip into hf
      
      * implement chinese-clip into hf
      
      * update usecase example in model implementation
      
      * fix codestyle
      
      * fix model_type typo in readme
      
      * add placeholder in doc
      
      * add placeholder in doc
      
      * update the init script
      
      * update usecase
      
      * fix codestyle
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * forward the convert_rgb
      
      * update testcase
      
      * update testcase
      
      * update testcase
      
      * merge the recent update from clip about model_input_name property
      
      * update the doc
      
      * update the doc
      
      * update the doc
      
      * update the doc
      
      * remove unused imports
      
      * reformat code style
      
      * update the doc
      
      * fix isort style
      
      * bypass a weird failed unit test which is unrelated with my PR
      
      * update the doc
      
      * implement independent vision config class
      
      * implement independent vision model class
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * make style
      
      * fix refactor bug
      
      * make style
      
      * fix refactor bug
      
      * fix refactor bug
      
      * make style
      
      * fix refactor bug
      
      * fix refactor bug
      
      * doc-build restyle
      
      * implement independent text config class
      
      * implement independent text model class
      
      * implement independent text model class
      
      * make style
      
      * make fix-copies
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * fix refactor bug
      
      * make style
      
      * update doc
      
      * black and isort
      
      * update doc
      
      * Update src/transformers/models/chinese_clip/configuration_chinese_clip.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/auto/tokenization_auto.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * modify the model type from chinese-clip to chinese_clip
      
      * format the example comment of ChineseCLIPVisionConfig
      
      * correct the copyright comment
      
      * fix the tokenizer specification
      
      * add copied from for loss function
      
      * remove unused class
      
      * update CHINESE_CLIP_TEXT_INPUTS_DOCSTRING
      
      * update CHINESE_CLIP_INPUTS_DOCSTRING
      
      * update doc
      
      * update doc
      
      * update code comment in config
      
      * update copied from statement
      
      * make style
      
      * rename the doc file
      
      * add copied statement
      
      * remove unused attention_mask, causal_attention_mask in ChineseCLIPVisionEncoder
      
      * remove ChineseCLIPTextPreTrainedModel
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update doc
      
      * make style
      
      * Update src/transformers/models/chinese_clip/configuration_chinese_clip.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/chinese_clip/configuration_chinese_clip.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * update ChineseCLIPImageProcessor in image_processing_auto
      
      * fix config_class of chinesecliptextmodel
      
      * fix the test case
      
      * update the docs
      
      * remove the copied from comment for ChineseCLIPTextModel, since it has diverged from BertModel with customed config_class
      
      * update the testcase
      
      * final fix
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      72176402
    • Sylvain Gugger's avatar
      Repurpose torchdynamo training args towards torch._dynamo (#20498) · 08b46218
      Sylvain Gugger authored
      * Repurpose torchdynamo training args towards torch._dynamo
      
      * Add doc
      08b46218
    • Julian Pollmann's avatar
      Fix Typo in Docs for GPU (#20509) · 829374e4
      Julian Pollmann authored
      829374e4
    • amyeroberts's avatar
      Update doc examples feature extractor -> image processor (#20501) · 17a7b49b
      amyeroberts authored
      * Update doc example feature extractor -> image processor
      
      * Apply suggestions from code review
      17a7b49b
    • amyeroberts's avatar
      Add segmentation + object detection image processors (#20160) · de6d19ea
      amyeroberts authored
      * Add transforms for object detection
      
      * DETR models + Yolos
      
      * Scrappy additions
      
      * Maskformer image processor
      
      * Fix up; MaskFormer tests
      
      * Update owlvit processor
      
      * Add to docs
      
      * OwlViT tests
      
      * Update pad logic
      
      * Remove changes to transforms
      
      * Import fn directly
      
      * Update to include pad transformation
      
      * Remove uninstended changes
      
      * Add new owlvit post processing function
      
      * Tidy up
      
      * Fix copies
      
      * Fix some copies
      
      * Include device fix
      
      * Fix scipy imports
      
      * Update _pad_image
      
      * Update padding functionality
      
      * Fix bug
      
      * Properly handle ignore index
      
      * Fix up
      
      * Remove defaults to None in docstrings
      
      * Fix docstrings & docs
      
      * Fix sizes bug
      
      * Resolve conflicts in init
      
      * Cast to float after resizing
      
      * Tidy & add size if missing
      
      * Allow kwards when processing for owlvit
      
      * Update test values
      de6d19ea
  2. 29 Nov, 2022 3 commits
  3. 28 Nov, 2022 3 commits
  4. 23 Nov, 2022 1 commit
  5. 21 Nov, 2022 9 commits
    • Ian C's avatar
      Fix toctree for Section 3 in Spanish Documentation (#20360) · c3eb0101
      Ian C authored
      * Order and group topics in the right section
      
      * Translate "Computer Vision"
      c3eb0101
    • Steven Liu's avatar
      Add inference section to task guides (#18781) · d896029e
      Steven Liu authored
      * 📝 start adding inference section to task guides
      
      *  make style
      
      * 📝 add multiple choice
      
      * add rest of inference sections
      
      * make style
      
      * add compute_metric, push_to_hub, pipeline
      
      * make style
      
      * add updated sequence and token classification
      
      * make style
      
      * make edits in token classification
      
      * add audio classification
      
      * make style
      
      * add asr
      
      * make style
      
      * add image classification
      
      * make style
      
      * add summarization
      
      * make style
      
      * add translation
      
      * make style
      
      * add multiple choice
      
      * add language modeling
      
      * add qa
      
      * make style
      
      * review and edits
      
      * apply reviews
      
      * make style
      
      * fix call to processor
      
      * apply audio reviews
      
      * update to better asr model
      
      * make style
      d896029e
    • NielsRogge's avatar
      Add Audio Spectogram Transformer (#19981) · 4973d2a0
      NielsRogge authored
      
      
      * First draft
      
      * Make conversion script work
      
      * Add id2label mapping, run code quality
      
      * Fix copies
      
      * Add first draft of feature extractor
      
      * Update conversion script to use feature extractor
      
      * Make more tests pass
      
      * Add docs
      
      * update input_features to input_values + pad by default to max length
      
      * Fix doc tests
      
      * Add feature extractor tests
      
      * Add proper padding/truncation to feature extractor
      
      * Add support for conversion of all audioset checkpoints
      
      * Improve docs and extend conversion script
      
      * Fix README
      
      * Rename spectogram to spectrogram
      
      * Fix copies
      
      * Add integration test
      
      * Remove dummy conv
      
      * Update to ast
      
      * Update organization
      
      * Fix init
      
      * Rename model to AST
      
      * Add require_torchaudio annotator
      
      * Move import of ASTFeatureExtractor under a is_speech_available
      
      * Fix rebase
      
      * Add pipeline config
      
      * Update name of classifier head
      
      * Rename time_dimension and frequency_dimension for clarity
      
      * Remove print statement
      
      * Fix pipeline test
      
      * Fix pipeline test
      
      * Fix index table
      
      * Fix init
      
      * Fix conversion script
      
      * Rename to ForAudioClassification
      
      * Fix index table
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      4973d2a0
    • NielsRogge's avatar
      Add resources (#20296) · 96783e53
      NielsRogge authored
      
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      96783e53
    • Matthijs Hollemans's avatar
      add MobileNetV1 model (#17799) · d21c97cc
      Matthijs Hollemans authored
      * add model files etc for MobileNetV2
      
      rename files for MobileNetV1
      
      initial implementation of MobileNetV1
      
      fix conversion script
      
      cleanup
      
      write docs
      
      tweaks
      
      fix conversion script
      
      extract hidden states
      
      fix test cases
      
      make fixup
      
      fixup it all
      
      remove main from doc link
      
      fixes
      
      fix tests
      
      fix up
      
      use google org
      
      fix weird assert
      
      * fixup
      
      * use google organization for checkpoints
      d21c97cc
    • Raj Rajhans's avatar
      fix: "BigSicence" typo in docs (#20331) · 22d7161a
      Raj Rajhans authored
      22d7161a
    • Ian C's avatar
      Add Spanish translation of serialization.mdx (#20245) · d28448c5
      Ian C authored
      * Update _toctree and clone original content
      
      * Translate first three sections
      
      * Add more translated chapters. Only 3 more left.
      
      * Finish translation
      
      * Run style from doc-builder
      
      * Address recommended changes from reviewer
      d28448c5
    • BFSS's avatar
      translate zh quicktour(#20095) (#20181) · 05d80d85
      BFSS authored
      
      
      * zh quicktour(#20095)
      
      * add zh to doc workflow
      
      * remove untranslation from toctree
      Co-authored-by: default avatarBeifangSusu <BeifangSusu@bfss.com>
      05d80d85
    • Joao Gante's avatar
  6. 18 Nov, 2022 3 commits
    • Steven Liu's avatar
      organize pipelines by modality (#20306) · d316037a
      Steven Liu authored
      d316037a
    • Ali Hassani's avatar
      Add Neighborhood Attention Transformer (NAT) and Dilated NAT (DiNAT) models (#20219) · fc4a993e
      Ali Hassani authored
      * Add DiNAT
      
      * Adds DiNAT + tests
      
      * Minor fixes
      
      * Added HF model
      
      * Add natten to dependencies.
      
      * Cleanup
      
      * Minor fixup
      
      * Reformat
      
      * Optional NATTEN import.
      
      * Reformat & add doc to _toctree
      
      * Reformat (finally)
      
      * Dummy objects for DiNAT
      
      * Add NAT + minor changes
      
      Adds NAT as its own independent model + docs, tests
      Adds NATTEN to ext deps to ensure ci picks it up.
      
      * Remove natten from `all` and `dev-torch` deps, add manual pip install to ci tests
      
      * Minor fixes.
      
      * Fix READMEs.
      
      * Requested changes to docs + minor fixes.
      
      * Requested changes.
      
      * Add NAT/DiNAT tests to layoutlm_job
      
      * Correction to Dinat doc.
      
      * Requested changes.
      fc4a993e
    • amyeroberts's avatar
      Add padding image transformation (#19838) · b9826942
      amyeroberts authored
      * Add padding transformation
      
      * Add in upstream changes
      
      * Update tests & docs
      
      * Code formatting tuples in docstring
      b9826942
  7. 16 Nov, 2022 2 commits
  8. 15 Nov, 2022 9 commits
  9. 14 Nov, 2022 4 commits
    • Wonhyeong Seo's avatar
      docs: translated index page to korean (#20180) · 07d8d6e2
      Wonhyeong Seo authored
      docs: i18n: first draft of index page
      docs: fix: first revision of index page
      docs: i18n: missed section - supported frameworks
      docs: fix: second revision of index page
      review by @ArthurZucker
      
      refactor: remove untranslated files from korean
      docs: fix: remove untranslated references from toctree.yml
      feat: enable korean docs in gh actions
      docs: feat: add in_translation page as placeholder
      docs: bug: testing if internal toc need alphabet chars
      docs: fix: custom english anchor for non-alphanumeric headings
      review by @sgugger
      
      docs: i18n: translate comments on install methods in _config.py
      docs: refactor: more concise wording for translations
      07d8d6e2
    • code-with-rajeev's avatar
      Update README.md (#19530) · 2308f3d4
      code-with-rajeev authored
      Fixed a grammatical error.
      2308f3d4
    • Bartosz Szmelczynski's avatar
      Fix tapas scatter (#20149) · 78a471ff
      Bartosz Szmelczynski authored
      
      
      * First draft
      
      * Remove scatter dependency
      
      * Add require_torch
      
      * update vectorized sum test, add clone call
      
      * remove artifacts
      
      * fix style
      
      * fix style v2
      
      * remove "scatter" mentions from the code base
      
      * fix isort error
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      78a471ff
    • Matthijs Hollemans's avatar
      add MobileNetV2 model (#17845) · f711d683
      Matthijs Hollemans authored
      * add model files etc for MobileNetV2
      
      * rename files for MobileNetV1
      
      * initial implementation of MobileNetV1
      
      * fix conversion script
      
      * cleanup
      
      * write docs
      
      * tweaks
      
      * fix conversion script
      
      * extract hidden states
      
      * fix test cases
      
      * make fixup
      
      * fixup it all
      
      * rename V1 to V2
      
      * fix checkpoints
      
      * fixup
      
      * implement first block + weight conversion
      
      * add remaining layers
      
      * add output stride and dilation
      
      * fixup
      
      * add tests
      
      * add deeplabv3+ head
      
      * a bit of fixup
      
      * finish deeplab conversion
      
      * add link to doc
      
      * fix issue with JIT trace
      
      in_height and in_width would be Tensor objects during JIT trace, which caused Core ML conversion to fail on the remainder op. By making them ints, the result of the padding calculation becomes a constant value.
      
      * cleanup
      
      * fix order of models
      
      * fix rebase error
      
      * remove main from doc link
      
      * add image processor
      
      * remove old feature extractor
      
      * fix converter + other issues
      
      * fixup
      
      * fix unit test
      
      * add to onnx tests (but these appear broken now)
      
      * add post_process_semantic_segmentation
      
      * use google org
      
      * remove unused imports
      
      * move args
      
      * replace weird assert
      f711d683
  10. 10 Nov, 2022 1 commit