"docs/source/ja/tasks/image_classification.md" did not exist on "ae2dd42be5f577c7ac3f336cc43ee6b51803d8cd"
  1. 26 Jan, 2022 2 commits
  2. 21 Jan, 2022 1 commit
  3. 20 Jan, 2022 1 commit
  4. 19 Jan, 2022 3 commits
    • Add ViLT (#14895) · ac227093
      NielsRogge authored
      
      
      * First commit
      
      * Add conversion script
      
      * Make conversion script work for base model
      
      * More improvements
      
      * Update conversion script, works for vqa
      
      * Add indexing argument to meshgrid
      
      * Make conversion script work for ViltForPreTraining
      
      * Add ViltForPreTraining to docs
      
      * Fix device issue
      
      * Add processor
      
      * Add MinMaxResize to feature extractor
      
      * Implement call method of ViltProcessor
      
      * Fix tests
      
      * Add integration test
      
      * Add loss calculation for VQA
      
      * Improve tests
      
      * Improve some more tests
      
      * Debug tests
      
      * Small improvements
      
      * Add support for attention_mask
      
      * Remove mask_it
      
      * Add pixel_mask
      
      * Add tests for ViltFeatureExtractor
      
      * Improve tests
      
      * Add ViltForNaturalLanguageVisualReasoning
      
      * Add ViltForNaturalLanguageVisualReasoning to conversion script
      
      * Minor fixes
      
      * Add support for image_embeds, update docstrings to markdown
      
      * Update docs to markdown
      
      * Improve conversion script
      
      * Rename ViltForPreTraining to ViltForMaskedLM
      
      * Improve conversion script
      
      * Convert docstrings to markdown
      
      * Fix code example of retrieval model
      
      * Properly convert masked language model
      
      * Add integration test for nlvr
      
      * Fix code quality
      
      * Apply suggestions from code review
      
      * Add copied from statements
      
      * Fix pretrained_config_archive_map
      
      * Fix docs
      
      * Add model to README
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Apply more suggestions from code review
      
      * Make code more readable
      
      * Add ViltForNaturalLanguageVisualReasoning to the tests
      
      * Rename ViltForVisualQuestionAnswering to ViltForQuestionAnswering
      
      * Replace pixel_values_2 by single tensor
      
      * Add hidden_states and attentions
      
      * Fix one more test
      
      * Fix all tests
      
      * Update year
      
      * Fix rebase issues
      
      * Fix another rebase issue
      
      * Remove ViltForPreTraining from auto mapping
      
      * Rename ViltForImageRetrievalTextRetrieval to ViltForImageAndTextRetrieval
      
      * Make it possible to use BertTokenizerFast in the processor
      
      * Use BertTokenizerFast by default
      
      * Rename ViltForNaturalLanguageVisualReasoning, define custom model output
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    • [ViTMAE] Various fixes (#15221) · 842298f8
      NielsRogge authored
      * Add MAE to AutoFeatureExtractor
      
      * Add link to notebook
      
      * Fix relative paths
    • Add FastTokenizer to REALM (#15211) · 841d9791
      Li-Huai (Allan) Lin authored
      * Remove BertTokenizer abstraction
      
      * Add FastTokenizer to REALM
      
      * Fix config archive map
      
      * Fix copies
      
      * Update realm.mdx
      
      * Apply suggestions from code review
  5. 18 Jan, 2022 3 commits
    • Finish conversion of REALM doc to MDX · db350394
      Sylvain Gugger authored
    • Add MAE (#15120) · 74bec986
      NielsRogge authored
      * First draft
      
      * More improvements
      
      * More improvements
      
      * More improvements
      
      * Fix embeddings
      
      * Add conversion script
      
      * Finish conversion script
      
      * More improvements
      
      * Fix forward pass
      
      * Remove print statements
      
      * Add weights initialization
      
      * Add initialization of decoder weights
      
      * Add support for other models in the conversion script
      
      * Fix patch_size for huge model
      
      * Fix most of the tests
      
      * Fix integration test
      
      * Fix docs
      
      * Fix archive_list
      
      * Apply suggestions from code review
      
      * Improve documentation
      
      * Apply more suggestions
      
      * Skip some tests due to non-deterministic behaviour
      
      * Fix test_initialization
      
      * Remove unnecessary initialization of nn.Embedding
      
      * Improve docs
      
      * Fix dummies
      
      * Remove ViTMAEFeatureExtractor from docs
      
      * Add model to README and table of contents
      
      * Delete inference file
    • Add REALM (#13292) · 22454ae4
      Li-Huai (Allan) Lin authored
      
      
      * REALM initial commit
      
      * Retriever OK (Update new_gelu).
      
      * Encoder prediction score OK
      
      * Encoder pretrained model OK
      
      * Update retriever comments
      
      * Update docs, tests, and imports
      
      * Prune unused models
      
      * Make embedder as a module `RealmEmbedder`
      
      * Add RealmRetrieverOutput
      
      * Update tokenization
      
      * Pass all tests in test_modeling_realm.py
      
      * Prune RealmModel
      
      * Update docs
      
      * Add training test.
      
      * Remove completed TODO
      
      * Style & Quality
      
      * Prune `RealmModel`
      
      * Fixup
      
      * Changes:
      1. Remove RealmTokenizerFast
      2. Update docstrings
      3. Add a method to RealmTokenizer to handle candidates tokenization.
      
      * Fix up
      
      * Style
      
      * Add tokenization tests
      
      * Update `from_pretrained` tests
      
      * Apply suggestions
      
      * Style & Quality
      
      * Copy BERT model
      
      * Fix comment to avoid docstring copying
      
      * Make RealmBertModel private
      
      * Fix bug
      
      * Style
      
      * Basic QA
      
      * Save
      
      * Complete reader logits
      
      * Add searcher
      
      * Complete searcher & reader
      
      * Move block records init to constructor
      
      * Fix training bug
      
      * Add some outputs to RealmReader
      
      * Add finetuned checkpoint variable names parsing
      
      * Fix bug
      
      * Update REALM config
      
      * Add RealmForOpenQA
      
      * Update convert_tfrecord logits
      
      * Fix bugs
      
      * Complete imports
      
      * Update docs
      
      * Update naming
      
      * Add brute-force searcher
      
      * Pass realm model tests
      
      * Style
      
      * Exclude RealmReader from common tests
      
      * Fix
      
      * Fix
      
      * convert docs
      
      * up
      
      * up
      
      * more make style
      
      * up
      
      * upload
      
      * up
      
      * Fix
      
      * Update src/transformers/__init__.py
      
      * adapt testing
      
      * change modeling code
      
      * fix test
      
      * up
      
      * up
      
      * up
      
      * correct more
      
      * make retriever work
      
      * update
      
      * make style
      
      * finish main structure
      
      * Resolve merge conflict
      
      * Make everything work
      
      * Style
      
      * Fixup
      
      * Fixup
      
      * Update training test
      
      * fix retriever
      
      * remove hardcoded path
      
      * Fix
      
      * Fix modeling test
      
      * Update model links
      
      * Initial retrieval test
      
      * Fix modeling test
      
      * Complete retrieval tests
      
      * Fix
      
      * style
      
      * Fix tests
      
      * Fix docstring example
      
      * Minor fix of retrieval test
      
      * Update license headers and docs
      
      * Apply suggestions from code review
      
      * Style
      
      * Apply suggestions from code review
      
      * Add an example to RealmEmbedder
      
      * Fix
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  6. 11 Jan, 2022 1 commit
  7. 10 Jan, 2022 2 commits
  8. 04 Jan, 2022 1 commit
  9. 03 Jan, 2022 1 commit
  10. 28 Dec, 2021 2 commits
    • Style · 2c5597f6
      Sylvain Gugger authored
    • Doc styler examples (#14953) · b5e2b183
      Sylvain Gugger authored
      * Fix bad examples
      
      * Add black formatting to style_doc
      
      * Use first nonempty line
      
      * Put it at the right place
      
      * Don't add spaces to empty lines
      
      * Better templates
      
      * Deal with triple quotes in docstrings
      
      * Result of style_doc
      
      * Enable mdx treatment and fix code examples in MDXs
      
      * Result of doc styler on doc source files
      
      * Last fixes
      
      * Break copy from
  11. 27 Dec, 2021 3 commits
  12. 23 Dec, 2021 1 commit
    • Add TFCLIPModel (#13967) · 8f2cc1c3
      Yih-Dar authored
      
      
      * Start the work for TFCLIPModel
      
      * Convert to TF code (TODO: loss + doc)
      
      * Clean up
      
      * Fix pooled_output for TFCLIPTextTransformer - using tf.gather_nd
      
      * assert -> raise error
      
      * Expose TFCLIPModel
      
      * Deal with dummy_inputs
      
      * Add tests
      
      * Fix all tests. TODO: manual check weight loading + add more comments
      
      * Fix pt tf equivalence test
      
      * fixes
      
      * update TFCLIPVisionEmbeddings's Conv2D
      
      * Fix loss + overwrite test_pt_tf_model_equivalence from common
      
      * Add a comment about the change about MainLayer in test_keras_save_load
      
      * Set return_loss=True in TFCLIPModelTester + make tests pass
      
      * overwrite test_pt_tf_model_equivalence from tf common
      
      * fix base_model_prefix
      
      * Fix examples
      
      * remove unused
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * apply review suggestions
      
      * change self.pre_layrnorm to self.pre_layernorm
      
      * apply more review suggestions
      
      * return attention probs before dropout (to align with PT)
      
      * fix weight init
      
      * fix
      
      * build doc
      
      * fix missing doc
      
      * fix for test
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  13. 22 Dec, 2021 3 commits
  14. 20 Dec, 2021 2 commits
  15. 17 Dec, 2021 2 commits
  16. 16 Dec, 2021 2 commits
    • Add WavLM (#14354) · bef1e3e4
      Patrick von Platen authored
      
      
      * first commit
      
      * fix some stuff
      
      * fix more readme
      
      * Apply suggestions from code review
      
      * update
      
      * correct
      
      * up
      
      * attn layer works
      
      * push code
      
      * make models work
      
      * Small change
      
      * more refactor
      
      * finish
      
      * up
      
      * fix conversion
      
      * fix position bias
      
      * Fix style
      
      * fix conversion
      
      * make fix-copies
      
      * add
      
      * clean
      
      * fix docs
      
      * fix
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * apply final changes
      
      * make fix-copies
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    • Add Speaker Diarization and Verification heads (#14723) · 48463ebb
      Anton Lozhkov authored
      * Models
      
      * Squashed commit of the following:
      
      commit 72278e1e931a16d0879acc77f65762f3364833d0
      Author: anton-l <aglozhkov@gmail.com>
      Date:   Fri Dec 10 21:45:08 2021 +0300
      
      * Add unispeech heads
      
      * Add sd/sv automodels
      
      * Docs cleanup
      
      * Fix docstrings
      
      * rename xvector classes
      
      * examples
      
      * Tests cleanup
      
      * Style
      
      * Better checkpoints for tests
      
      * leftover docs
      
      * apply review suggestions
      
      * Style + init tests
      
      * Update unispeech-sat tdnn downsampling
  17. 15 Dec, 2021 2 commits
  18. 13 Dec, 2021 2 commits
  19. 08 Dec, 2021 3 commits
    • Move pyctcdecode (#14686) · 13186d71
      Sylvain Gugger authored
      * Move pyctcdecode dep
      
      * Fix doc and last objects
      
      * Quality
      
      * Style
      
      * Ignore this black
    • Add Perceiver IO (#14487) · 65b20b73
      NielsRogge authored
      * First draft
      
      * Style and remove mlm
      
      * Make forward pass work
      
      * More improvements
      
      * More improvements
      
      * Fix bug
      
      * More improvements
      
      * More improvements
      
      * Add PerceiverTokenizer first draft
      
      * Improve conversion script
      
      * More improvements
      
      * Make conversion script work for the encoder
      
      * Make conversion script work with local pickle files
      
      * Style & quality, fix-copies
      
      * Add dummy input to conversion script
      
      * Add absolute position embeddings to TextPreProcessor
      
      * Make forward pass of encoder work
      
      * More improvements
      
      * Move text preprocessor to separate script
      
      * More improvements
      
      * More improvements
      
      * Add post processor
      
      * Make MLM model work
      
      * Style
      
      * Add PerceiverForMaskedLM
      
      * Add PerceiverImagePreprocessor
      
      * Make style
      
      * Make PerceiverForImageClassification work
      
      * More improvements
      
      * More improvements
      
      * Use tokenizer in conversion script
      
      * Use PerceiverForMaskedLM in conversion script
      
      * Define custom PerceiverModelOutput
      
      * Improve PerceiverAttention to make it work for both MLM and image classification
      
      * More improvements
      
      * More improvements
      
      * More improvements to the conversion script
      
      * Make conversion script work for both MLM and image classification
      
      * Add PerceiverFeatureExtractor
      
      * More improvements
      
      * Style and quality
      
      * Add center cropping
      
      * Fix bug
      
      * Small fix
      
      * Add print statement
      
      * Fix bug in image preprocessor
      
      * Fix bug with conversion script
      
      * Make output position embeddings an nn.Parameter layer instead of nn.Embedding
      
      * Comment out print statements
      
      * Add position encoding classes
      
      * More improvements
      
      * Use position_encoding_kwargs
      
      * Add PerceiverForImageClassificationFourier
      
      * Make style & quality
      
      * Add PerceiverForImageClassificationConvProcessing
      
      * Style & quality
      
      * Add flow model
      
      * Move processors to modeling file
      
      * Make position encodings modular
      
      * Make basic decoder use modular position encodings
      
      * Add PerceiverForOpticalFlow to conversion script
      
      * Add AudioPreprocessor
      
      * Make it possible for the basic decoder to use Fourier position embeddings
      
      * Add PerceiverForMultimodalAutoencoding
      
      * Improve model for optical flow
      
      * Improve _build_network_inputs method
      
      * Add print statement
      
      * Fix device issue
      
      * Fix device of Fourier embeddings
      
      * Add print statements for debugging
      
      * Add another print statement
      
      * Add another print statement
      
      * Add another print statement
      
      * Add another print statement
      
      * Improve PerceiverAudioPreprocessor
      
      * Improve conversion script for multimodal model
      
      * More improvements
      
      * More improvements
      
      * Improve multimodal model
      
      * Make forward pass multimodal model work
      
      * More improvements
      
      * Improve tests
      
      * Fix some more tests
      
      * Add output dataclasses
      
      * Make more tests pass
      
      * Add print statements for debugging
      
      * Add tests for image classification
      
      * Add PerceiverClassifierOutput
      
      * More improvements
      
      * Make more tests pass for the optical flow model
      
      * Make style & quality
      
      * Small improvements
      
      * Don't support training for optical flow model for now
      
      * Fix _prepare_for_class for tests
      
      * Make more tests pass, add some docs
      
      * Add multimodal model to tests
      
      * Minor fixes
      
      * Fix tests
      
      * Improve conversion script
      
      * Make fixup
      
      * Remove pos_dim argument
      
      * Fix device issue
      
      * Potential fix for OOM
      
      * Revert previous commit
      
      * Fix test_initialization
      
      * Add print statements for debugging
      
      * Fix print statement
      
      * Add print statement
      
      * Add print statement
      
      * Add print statement
      
      * Add print statement
      
      * Add print statement
      
      * Add print statement
      
      * Remove need for output_shape
      
      * Comment out output_shape
      
      * Remove unnecessary code
      
      * Improve docs
      
      * Fix make fixup
      
      * Remove PerceiverTextProcessor from init
      
      * Improve docs
      
      * Small improvement
      
      * Apply first batch of suggestions from code review
      
      * Apply more suggestions from code review
      
      * Update docstrings
      
      * Define dicts beforehand for readability
      
      * Rename task to architecture in conversion script, include PerceiverModel in tests
      
      * Add print statements for debugging
      
      * Fix tests on GPU
      
      * Remove preprocessors, postprocessors and decoders from main init
      
      * Add integration test
      
      * Fix docs
      
      * Replace einops by torch
      
      * Update for new docs frontend
      
      * Rename PerceiverForImageClassification
      
      * Improve docs
      
      * Improve docs
      
      * Improve docs of PerceiverModel
      
      * Fix some more tests
      
      * Improve center_crop
      
      * Add PerceiverForSequenceClassification
      
      * Small improvements
      
      * Fix tests
      
      * Add integration test for optical flow model
      
      * Clean up
      
      * Add tests for tokenizer
      
      * Fix tokenizer by adding special tokens properly
      
      * Fix CI
    • [Wav2Vec2] PyCTCDecode Integration to support language model boosted decoding (#14339) · 961732c2
      Patrick von Platen authored
      
      
      * up
      
      * up
      
      * up
      
      * make it cleaner
      
      * correct
      
      * make style
      
      * add more tests
      
      * finish
      
      * small fix
      
      * make style
      
      * up
      
      * try to solve circle ci
      
      * up
      
      * fix more tests
      
      * fix more tests
      
      * apply sylvains suggestions
      
      * fix import
      
      * correct docs
      
      * add pyctcdecode only to speech tests
      
      * fix more tests
      
      * add tf, flax and pt tests
      
      * add pt
      
      * fix last tests
      
      * fix more tests
      
      * Apply suggestions from code review
      
      * change lines
      
      * Apply suggestions from code review
      Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
      
      * correct tests
      
      * correct tests
      
      * add doc string
      Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
  20. 07 Dec, 2021 1 commit
    • Add mLUKE (#14640) · 30646a0a
      Ryokan RI authored
      * implement MLukeTokenizer and LukeForMaskedLM
      
      * update tests
      
      * update docs
      
      * add LukeForMaskedLM to check_repo.py
      
      * update README
      
      * fix test and specify the entity pad id in tokenization_(m)luke
      
      * fix EntityPredictionHeadTransform
  21. 06 Dec, 2021 1 commit
  22. 02 Dec, 2021 1 commit
    • [Flax] Add FlaxBlenderbotSmall (#14576) · 50d909be
      Daniel Stancl authored
      
      
      * [WIP] Add FlaxBlenderbotSmall
      
      * Revert some unintentionally changed files
      
      Revert some files unintentionally changed by improperly filled cookiecutter instructions.
      
      * Fix repo consistency
      
      * Fix Flax-PT equivalence
      
      * Apply suggestions from code review
      
      * Update index.mdx
      
      * Apply suggestions from code review
      Co-authored-by: Suraj Patil <surajp815@gmail.com>