1. 09 Dec, 2021 8 commits
  2. 08 Dec, 2021 19 commits
    • Fix doc examples: name '...' is not defined (#14687) · ee6674d4
      Yih-Dar authored
      
      
      * Fix doc examples: name '...' is not defined
      
      * remove >>> and ... in some docstrings in visual_bert
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
    • Make MLuke tokenizer tests slow (#14690) · e6219320
      Sylvain Gugger authored
    • Move pyctcdecode (#14686) · 13186d71
      Sylvain Gugger authored
      * Move pyctcdecode dep
      
      * Fix doc and last objects
      
      * Quality
      
      * Style
      
      * Ignore this black
    • Stas Bekman's avatar
      d104dd46
    • [bf16 support] tweaks (#14580) · 12286612
      Stas Bekman authored
      
      
      * [bf16 support] tweaks
      
      * corrections
      Co-authored-by: Manuel R. Ciosici <manuelrciosici@gmail.com>
    • Yih-Dar's avatar
      16870d11
    • Sylvain Gugger's avatar
      01b8cd59
    • Fixes in init (#14681) · f6b87c5f
      Sylvain Gugger authored
      * Fixes in init
      
      * Style
    • Improvements to Comet Integration (#14680) · fe06f8dc
      Dhruv Nair authored
      * change args to address overwriting issue
      
      * remove project name from args
      
      * remove passing args as kwargs to experiment object
      
      * remove passing args as kwargs to offline experiment
      
      * fix offline directory assignment in experiment kwargs
      
      * log checkpoint folder on training end
      
      * log entire output_dir as asset folder
      
      * log asset folder recursively
      
      * end experiment at the end of training
      
      * clean up
      
      * clean up
      
      * Default to always log training assets to Comet when using CometCallback
      
      * change logging training assets to be true when running callback setup
      
      * fix so that experiment always ends when training ends
      
      * styling and quality fixes
      
      * update docstring for COMET_LOG_ASSETS environment variable
      
      * run styling and quality checks
      
      * clean up to docstring
      
      * remove merge markers
      
      * change asset logging to false to avoid hitting max assets per experiment limit
      
      * update training asset description
      
      * fix styling
    • fix: verify jsonlines file in run_translation (#14660) (#14661) · 4ea19de8
      Gaurang Tandon authored
      * fix: verify jsonl in run_translation (#14660)
      
      * fix(run_translation.py): json/jsonl validation
      
      Both json and jsonl are to be accepted as valid jsonlines file extension
      
      * fix(run_translation.py): make black happy
      
      * Ran make style
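
The json/jsonl extension check described in this commit could look roughly like the following. This is a minimal sketch under stated assumptions, not the code from run_translation.py: the names `VALID_JSONLINES_EXTENSIONS` and `is_jsonlines_path` are hypothetical and chosen only for illustration.

```python
from pathlib import Path

# Hypothetical constant: both extensions are treated as valid jsonlines files,
# as the commit message states.
VALID_JSONLINES_EXTENSIONS = {"json", "jsonl"}


def is_jsonlines_path(path: str) -> bool:
    """Return True if the file extension is one accepted for jsonlines data.

    Hypothetical helper for illustration; it inspects only the file name,
    not the file contents.
    """
    # Path.suffix yields e.g. ".jsonl"; strip the dot and ignore case.
    suffix = Path(path).suffix.lower().lstrip(".")
    return suffix in VALID_JSONLINES_EXTENSIONS
```

A caller might reject a training file early with something like `if not is_jsonlines_path(train_file): raise ValueError(...)` before attempting to load it.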
    • Convert tutorials (#14665) · cf36f4d7
      Sylvain Gugger authored
      * Convert a few docs
      
      * And another
      
      * Last tutorials
      
      * New syntax for colab links
      
      * Convert a few docs
      
      * And another
      
      * Last tutorials
      
      * New syntax for colab links
    • lewtun's avatar
      0f4e39c5
    • Added support for other features for already supported models (#14358) · 0c70f145
      Michael Benayoun authored
      * Added support for other features for already supported models
      
      * Partial support for causal and seq2seq models
      
      * Partial support for causal and seq2seq models
      
      * OnnxSeq2SeqConfigWithPast to support seq2seq models
      
      * Parameterized the onnx tests
      
      * Restored run_mlm.py
      
      * Restored run_mlm.py
      
      * [WIP] BART update
      
      * BART and MBART
      
      * Added comments
      
      * Another sequence length of the past_key_values
    • [AutoProcessor] Add Wav2Vec2WithLM & small fix (#14675) · ee4fa2e4
      Patrick von Platen authored
      * [AutoProcessor] Add Wav2Vec2WithLM & small fix
      
      * revert line removal
      
      * Update src/transformers/__init__.py
      
      * add test
      
      * up
      
      * up
      
      * small fix
    • Fix doc builder (#14676) · 2294071a
      Lysandre Debut authored
    • fix deprecated tf method (#14671) · fab3b518
      ZOHETH authored
      tf.matrix_band_part -> tf.linalg.band_part
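
The rename above changes only the function's location, not what it computes. As a rough illustration of the band-part operation as described in the TensorFlow documentation, here is a pure-Python sketch for a single 2-D matrix. The helper `band_part` below is a hypothetical stand-in written for this note, not TensorFlow code, and it ignores batching and dtypes.

```python
def band_part(matrix, num_lower, num_upper):
    """Keep a central band of a 2-D list-of-lists matrix, zeroing the rest.

    Element (m, n) is kept when
      (num_lower < 0 or m - n <= num_lower) and
      (num_upper < 0 or n - m <= num_upper).
    A negative bound means "keep the whole triangle on that side".
    """
    result = []
    for m, row in enumerate(matrix):
        new_row = []
        for n, value in enumerate(row):
            in_lower = num_lower < 0 or (m - n) <= num_lower
            in_upper = num_upper < 0 or (n - m) <= num_upper
            new_row.append(value if (in_lower and in_upper) else 0)
        result.append(new_row)
    return result
```

For example, `band_part(x, -1, 0)` keeps the lower triangle of `x`, and `band_part(x, 0, 0)` keeps only its diagonal.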
    • Add Perceiver IO (#14487) · 65b20b73
      NielsRogge authored
      * First draft
      
      * Style and remove mlm
      
      * Make forward pass work
      
      * More improvements
      
      * More improvements
      
      * Fix bug
      
      * More improvements
      
      * More improvements
      
      * Add PerceiverTokenizer first draft
      
      * Improve conversion script
      
      * More improvements
      
      * Make conversion script work for the encoder
      
      * Make conversion script work with local pickle files
      
      * Style & quality, fix-copies
      
      * Add dummy input to conversion script
      
      * Add absolute position embeddings to TextPreProcessor
      
      * Make forward pass of encoder work
      
      * More improvements
      
      * Move text preprocessor to separate script
      
      * More improvements
      
      * More improvements
      
      * Add post processor
      
      * Make MLM model work
      
      * Style
      
      * Add PerceiverForMaskedLM
      
      * Add PerceiverImagePreprocessor
      
      * Make style
      
      * Make PerceiverForImageClassification work
      
      * More improvements
      
      * More improvements
      
      * Use tokenizer in conversion script
      
      * Use PerceiverForMaskedLM in conversion script
      
      * Define custom PerceiverModelOutput
      
      * Improve PerceiverAttention to make it work for both MLM and image classification
      
      * More improvements
      
      * More improvements
      
      * More improvements to the conversion script
      
      * Make conversion script work for both MLM and image classification
      
      * Add PerceiverFeatureExtractor
      
      * More improvements
      
      * Style and quality
      
      * Add center cropping
      
      * Fix bug
      
      * Small fix
      
      * Add print statement
      
      * Fix bug in image preprocessor
      
      * Fix bug with conversion script
      
      * Make output position embeddings an nn.Parameter layer instead of nn.Embedding
      
      * Comment out print statements
      
      * Add position encoding classes
      
      * More improvements
      
      * Use position_encoding_kwargs
      
      * Add PerceiverForImageClassificationFourier
      
      * Make style & quality
      
      * Add PerceiverForImageClassificationConvProcessing
      
      * Style & quality
      
      * Add flow model
      
      * Move processors to modeling file
      
      * Make position encodings modular
      
      * Make basic decoder use modular position encodings
      
      * Add PerceiverForOpticalFlow to conversion script
      
      * Add AudioPreprocessor
      
      * Make it possible for the basic decoder to use Fourier position embeddings
      
      * Add PerceiverForMultimodalAutoencoding
      
      * Improve model for optical flow
      
      * Improve _build_network_inputs method
      
      * Add print statement
      
      * Fix device issue
      
      * Fix device of Fourier embeddings
      
      * Add print statements for debugging
      
      * Add another print statement
      
      * Add another print statement
      
      * Add another print statement
      
      * Add another print statement
      
      * Improve PerceiverAudioPreprocessor
      
      * Improve conversion script for multimodal model
      
      * More improvements
      
      * More improvements
      
      * Improve multimodal model
      
      * Make forward pass multimodal model work
      
      * More improvements
      
      * Improve tests
      
      * Fix some more tests
      
      * Add output dataclasses
      
      * Make more tests pass
      
      * Add print statements for debugging
      
      * Add tests for image classification
      
      * Add PerceiverClassifierOutput
      
      * More improvements
      
      * Make more tests pass for the optical flow model
      
      * Make style & quality
      
      * Small improvements
      
      * Don't support training for optical flow model for now
      
      * Fix _prepare_for_class for tests
      
      * Make more tests pass, add some docs
      
      * Add multimodal model to tests
      
      * Minor fixes
      
      * Fix tests
      
      * Improve conversion script
      
      * Make fixup
      
      * Remove pos_dim argument
      
      * Fix device issue
      
      * Potential fix for OOM
      
      * Revert previous commit
      
      * Fix test_initialization
      
      * Add print statements for debugging
      
      * Fix print statement
      
      * Add print statement
      
      * Add print statement
      
      * Add print statement
      
      * Add print statement
      
      * Add print statement
      
      * Add print statement
      
      * Remove need for output_shape
      
      * Comment out output_shape
      
      * Remove unnecessary code
      
      * Improve docs
      
      * Fix make fixup
      
      * Remove PerceiverTextProcessor from init
      
      * Improve docs
      
      * Small improvement
      
      * Apply first batch of suggestions from code review
      
      * Apply more suggestions from code review
      
      * Update docstrings
      
      * Define dicts beforehand for readability
      
      * Rename task to architecture in conversion script, include PerceiverModel in tests
      
      * Add print statements for debugging
      
      * Fix tests on GPU
      
      * Remove preprocessors, postprocessors and decoders from main init
      
      * Add integration test
      
      * Fix docs
      
      * Replace einops by torch
      
      * Update for new docs frontend
      
      * Rename PerceiverForImageClassification
      
      * Improve docs
      
      * Improve docs
      
      * Improve docs of PerceiverModel
      
      * Fix some more tests
      
      * Improve center_crop
      
      * Add PerceiverForSequenceClassification
      
      * Small improvements
      
      * Fix tests
      
      * Add integration test for optical flow model
      
      * Clean up
      
      * Add tests for tokenizer
      
      * Fix tokenizer by adding special tokens properly
      
      * Fix CI
    • [Wav2Vec2] PyCTCDecode Integration to support language model boosted decoding (#14339) · 961732c2
      Patrick von Platen authored
      
      
      * up
      
      * up
      
      * up
      
      * make it cleaner
      
      * correct
      
      * make style
      
      * add more tests
      
      * finish
      
      * small fix
      
      * make style
      
      * up
      
      * try to solve circle ci
      
      * up
      
      * fix more tests
      
      * fix more tests
      
      * apply Sylvain's suggestions
      
      * fix import
      
      * correct docs
      
      * add pyctcdecode only to speech tests
      
      * fix more tests
      
      * add tf, flax and pt tests
      
      * add pt
      
      * fix last tests
      
      * fix more tests
      
      * Apply suggestions from code review
      
      * change lines
      
      * Apply suggestions from code review
      Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
      
      * correct tests
      
      * correct tests
      
      * add doc string
      Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
    • Fixing Dataset for TQA + token-classification. (#14658) · 2e12d90b
      Nicolas Patry authored
      * Fixing Dataset for TQA + token-classification.
      
      * Fixing the tests.
      
      * Making sure `offset_mappings` is a valid argument.
  3. 07 Dec, 2021 5 commits
  4. 06 Dec, 2021 8 commits