1. 19 Jun, 2023 1 commit
  2. 16 Jun, 2023 9 commits
  3. 15 Jun, 2023 16 commits
  4. 14 Jun, 2023 11 commits
    • Add MMS CTC Fine-Tuning (#24281) · 1609a436
      Patrick von Platen authored
      * Add mms ctc fine tuning
      
      * make style
      
      * More fixes that are needed
      
      * make fix-copies
      
      * make draft for README
      
      * add new file
      
      * move to new file
      
      * make style
      
      * make style
      
      * add quick test
      
      * make style
      
      * make style
    • [WIP] add EnCodec model (#23655) · 0c3fdccf
      Matthijs Hollemans authored
      
      
      * boilerplate stuff
      
      * messing around with the feature extractor
      
      * fix feature extractor
      
      * unit tests for feature extractor
      
      * rename speech to audio
      
      * quick-and-dirty import of Meta's code
      
      * import weights (sort of)
      
      * cleaning up
      
      * more cleaning up
      
      * move encoder/decoder args into config
      
      * cleanup model
      
      * rename EnCodec -> Encodec
      
      * RVQ parameters in config
      
      * add slow test
      
      * add lstm init and test_init
      
      * Add save & load
      
      * finish EncodecModel
      
      * remove decoder_input_values as they are not used anywhere (not removed from doc yet)
      
      * fix test feature extraction model name
      
      * Add better slow test
      
      * Fix tests
      
      * some fixup and cleaning
      
      * Improve further
      
      * cleaning up quantizer
      
      * fix up conversion script
      
      * tests don't pass, _encode_frame does not work
      
      * update tests with output per encode and decode
      
      * more cleanup
      
      * rename _codebook
      
      * remove old config cruft
      
      * ratios & hop_length
      
      * use ModuleList instead of Sequential
      
      * clean up resnet block
      
      * update types
      
      * update tests
      
      * fixup
      
      * quick cleanup
      
      * fix padding
      
      * more styling
      
      * add patrick feedback
      
      * fix copies
      
      * fixup
      
      * fix lstm
      
      * fix shape issues
      
      * fixup
      
      * rename conv layers
      
      * fixup
      
      * fix decoding
      
      * small conv refactoring
      
      * remove norm_params
      
      * simplify conv layers
      
      * rename conv layers
      
      * stuff
      
      * Clean up
      
      * Add padding logic
      
      use padding mask
      
      small conv refactoring
      
      remove norm_params
      
      simplify conv layers
      
      rename conv layers
      
      stuff
      
      add batched test
      
      update
      
      Clean up
      
      merge and update for padding
      
      fix padding
      
      fixup
      
      * clean up more
      
      * clean up more
      
      * More clean ups
      
      * cleanup convolutions
      
      * typo
      
      * fix typos
      
      * fixup
      
      * build PR doc?
      
      * start refactoring docstring
      
      * fix: don't pad when no stride and chunk
      
      * update docstring
      
      * update docstring
      
      * nits
      
      * update going to lunch
      
      * update config and model
      
      * fix broken tests (because of the config changes)
      
      * fix scale computation
      
      * fixup
      
      * only return dict if specified or if config returns it
      
      * remove todos
      
      * update defaults in config
      
      * update conversion script
      
      * fix doctest
      
      * more docstring + fixup
      
      * nits on batched_tests
      
      * more nits
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * update based on review
      
      * fix update
      
      * update tests
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * fixup
      
      * add overlap and chunk_length_s
      
      * cleanup feature extraction
      
      * test edge cases: truncation and padding
      
      * correct processor values
      
      * update config encodec, nits
      
      * fix tests
      
      * fixup
      
      * fix 24kHz test
      
      * all tests are green
      
      * fix fixup
      
      * Apply suggestions from code review
      
      * revert readme changes
      
      * fixup
      
      * add example
      
      * use facebook checkpoints
      
      * fix typo
      
      * no pipeline tests
      
      * use self.pad everywhere we can
      
      * Apply suggestions from code review
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * update based on review
      
      * update
      
      * update mdx
      
      * fix bug and tests
      
      * fixup
      
      * fix doctest
      
      * remove comment
      
      * more nits
      
      * add more coverage for `test_truncation_and_padding`
      
      * fixup
      
      * add last test
      
      * fix text
      
      * nits
      
      * Update tests/models/encodec/test_modeling_encodec.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * take care of the last comments
      
      * typo
      
      * fix test
      
      * nits
      
      * fixup
      
      * Update src/transformers/models/encodec/feature_extraction_encodec.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      ---------
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: arthur.zucker@gmail.com <arthur.zucker@gmail.com>
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
    • Clean up old Accelerate checks (#24279) · 26a2ec56
      Sylvain Gugger authored
      * Clean up old Accelerate checks
      
      * Put back imports
    • Fix Debertav2 embed_proj (#24205) · 860d11ff
      Wissam Antoun authored
      * MLM prediction head output size from embed_size
      
      Take the output size of the dense projection layer from embedding_size instead of hidden_size since there could be a projection of the input embedding into hidden_size if they are different
      
      * project TFDebertaV2 mlm output to embedding size
      
      The embedding size can differ from hidden_size, so the final layer needs to project back to the embedding size, as in ELECTRA- or DeBERTaV3-style pretraining.
      
      This should solve an error that occurs when loading models like "almanach/camemberta-base-generator".
      
      * fix the same issue for reshaping after projection
      
      * fix layernorm size
      
      * add self.embedding_size to scope
      
      * fix embed_proj scope name
      
      * apply the same changes to TF Deberta
      
      * add the changes to deberta
      
      * added self.embedding_size instead of config.embedding_size
      
      * added the same change to debertav2
      
      * added copied from deberta to deberta2 model
      
      * config.embedding_size fix
      
      * black
      
      * fix deberta config name
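      Note: the layer sizes this commit message describes can be sketched as below. This is a minimal illustration under stated assumptions, not the library's code: `HeadConfig` and `mlm_head_shapes` are hypothetical names, and the fallback to `hidden_size` when no separate embedding size is set is an assumption made for the example.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass
class HeadConfig:
    """Hypothetical stand-in for a model config (not the library's class)."""

    hidden_size: int
    vocab_size: int
    embedding_size: Optional[int] = None  # None means "same as hidden_size"


def mlm_head_shapes(config: HeadConfig) -> Dict[str, Tuple[int, ...]]:
    """Return the (in_features, out_features) sizes of each MLM head layer."""
    # Assumption: fall back to hidden_size when no embedding size is given.
    embedding_size = (
        config.hidden_size if config.embedding_size is None else config.embedding_size
    )
    return {
        # The dense projection maps hidden states back to the embedding size.
        "dense": (config.hidden_size, embedding_size),
        # The layer norm then operates on embedding_size features.
        "layer_norm": (embedding_size,),
        # The decoder maps embedding_size features to vocabulary logits.
        "decoder": (embedding_size, config.vocab_size),
    }


shapes = mlm_head_shapes(
    HeadConfig(hidden_size=768, vocab_size=32000, embedding_size=256)
)
```

      When `embedding_size` equals `hidden_size`, the sketch reduces to the usual head where every layer uses `hidden_size`.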
    • `Pix2StructImageProcessor` requires `torch>=1.11.0` (#24270) · a04ebc8b
      Yih-Dar authored
      
      
      * fix
      
      * fix
      
      * fix
      
      ---------
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
    • Update check of core deps (#24277) · 8978b696
      Sylvain Gugger authored
    • Adapt Wav2Vec2 conversion for MMS lang identification (#24234) · c4fec38b
      Patrick von Platen authored
      * Add conversion for mms lid
      
      * make style
    • Joao Gante's avatar
      4626df50
    • Yih-Dar's avatar
      eac8dede
    • QA doc: import torch before it is used (#24228) · 91b62f5a
      ByronHsu authored
      
      
      * import torch before it is used
      
      * style
      Signed-off-by: byhsu <byhsu@linkedin.com>
      
      ---------
      Signed-off-by: byhsu <byhsu@linkedin.com>
      Co-authored-by: byhsu <byhsu@linkedin.com>
    • Fix URL in comment for contrastive loss function (#24271) · 6ab045d6
      TAE YOUNGDON authored
      * Update language_modeling.py
      
      In "class TextDatasetForNextSentencePrediction(Dataset)", "self.tokenizer.num_special_tokens_to_add(pair=True)" is accounted for twice.
      
      So I removed self.block_size and added a parameter to "def create_examples_from_document", as "class LineByLineWithSOPTextDataset" does.
      
      * Update language_modeling.py
      
      * Fix URL in comment for contrastive loss function
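      Note: the double counting described in the language_modeling.py message can be illustrated with a simplified sketch. The function names below are hypothetical, and the chunking is far simpler than the real next-sentence-prediction logic. It only shows the block size being passed in as a parameter and the special tokens being subtracted once.

```python
from typing import List


def token_budget(block_size: int, num_special_tokens: int) -> int:
    """Tokens left for text once room for special tokens is reserved."""
    return block_size - num_special_tokens


def create_examples(
    tokens: List[int], block_size: int, num_special_tokens: int
) -> List[List[int]]:
    """Split tokens into chunks that fit block_size after special tokens.

    block_size arrives as a parameter and the special tokens are
    subtracted exactly once, inside this function.
    """
    budget = token_budget(block_size, num_special_tokens)
    if budget <= 0:
        raise ValueError("block_size must exceed the number of special tokens")
    return [tokens[i : i + budget] for i in range(0, len(tokens), budget)]


# With block_size=8 and 3 special tokens, each chunk holds 5 tokens.
# Subtracting the special tokens a second time would leave only 2.
chunks = create_examples(list(range(10)), block_size=8, num_special_tokens=3)
```

      Subtracting the special tokens both when storing the block size and again when building examples would shrink every example by that amount, which is the problem the message reports.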
  5. 13 Jun, 2023 3 commits