1. 16 Jun, 2023 2 commits
  2. 15 Jun, 2023 16 commits
  3. 14 Jun, 2023 11 commits
    • Patrick von Platen's avatar
      Add MMS CTC Fine-Tuning (#24281) · 1609a436
      Patrick von Platen authored
      * Add mms ctc fine tuning
      
      * make style
      
      * More fixes that are needed
      
      * make fix-copies
      
      * make draft for README
      
      * add new file
      
      * move to new file
      
      * make style
      
      * make style
      
      * add quick test
      
      * make style
      
      * make style
      1609a436
    • Matthijs Hollemans's avatar
      [WIP] add EnCodec model (#23655) · 0c3fdccf
      Matthijs Hollemans authored
      * boilerplate stuff
      
      * messing around with the feature extractor
      
      * fix feature extractor
      
      * unit tests for feature extractor
      
      * rename speech to audio
      
      * quick-and-dirty import of Meta's code
      
      * import weights (sort of)
      
      * cleaning up
      
      * more cleaning up
      
      * move encoder/decoder args into config
      
      * cleanup model
      
      * rename EnCodec -> Encodec
      
      * RVQ parameters in config
      
      * add slow test
      
      * add lstm init and test_init
      
      * Add save & load
      
      * finish EncodecModel
      
      * remove decoder_input_values as they are ont used anywhere (not removed from doc yet)
      
      * fix test feature extraction model name
      
      * Add better slow test
      
      * Fix tests
      
      * some fixup and cleaning
      
      * Improve further
      
      * cleaning up quantizer
      
      * fix up conversion script
      
      * test don't pass, _encode_fram does not work
      
      * update tests with output per encode and decode
      
      * more cleanup
      
      * rename _codebook
      
      * remove old config cruft
      
      * ratios & hop_length
      
      * use ModuleLi...
      0c3fdccf
    • Sylvain Gugger's avatar
      Clean up old Accelerate checks (#24279) · 26a2ec56
      Sylvain Gugger authored
      * Clean up old Accelerate checks
      
      * Put back imports
      26a2ec56
    • Wissam Antoun's avatar
      Fix Debertav2 embed_proj (#24205) · 860d11ff
      Wissam Antoun authored
      * MLM prediction head output size from embed_size
      
      Take the output size of the dense projection layer from embedding_size instead of hidden_size since there could be a projection of the input embedding into hidden_size if they are different
      
      * project TFDebertaV2 mlm output to embedding size
      
      embedding size can be different that hidden_size, so the final layer needs to project back to embedding size. like in ELECTRA or DeBERTaV3 style pertaining.
      
      This should solve an error that occurs when loading models like "almanach/camemberta-base-generator".
      
      * fix the same issue for reshaping after projection
      
      * fix layernorm size
      
      * add self.embedding_size to scope
      
      * fix embed_proj scope name
      
      * apply the same changes to TF Deberta
      
      * add the changes to deberta
      
      * added self.embedding_size instead of config.embedding_size
      
      * added the same change to debertav2
      
      * added coppied from deberta to deberta2 model
      
      * config.embedding_size fix
      
      * black
      
      * fix deberta config name
      860d11ff
    • Yih-Dar's avatar
      `Pix2StructImageProcessor` requires `torch>=1.11.0` (#24270) · a04ebc8b
      Yih-Dar authored
      
      
      * fix
      
      * fix
      
      * fix
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      a04ebc8b
    • Sylvain Gugger's avatar
      Update check of core deps (#24277) · 8978b696
      Sylvain Gugger authored
      8978b696
    • Patrick von Platen's avatar
      Adapt Wav2Vec2 conversion for MMS lang identification (#24234) · c4fec38b
      Patrick von Platen authored
      * Add conversion for mms lid
      
      * make style
      c4fec38b
    • Joao Gante's avatar
      4626df50
    • Yih-Dar's avatar
      eac8dede
    • ByronHsu's avatar
      QA doc: import torch before it is used (#24228) · 91b62f5a
      ByronHsu authored
      
      
      * import torch before it is used
      
      * style
      Signed-off-by: default avatarbyhsu <byhsu@linkedin.com>
      
      ---------
      Signed-off-by: default avatarbyhsu <byhsu@linkedin.com>
      Co-authored-by: default avatarbyhsu <byhsu@linkedin.com>
      91b62f5a
    • TAE YOUNGDON's avatar
      Fix URL in comment for contrastive loss function (#24271) · 6ab045d6
      TAE YOUNGDON authored
      * Update language_modeling.py
      
      in "class TextDatasetForNextSentencePrediction(Dataset)", double considering "self.tokenizer.num_special_tokens_to_add(pair=True)" 
      
      so, i remove self.block_size, and add parameter for "def create_examples_from_document". like "class LineByLineWithSOPTextDataset" do
      
      * Update language_modeling.py
      
      * Fix URL in comment for contrastive loss function
      6ab045d6
  4. 13 Jun, 2023 11 commits