1. 01 Feb, 2024 1 commit
  2. 30 Jan, 2024 1 commit
    • Pin Torch to <2.2.0 (#28785) · 74c9cfea
      Matt authored
      
      
      * Pin torch to <2.2.0
      
      * Pin torchvision and torchaudio as well
      
      * Playing around with versions to see if this helps
      
      * twiddle something to restart the CI
      
      * twiddle it back
      
      * Try changing the natten version
      
      * make fixup
      
      * Revert "Try changing the natten version"
      
      This reverts commit de0d6592c35dc39ae8b5a616c27285db28262d06.
      
      * make fixup
      
      * fix fix fix
      
      * fix fix fix
      
      ---------
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
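A pin such as `torch<2.2.0` is an upper bound on the release number. As a rough illustration of what that bound admits, the sketch below compares release numbers using only the standard library. The helper names `release_tuple` and `satisfies_pin` are hypothetical and not part of the repository, and the comparison deliberately ignores pre-release ordering, which a real resolver such as pip handles.

```python
import re

# Upper bound taken from the commit title: torch<2.2.0.
UPPER_BOUND = (2, 2, 0)


def release_tuple(version):
    """Return the numeric release part of a version string as a 3-tuple.

    A local suffix such as "+cu121" is dropped, and missing components
    are padded with zeros so that "2.2" compares equal to "2.2.0".
    """
    public = version.split("+", 1)[0]
    numbers = []
    for piece in public.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        numbers.append(int(match.group()))
    while len(numbers) < 3:
        numbers.append(0)
    return tuple(numbers)


def satisfies_pin(version, upper=UPPER_BOUND):
    """True when the release number is strictly below the upper bound."""
    return release_tuple(version) < upper


for candidate in ("2.1.2", "2.1.2+cu121", "2.2.0", "2.10.0"):
    print(candidate, satisfies_pin(candidate))
```

Comparing integer tuples rather than raw strings matters here: as strings, "2.10.0" sorts before "2.2.0", which would wrongly let a later release through the pin.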
  3. 29 Jan, 2024 1 commit
  4. 26 Jan, 2024 1 commit
  5. 22 Jan, 2024 2 commits
  6. 19 Jan, 2024 1 commit
  7. 18 Jan, 2024 2 commits
    • Making CTC training example more general (#28582) · 772307be
      Yoach Lacombe authored
      
      
      * add w2v2bert compatibility
      
      * Update examples/pytorch/speech-recognition/run_speech_recognition_ctc.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      ---------
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
    • Add new meta w2v2-conformer BERT-like model (#28165) · d2cdefb9
      Yoach Lacombe authored
      
      
      * first commit
      
      * correct default value non causal
      
      * update config and modeling code
      
      * update converting checkpoint
      
      * clean modeling and fix tests
      
      * make style
      
      * add new config parameters to docstring
      
      * fix copied from statements
      
      * Apply suggestions from code review
      Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * make position_embeddings_type docstrings clearer
      
      * clean converting script
      
      * remove function not used
      
      * clean modeling file
      
      * apply suggestion for test file + add convert script to not_doctested
      
      * modify tests according to review - cleaner logic and more tests
      
      * Apply nit suggestions from code review
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * add checker of valid position embeddings type
      
      * instantiate new layer norm layer with the right eps
      
      * fix freeze_feature_encoder since it can be None in some cases
      
      * add test same output in convert script
      
      * restore wav2vec2conformer and add new model
      
      * create processor and FE + clean
      
      * add new model code
      
      * fix convert script and set default config parameters
      
      * correct model id paths
      
      * make style
      
      * make fix-copies and cleaning files
      
      * fix copied from statements
      
      * complete .md and fix copies
      
      * clean convert script argument defaults
      
      * fix config parameters docstrings
      
      * fix config docstring
      
      * add copied from and enrich FE tests
      
      * fix copied from and repo-consistency
      
      * add autotokenizer
      
      * make test input length shorter and change docstring code
      
      * fix docstrings and copied from
      
      * add add_adapter to ASR training example
      
      * make testing of adapters more robust
      
      * adapt to multi adapter layers
      
      * refactor input_values->input_features and remove w2v2-bert feature extractor
      
      * remove pretraining model
      
      * remove deprecated features and useless lines
      
      * add copied from and ignore statements to modeling tests
      
      * remove pretraining model #2
      
      * change import in convert script
      
      * change default in convert script
      
      * update readme and remove useless line
      
      * Update tests/models/wav2vec2_bert/test_processor_wav2vec2_bert.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * refactor BERT to Bert for consistency
      
      * remove useless ignore copy statement
      
      * add persistent to buffer in rotary
      
      * add eps in LayerNorm init and remove copied from
      
      * add adapter activation parameters and add copied from statements
      
      * Fix copied statements and add unittest.skip reasons
      
      * add copied statement in test_processor
      
      * refactor processor
      
      * make style
      
      * replace numpy random by torch rand
      
      * remove expected output CTC
      
      * improve converting script with processor class
      
      * Apply suggestions from code review
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * remove gumbel class
      
      * remove tests related to previously deleted class
      
      * Update src/transformers/models/wav2vec2_bert/configuration_wav2vec2_bert.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * correct typos
      
      * remove unused parameters
      
      * update processor to take both text and audio
      
      * update checkpoints
      
      * update expected output and add ctc expected output
      
      * add label_attention_mask
      
      * replace pt with np in processor tests
      
      * fix typo
      
      * revert to behaviour with labels_attention_mask
      
      ---------
      Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
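One of the steps above adds a checker for valid position embeddings types. The following is only a sketch of what such a check could look like. The function name `check_position_embeddings_type` is hypothetical, and the set of allowed values is an assumption for illustration, not taken from the commit.

```python
# Assumed set of accepted values; the commit message does not list them.
ALLOWED_POSITION_EMBEDDINGS_TYPES = ("relative", "relative_key", "rotary", None)


def check_position_embeddings_type(value):
    """Return the value unchanged if it is accepted, otherwise raise ValueError."""
    if value not in ALLOWED_POSITION_EMBEDDINGS_TYPES:
        raise ValueError(
            f"position_embeddings_type must be one of "
            f"{ALLOWED_POSITION_EMBEDDINGS_TYPES}, got {value!r}"
        )
    return value


print(check_position_embeddings_type("rotary"))
```

Failing early at configuration time gives a clearer error than letting an unknown value surface later as a missing layer or a shape mismatch.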
  8. 16 Jan, 2024 1 commit
  9. 15 Jan, 2024 2 commits
  10. 12 Jan, 2024 1 commit
  11. 11 Jan, 2024 1 commit
    • Set `cache_dir` for `evaluate.load()` in example scripts (#28422) · 95091e15
      Alex Hedges authored
      While using `run_clm.py`,[^1] I noticed that some files were being added
      to my global cache, not the local cache. I set the `cache_dir` parameter
      for the one call to `evaluate.load()`, which partially solved the
      problem. I figured that while I was fixing the one script upstream, I
      might as well fix the problem in all other example scripts that I could.
      
      There are still some files being added to my global cache, but this
      appears to be a bug in `evaluate` itself. This commit at least moves
      some of the files into the local cache, which is better than before.
      
      To create this PR, I made the following regex-based transformation:
      `evaluate\.load\((.*?)\)` -> `evaluate\.load\($1,
      cache_dir=model_args.cache_dir\)`. After using that, I manually fixed
      all modified files with `ruff` serving as useful guidance. During the
      process, I removed one existing usage of the `cache_dir` parameter in a
      script that did not have a corresponding `--cache-dir` argument
      declared.
      
      [^1]: I specifically used `pytorch/language-modeling/run_clm.py` from
      v4.34.1 of the library. For the original code, see the following URL:
      https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.
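The regex-based transformation described above can be reproduced with Python's `re` module, as in the sketch below. The commit does not say which tool was used. The pattern is copied from the message, the replacement is written in Python's `\1` syntax instead of `$1`, and the helper name `add_cache_dir` and both sample lines are made up for illustration.

```python
import re

# Pattern from the commit message; the lazy group stops at the first ")".
PATTERN = re.compile(r"evaluate\.load\((.*?)\)")
REPLACEMENT = r"evaluate.load(\1, cache_dir=model_args.cache_dir)"


def add_cache_dir(source):
    """Append cache_dir=model_args.cache_dir to every evaluate.load(...) call."""
    return PATTERN.sub(REPLACEMENT, source)


simple = 'metric = evaluate.load("accuracy")'
nested = 'metric = evaluate.load(os.path.join("metrics", "wer"))'

print(add_cache_dir(simple))
print(add_cache_dir(nested))
```

The first call is rewritten cleanly. In the second, the lazy match ends at the inner closing parenthesis, so the new argument lands inside `os.path.join(...)`. Cases like this are one plausible reason a manual pass after the regex was needed.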
  12. 13 Dec, 2023 1 commit
  13. 11 Dec, 2023 1 commit
    • fix no sequence length models error (#27522) · 4850aaba
      Adam Louly authored
      * fix no sequence length models error
      
      * block size check
      
      ---------
      
      Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
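The commit message gives little detail, so the sketch below is only a guess at the kind of check involved: choosing a block size when the model does not report a maximum sequence length. The function `resolve_block_size`, its parameters, and the 1024 fallback are assumptions for illustration and are not copied from the example scripts.

```python
DEFAULT_BLOCK_SIZE = 1024  # assumed fallback, not confirmed by the commit


def resolve_block_size(requested, tokenizer_max_length, max_position_embeddings=None):
    """Pick a block size that respects the tokenizer and, when known, the model.

    A missing or non-positive max_position_embeddings is treated as
    "no sequence length reported" and replaced by the default.
    """
    if max_position_embeddings is None or max_position_embeddings <= 0:
        model_limit = DEFAULT_BLOCK_SIZE
    else:
        model_limit = max_position_embeddings

    if requested is None:
        # No explicit request: stay within every known limit.
        return min(tokenizer_max_length, model_limit, DEFAULT_BLOCK_SIZE)
    # Explicit request: never exceed what the tokenizer can produce.
    return min(requested, tokenizer_max_length)


print(resolve_block_size(None, 10**30, None))
print(resolve_block_size(256, 1024, 2048))
```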
  14. 30 Nov, 2023 1 commit
  15. 27 Nov, 2023 1 commit
  16. 20 Nov, 2023 1 commit
  17. 17 Nov, 2023 1 commit
  18. 15 Nov, 2023 3 commits
  19. 09 Nov, 2023 3 commits
  20. 08 Nov, 2023 3 commits
  21. 02 Nov, 2023 1 commit
  22. 31 Oct, 2023 1 commit
  23. 30 Oct, 2023 1 commit
  24. 29 Oct, 2023 1 commit
  25. 27 Oct, 2023 1 commit
  26. 24 Oct, 2023 1 commit
  27. 23 Oct, 2023 1 commit
  28. 12 Oct, 2023 1 commit
  29. 11 Oct, 2023 1 commit
  30. 10 Oct, 2023 1 commit
  31. 04 Oct, 2023 1 commit
    • refactor: change default block_size (#26229) · 6015f91a
      Phuc Van Phan authored
      * refactor: change default block_size
      
      * fix: return tf to origin
      
      * fix: change files to origin
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * refactor: add min block_size to files
      
      * reformat: add min block_size for run_clm tf
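For context on why the default matters: in the language-modeling examples, `block_size` is the length of the fixed-size chunks that the concatenated token stream is cut into. The sketch below shows that chunking on a plain list. The function name `group_into_blocks` is hypothetical, and dropping the trailing remainder is a simplifying assumption.

```python
def group_into_blocks(token_ids, block_size):
    """Split a flat list of token ids into consecutive blocks of block_size.

    Tokens that do not fill a final block are dropped.
    """
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    usable = (len(token_ids) // block_size) * block_size
    return [token_ids[i:i + block_size] for i in range(0, usable, block_size)]


print(group_into_blocks(list(range(10)), 4))
```

A block size larger than the whole token stream yields no blocks at all, which is one reason a sensible default and an upper cap are worth having.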