  1. 06 Jul, 2023 1 commit
  2. 05 Jul, 2023 4 commits
  3. 03 Jul, 2023 1 commit
  4. 28 Jun, 2023 2 commits
  5. 26 Jun, 2023 1 commit
  6. 21 Jun, 2023 2 commits
  7. 16 Jun, 2023 1 commit
    • Add LRS3 data preparation (#3421) · 77cdd160
      Pingchuan Ma authored
      Summary:
      This PR adds a data preparation recipe that uses the ultra face detector to extract full-face video. The resulting video output is then used as input for training and evaluating RNNT-based models for automatic speech recognition (ASR), visual speech recognition (VSR), and audio-visual ASR (AV-ASR) on the LRS3 dataset.
      
      This PR also updates the word error rate (WER) for AV-ASR LRS3 models and improves the code readability.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3421
      
      Reviewed By: mpc001
      
      Differential Revision: D46799748
      
      Pulled By: mthrok
      
      fbshipit-source-id: 97af3feac0592b240617faaffa4c0ac8cef614a9
  8. 15 Jun, 2023 1 commit
    • Update forced alignment tutorial (#3440) · 18601691
      moto authored
      Summary:
      * Fix backtrack visualization (the coordinate was off by one)
      * Add note about the simplification and the new align API
      * Explicitly handle SOS and EOS
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3440
      
      Reviewed By: xiaohui-zhang
      
      Differential Revision: D46761282
      
      Pulled By: mthrok
      
      fbshipit-source-id: b0b6c9754674e8e23543e9f002e29b55102c92f8
  9. 14 Jun, 2023 1 commit
  10. 13 Jun, 2023 2 commits
  11. 12 Jun, 2023 1 commit
  12. 09 Jun, 2023 3 commits
  13. 08 Jun, 2023 8 commits
  14. 07 Jun, 2023 2 commits
  15. 06 Jun, 2023 4 commits
  16. 05 Jun, 2023 1 commit
  17. 04 Jun, 2023 1 commit
  18. 03 Jun, 2023 1 commit
  19. 02 Jun, 2023 3 commits
    • [BC-Breaking] Remove compute_kaldi_pitch (#3368) · 5bbbb1d5
      moto authored
      Summary:
      This commit removes the compute_kaldi_pitch function and the underlying Kaldi integration from torchaudio.
      
      The Kaldi pitch function was added quickly by integrating the original Kaldi implementation rather than reimplementing it in PyTorch.
      
      The Kaldi integration relied on a hack that replaced Kaldi's base vector/matrix implementation with PyTorch Tensor, so that torchaudio contained only one BLAS library.
      
      We have recently been making torchaudio leaner, and the kaldi_pitch feature has not seen wide adoption, so we decided to remove it.
      
      See some of the discussion at https://github.com/pytorch/audio/issues/1269
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3368
      
      Differential Revision: D46406176
      
      Pulled By: mthrok
      
      fbshipit-source-id: ee5e24d825188f379979ddccd680c7323b119b1e
    • Update data augmentation tutorial (#3375) · 2ba36b47
      moto authored
      Summary:
      Replace sox_effects with `torchaudio.io.AudioEffector`:
      
      1. To showcase the new and better feature
      2. To prepare for the upcoming removal of file-like object support
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3375
      
      Reviewed By: nateanl
      
      Differential Revision: D46379016
      
      Pulled By: mthrok
      
      fbshipit-source-id: 70f24b62494204949f327f6ac6c49f315c9ee315
    • Revert D46059199: [audio][PR] Use dlopen for FFmpeg · ab7a39f7
      Moto Hira authored
      Differential Revision: D46059199
      
      Original commit changeset: 4493a5fd8a4c
      
      Original Phabricator Diff: D46059199
      
      fbshipit-source-id: 71cde3f8cd870d1ad9114e3e87cdd1ba564441c0