1. 11 Jul, 2023 4 commits
  2. 10 Jul, 2023 1 commit
  3. 07 Jul, 2023 3 commits
  4. 06 Jul, 2023 2 commits
  5. 05 Jul, 2023 4 commits
  6. 03 Jul, 2023 1 commit
  7. 28 Jun, 2023 2 commits
  8. 26 Jun, 2023 1 commit
  9. 21 Jun, 2023 2 commits
  10. 16 Jun, 2023 1 commit
    • Add LRS3 data preparation (#3421) · 77cdd160
      Pingchuan Ma authored
      Summary:
      This PR adds a data preparation recipe that uses the ultra face detector to extract full-face video. The resulting video output is then used as input for training and evaluating RNNT-based models for automatic speech recognition (ASR), visual speech recognition (VSR), and audio-visual ASR (AV-ASR) on the LRS3 dataset.
      
      This PR also updates the word error rate (WER) for AV-ASR LRS3 models and improves code readability.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3421
      
      Reviewed By: mpc001
      
      Differential Revision: D46799748
      
      Pulled By: mthrok
      
      fbshipit-source-id: 97af3feac0592b240617faaffa4c0ac8cef614a9
  11. 15 Jun, 2023 1 commit
    • Update forced alignment tutorial (#3440) · 18601691
      moto authored
      Summary:
      * Fix backtrack visualization (the coordinate was off by one).
      * Add note about the simplification and the new align API
      * Explicitly handle SOS and EOS
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3440
      
      Reviewed By: xiaohui-zhang
      
      Differential Revision: D46761282
      
      Pulled By: mthrok
      
      fbshipit-source-id: b0b6c9754674e8e23543e9f002e29b55102c92f8
  12. 14 Jun, 2023 1 commit
  13. 13 Jun, 2023 2 commits
  14. 12 Jun, 2023 1 commit
  15. 09 Jun, 2023 3 commits
  16. 08 Jun, 2023 8 commits
  17. 07 Jun, 2023 2 commits
  18. 06 Jun, 2023 1 commit