1. 06 Oct, 2022 1 commit
  2. 05 Oct, 2022 1 commit
  3. 03 Oct, 2022 1 commit
  4. 23 Sep, 2022 2 commits
  5. 22 Sep, 2022 2 commits
  6. 21 Sep, 2022 2 commits
  7. 14 Sep, 2022 1 commit
  8. 13 Sep, 2022 1 commit
  9. 09 Sep, 2022 1 commit
  10. 06 Sep, 2022 1 commit
  11. 26 Aug, 2022 1 commit
  12. 18 Aug, 2022 3 commits
    • moto's avatar
      Update ASR inference tutorial (#2631) · 189edb1b
      moto authored
      Summary:
      * Use download_asset
      * Remove notes around nightly
      * Print versions first
      * Remove duplicated import
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2631
      
      Reviewed By: carolineechen
      
      Differential Revision: D38830395
      
      Pulled By: mthrok
      
      fbshipit-source-id: c9259df33562defe249734d1ed074dac0fddc2f6
      189edb1b
    • moto's avatar
      Update notes around nightly build and third parties (#2632) · 55ce80b1
      moto authored
      Summary:
      Google Colab now has torchaudio 0.12 pre-installed.
      This commit removes the note about nightly build.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2632
      
      Reviewed By: carolineechen
      
      Differential Revision: D38827632
      
      Pulled By: mthrok
      
      fbshipit-source-id: ac769780868b741c3012357d589ec0019d9af6eb
      55ce80b1
    • moto's avatar
      Tweak tutorials (#2630) · cab2bb44
      moto authored
      Summary:
      Resolves the following warnings
      
      ```
      /torchaudio/docs/source/tutorials/asr_inference_with_ctc_decoder_tutorial.rst:195: WARNING: Unexpected indentation.
      /torchaudio/docs/source/tutorials/asr_inference_with_ctc_decoder_tutorial.rst:446: WARNING: Unexpected indentation.
      /torchaudio/docs/source/tutorials/audio_io_tutorial.rst:559: WARNING: Content block expected for the "note" directive; none found.
      /torchaudio/docs/source/tutorials/mvdr_tutorial.rst:338: WARNING: Bullet list ends without a blank line; unexpected unindent.
      ```
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2630
      
      Reviewed By: nateanl
      
      Differential Revision: D38816632
      
      Pulled By: mthrok
      
      fbshipit-source-id: 135ded4e064d136be67ce24439e96f5e9c9ce635
      cab2bb44
  13. 10 Aug, 2022 1 commit
  14. 05 Aug, 2022 1 commit
    • Caroline Chen's avatar
      Add note for lexicon free decoder output (#2603) · 33485b8c
      Caroline Chen authored
      Summary:
      ``words`` field of CTCHypothesis is empty if no lexicon is provided, which produces confusing output (see issue https://github.com/pytorch/audio/issues/2584) when following our tutorial example with lexicon free usage. This PR adds a note in both docs and tutorial.
      
      Followup: determine if we want to modify the behavior of ``words`` in the lexicon free case. One option is to merge and then split the generated tokens by the input silent token to populate the words field, but this is tricky since the meaning of a "word" in the lexicon free case can be vague and not all languages have whitespaces between words, etc
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2603
      
      Reviewed By: mthrok
      
      Differential Revision: D38459709
      
      Pulled By: carolineechen
      
      fbshipit-source-id: d64ff186df4633f00e94c64afeaa6a50cebf2934
      33485b8c
  15. 01 Aug, 2022 1 commit
  16. 29 Jul, 2022 2 commits
    • moto's avatar
      Update forced alignment tutorial (#2544) · c26b38b2
      moto authored
      Summary:
      1. Fix initialization.
      Previously, the SOS token score was initialized to 0 across the time axis.
      This was biasing the alignment to delay the start.
      The proper way to delay the SOS is via blank token.
      The new initilization takes the cumulated sum of blank scores.
      2. Fill the end of trellis with Inf
      Similar to the start, at the end where there remaining time frame is less
      than the number of tokens, it is no longer possible to align the text, thus
      we fill with Inf for better visualization.
      3. Clean up asset management code.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2544
      
      Reviewed By: nateanl
      
      Differential Revision: D38276478
      
      Pulled By: mthrok
      
      fbshipit-source-id: 6d934cc850a0790b8c463a4f69f8f1143633d299
      c26b38b2
    • Zhaoheng Ni's avatar
      Improve speech enhancement tutorial (#2527) · d6267031
      Zhaoheng Ni authored
      Summary:
      - The "speech + noise" mixture still has a high SNR, which can't show the effectiveness of MVDR beamforming. To make the task more challenging, amplify the noise waveform to reduce the SNR of mixture speech.
      - Show the Si-SNR score of mixture speech when visualizing the mixture spectrogram.
      - FIx the figure in `rtf_power` subsection.
          - The description of enhanced spectrogram by `rtf_power` is wrong. Correct it to `rtf_power`.
      - Print PESQ, STOI, and SDR metric scores.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2527
      
      Reviewed By: mthrok
      
      Differential Revision: D38190218
      
      Pulled By: nateanl
      
      fbshipit-source-id: 39562850a67f58a16e0a2866ed95f78c3f4dc7de
      d6267031
  17. 28 Jul, 2022 2 commits
  18. 11 Jul, 2022 1 commit
  19. 23 Jun, 2022 1 commit
  20. 17 Jun, 2022 1 commit
  21. 08 Jun, 2022 2 commits
  22. 07 Jun, 2022 3 commits
  23. 04 Jun, 2022 1 commit
  24. 03 Jun, 2022 3 commits
  25. 02 Jun, 2022 1 commit
    • Zhaoheng Ni's avatar
      Update MVDR beamforming tutorial (#2398) · d01f5891
      Zhaoheng Ni authored
      Summary:
      - Use `download_asset` to download audios.
      - Replace `MVDR` module with new-added `SoudenMVDR` and `RTFMVDR` modules.
      - Benchmark performances of `F.rtf_evd` and `F.rtf_power` for RTF computation.
      - Visualize the spectrograms and masks.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2398
      
      Reviewed By: carolineechen
      
      Differential Revision: D36549402
      
      Pulled By: nateanl
      
      fbshipit-source-id: dfd6754e6c33246e6991ccc51c4603b12502a1b5
      d01f5891
  26. 01 Jun, 2022 1 commit
    • Caroline Chen's avatar
      Move CTC beam search decoder to beta (#2410) · 93024ace
      Caroline Chen authored
      Summary:
      Move CTC beam search decoder out of prototype to new `torchaudio.models.decoder` module.
      
      hwangjeff mthrok any thoughts on the new module + naming, and if we should move rnnt beam search here as well??
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2410
      
      Reviewed By: mthrok
      
      Differential Revision: D36784521
      
      Pulled By: carolineechen
      
      fbshipit-source-id: a2ec52f86bba66e03327a9af0c5df8bbefcd67ed
      93024ace
  27. 26 May, 2022 1 commit
  28. 23 May, 2022 1 commit