1. 25 Feb, 2022 2 commits
  2. 16 Feb, 2022 1 commit
    • Zhaoheng Ni's avatar
      Add EMFORMER_RNNT_BASE_MUSTC bundle to torchaudio.prototype (#2241) · 99b5ef5c
      Zhaoheng Ni authored
      Summary:
      This PR provides a RNNTBundle that is pre-trained on the MuST-C release v2.0 dataset.
      The model preserves the casing and punctuations of the transcripts when training the SentencePiece model.
      
      Here is the model performance on the dev and test sets of MuST-C 2.0:
      |                   |          WER |
      |:-----------------:|-------------:|
      | dev               |       0.190  |
      | tst-COMMON        |       0.213  |
      | tst-HE            |       0.186  |
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2241
      
      Reviewed By: mthrok
      
      Differential Revision: D34267792
      
      Pulled By: nateanl
      
      fbshipit-source-id: 67bca9f277e66d41a4530d01615f249b3cec7167
      99b5ef5c
  3. 04 Feb, 2022 1 commit
  4. 03 Feb, 2022 1 commit
  5. 02 Feb, 2022 1 commit
  6. 01 Feb, 2022 3 commits
  7. 27 Jan, 2022 1 commit
    • Caroline Chen's avatar
      Add no lm support for CTC decoder (#2174) · 4c3fa875
      Caroline Chen authored
      Summary:
      Add support for CTC lexicon decoder without LM support by adding a non language model `ZeroLM` that returns score 0 for everything. Generalize the decoder class/API a bit to support this, adding it as an option for the kenlm decoder at the moment (will likely be separated out from kenlm when adding support for other kinds of LMs in the future)
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2174
      
      Reviewed By: hwangjeff, nateanl
      
      Differential Revision: D33798674
      
      Pulled By: carolineechen
      
      fbshipit-source-id: ef8265f1d046011b143597b3b7c691566b08dcde
      4c3fa875
  8. 14 Jan, 2022 1 commit
    • moto's avatar
      Tweak documentation (#2152) · 7f859111
      moto authored
      Summary:
      - Change the version of nightly build to `Nightly Build (VERSION)`.
      - Use `BUILD_VERSION` env var for release.
      - Automatically change copyright year.
      - Update the link to nightly in README so that the main branch directs to the corresponding document.
      
      Because of the way CI job is setup, the resulting documentation says 0.8.0. This is fixed by https://github.com/pytorch/audio/issues/2151.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2152
      
      Reviewed By: carolineechen, nateanl
      
      Differential Revision: D33585053
      
      Pulled By: mthrok
      
      fbshipit-source-id: 3c2bf9fc3214c89f989f5ac65b74bc1e276a7161
      7f859111
  9. 06 Jan, 2022 1 commit
  10. 04 Jan, 2022 1 commit
  11. 31 Dec, 2021 1 commit
  12. 30 Dec, 2021 1 commit
  13. 29 Dec, 2021 3 commits
  14. 28 Dec, 2021 4 commits
  15. 23 Dec, 2021 3 commits
  16. 24 Nov, 2021 1 commit
    • hwangjeff's avatar
      Add RNN-T beam search decoder (#2028) · 60a85b50
      hwangjeff authored
      Summary:
      Adds beam search decoder for RNN-T implementation ``torchaudio.prototype.RNNT`` that is TorchScript-able and supports both streaming and non-streaming inference.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2028
      
      Reviewed By: mthrok
      
      Differential Revision: D32627919
      
      Pulled By: hwangjeff
      
      fbshipit-source-id: aab99e346d6514a3207a9fb69d4b42978b4cdbbd
      60a85b50
  17. 23 Nov, 2021 1 commit
  18. 19 Nov, 2021 1 commit
    • moto's avatar
      Disable SPHINXOPT=-W for local env (#2013) · 3ff46bfa
      moto authored
      Summary:
      With the introduction of tutorials, the turn around time for doc build
      has become longer. By default, the tutorial is not built but SPHINXOPT=-W
      treats it as error.
      
      This commit disable the option for the local build while keeping it
      for the CI.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2013
      
      Reviewed By: carolineechen
      
      Differential Revision: D32538952
      
      Pulled By: mthrok
      
      fbshipit-source-id: eae4ffd87100dff466f91abfe26a82aa702d605a
      3ff46bfa
  19. 18 Nov, 2021 1 commit
    • hwangjeff's avatar
      Add Emformer RNN-T model (#2003) · 78ce7010
      hwangjeff authored
      Summary:
      Adds streaming-capable recurrent neural network transducer (RNN-T) model that uses Emformer for its transcription network. Includes two factory functions — one that allows for building a custom model, and one that builds a preconfigured base model.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2003
      
      Reviewed By: nateanl
      
      Differential Revision: D32440879
      
      Pulled By: hwangjeff
      
      fbshipit-source-id: 601cb1de368427f25e3b7d120e185960595d2360
      78ce7010
  20. 10 Nov, 2021 1 commit
  21. 05 Nov, 2021 4 commits
  22. 04 Nov, 2021 6 commits