1. 20 Jun, 2022 1 commit
  2. 23 May, 2022 1 commit
    • Zhaoheng Ni's avatar
      Add LibriLightLimited dataset (#2302) · af9cab3b
      Zhaoheng Ni authored
      Summary:
      The `LibriLightLimited` dataset is created for fine-tuning SSL models, such as Wav2Vec2 and HuBERT. It is a supervised subset of [Libri-Light](https://github.com/facebookresearch/libri-light) dataset. To distinguish the unsupervised subset and the supervised one, it's clearer to put it in a separate dataset class for fine-tuning purpose.
      It contains "10 min", "1 hour", "10 hour" splits.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2302
      
      Reviewed By: mthrok
      
      Differential Revision: D36388188
      
      Pulled By: nateanl
      
      fbshipit-source-id: ba49f1c9996be17db5db41127d8ca96224c94249
      af9cab3b
  3. 10 May, 2022 1 commit
  4. 26 Apr, 2022 1 commit
  5. 18 Apr, 2022 1 commit
  6. 23 Nov, 2021 1 commit
  7. 04 Nov, 2021 1 commit
  8. 06 Oct, 2021 1 commit
  9. 05 Oct, 2021 1 commit
  10. 02 Aug, 2021 1 commit
  11. 04 Dec, 2020 1 commit
  12. 02 Oct, 2020 1 commit
  13. 15 Sep, 2020 1 commit
  14. 20 Aug, 2020 1 commit
  15. 19 Aug, 2020 1 commit
  16. 20 Jul, 2020 1 commit
    • jimchen90's avatar
      Add LibriTTS dataset (#790) · 4b8aad7a
      jimchen90 authored
      
      
      * Add libritts
      
      Add LibriTTS dataset draft
      
      * Add libritts
      
      Use two separate ids for utterance_id.
      
      * Update output form
      
      Use full_id as utterance_id.
      
      * Update format
      
      Add space and test black format
      
      * Update test method
      
      * Add audio and text test
      
      Generate audio and test files on-the-fly in test 
      
      * Update format
      
      * Fix test error and remove assets libritts
      
      The test error is fixed by sorting the file in 4th element instead of 2nd element in samples. Since the files are generated on-the-fly, so the the libritts files in assets are removed.
      
      * Add seed in `get_whitenoise` function
      
      * Change utterance to text
      
      Change `_utterance` to `_text`.
      Co-authored-by: default avatarJi Chen <jimchen90@devfair0160.h2.fair>
      4b8aad7a
  17. 10 Jun, 2020 1 commit
  18. 02 Jun, 2020 1 commit
    • Emmanouil Theofanis Chourdakis's avatar
      Added the popular GTZAN dataset: (#668) · b0367251
      Emmanouil Theofanis Chourdakis authored
      
      
      * Added the popular GTZAN dataset:
      
      * Added the GTZAN class in torchaudio.datasets using the same format as the rest of the datasets.
      * Added the appropriate test function in test_datasets.py.
      * Added the GTZAN class in the datasets.rst documentation file.
      
      * Addressed review issues in PR #668
      
      * Added dummy noise .wav in `test/assets/`
      * Removed transforms of input and output from the dataset
        `__init__` function, as well as the corresponding methods.
      * Replaced rendundant `filtered` and `subset` methods from
        class initialization and also changed the corresponding
        assertion message.
      
      * Fixed E303: too many blank lines error
      
      * Added GTZAN to __init__.__all__
      
      * Fixed incorrectly not importing GTZAN
      
      * removed duplicate warning
      
      * lint
      Co-authored-by: default avatarVincent QB <vincentqb@users.noreply.github.com>
      b0367251
  19. 27 Apr, 2020 1 commit
  20. 18 Dec, 2017 1 commit