1. 11 Oct, 2022 1 commit
  2. 09 Oct, 2022 1 commit
  3. 22 Sep, 2022 1 commit
  4. 15 Sep, 2022 1 commit
  5. 27 Jun, 2022 1 commit
    • Zhaoheng Ni's avatar
      Add VoxCeleb1 dataset (#2349) · 21b2d139
      Zhaoheng Ni authored
      Summary:
      This PR adds two dataset classes of VoxCeleb1 corpus.
      - `VoxCeleb1Identification`
      Each data sample contains the waveform, sample rate, speaker id, and the file id.
      - `VoxCeleb1Verification`
      Each data sample contains a pair of waveforms, sample rate, the label indicating if they are from the same speaker, and the file ids.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2349
      
      Reviewed By: carolineechen
      
      Differential Revision: D35927921
      
      Pulled By: nateanl
      
      fbshipit-source-id: 3e07ddd329178777698841565053eb59befe6449
      21b2d139
  6. 21 Jun, 2022 1 commit
    • Sean Kim's avatar
      Create musdb handler and tests (#2484) · b92a8a09
      Sean Kim authored
      Summary:
      Create dataset handler and tests for new dataset. Manually tested and unit tested to test validity. Pre-commit ran for style checks.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2484
      
      Reviewed By: carolineechen, nateanl
      
      Differential Revision: D37250556
      
      Pulled By: skim0514
      
      fbshipit-source-id: d2c8d73d22fd9d7282026265676f3eab1e178d51
      b92a8a09
  7. 20 Jun, 2022 1 commit
  8. 23 May, 2022 1 commit
    • Zhaoheng Ni's avatar
      Add LibriLightLimited dataset (#2302) · af9cab3b
      Zhaoheng Ni authored
      Summary:
      The `LibriLightLimited` dataset is created for fine-tuning SSL models, such as Wav2Vec2 and HuBERT. It is a supervised subset of [Libri-Light](https://github.com/facebookresearch/libri-light) dataset. To distinguish the unsupervised subset and the supervised one, it's clearer to put it in a separate dataset class for fine-tuning purpose.
      It contains "10 min", "1 hour", "10 hour" splits.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2302
      
      Reviewed By: mthrok
      
      Differential Revision: D36388188
      
      Pulled By: nateanl
      
      fbshipit-source-id: ba49f1c9996be17db5db41127d8ca96224c94249
      af9cab3b
  9. 10 May, 2022 1 commit
  10. 26 Apr, 2022 1 commit
  11. 18 Apr, 2022 1 commit
  12. 23 Nov, 2021 1 commit
  13. 04 Nov, 2021 1 commit
  14. 06 Oct, 2021 1 commit
  15. 05 Oct, 2021 1 commit
  16. 02 Aug, 2021 1 commit
  17. 04 Dec, 2020 1 commit
  18. 02 Oct, 2020 1 commit
  19. 15 Sep, 2020 1 commit
  20. 20 Aug, 2020 1 commit
  21. 19 Aug, 2020 1 commit
  22. 20 Jul, 2020 1 commit
    • jimchen90's avatar
      Add LibriTTS dataset (#790) · 4b8aad7a
      jimchen90 authored
      
      
      * Add libritts
      
      Add LibriTTS dataset draft
      
      * Add libritts
      
      Use two separate ids for utterance_id.
      
      * Update output form
      
      Use full_id as utterance_id.
      
      * Update format
      
      Add space and test black format
      
      * Update test method
      
      * Add audio and text test
      
      Generate audio and test files on-the-fly in test 
      
      * Update format
      
      * Fix test error and remove assets libritts
      
      The test error is fixed by sorting the file in 4th element instead of 2nd element in samples. Since the files are generated on-the-fly, so the the libritts files in assets are removed.
      
      * Add seed in `get_whitenoise` function
      
      * Change utterance to text
      
      Change `_utterance` to `_text`.
      Co-authored-by: default avatarJi Chen <jimchen90@devfair0160.h2.fair>
      4b8aad7a
  23. 10 Jun, 2020 1 commit
  24. 02 Jun, 2020 1 commit
    • Emmanouil Theofanis Chourdakis's avatar
      Added the popular GTZAN dataset: (#668) · b0367251
      Emmanouil Theofanis Chourdakis authored
      
      
      * Added the popular GTZAN dataset:
      
      * Added the GTZAN class in torchaudio.datasets using the same format as the rest of the datasets.
      * Added the appropriate test function in test_datasets.py.
      * Added the GTZAN class in the datasets.rst documentation file.
      
      * Addressed review issues in PR #668
      
      * Added dummy noise .wav in `test/assets/`
      * Removed transforms of input and output from the dataset
        `__init__` function, as well as the corresponding methods.
      * Replaced rendundant `filtered` and `subset` methods from
        class initialization and also changed the corresponding
        assertion message.
      
      * Fixed E303: too many blank lines error
      
      * Added GTZAN to __init__.__all__
      
      * Fixed incorrectly not importing GTZAN
      
      * removed duplicate warning
      
      * lint
      Co-authored-by: default avatarVincent QB <vincentqb@users.noreply.github.com>
      b0367251
  25. 27 Apr, 2020 1 commit
  26. 18 Dec, 2017 1 commit