Commits · 66a67d2efbab894196a733426b05f2b08da6fd79 · OpenDAS / Torchaudio

20 Jun, 2022 1 commit

Add fluent speech commands (#2480) · 66a67d2e

Caroline Chen authored Jun 20, 2022

Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2480

Reviewed By: nateanl

Differential Revision: D37249571

Pulled By: carolineechen

fbshipit-source-id: caefeec4253c91f2579655a0c1735edaeed51be9

66a67d2e

23 May, 2022 1 commit

Add LibriLightLimited dataset (#2302) · af9cab3b

Zhaoheng Ni authored May 23, 2022

Summary:
The `LibriLightLimited` dataset is created for fine-tuning SSL models, such as Wav2Vec2 and HuBERT. It is a supervised subset of [Libri-Light](https://github.com/facebookresearch/libri-light) dataset. To distinguish the unsupervised subset and the supervised one, it's clearer to put it in a separate dataset class for fine-tuning purpose.
It contains "10 min", "1 hour", "10 hour" splits.

Pull Request resolved: https://github.com/pytorch/audio/pull/2302

Reviewed By: mthrok

Differential Revision: D36388188

Pulled By: nateanl

fbshipit-source-id: ba49f1c9996be17db5db41127d8ca96224c94249

af9cab3b

10 May, 2022 1 commit

Add citations for datasets (#2371) · 638120ca

Caroline Chen authored May 09, 2022

Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2371

Reviewed By: xiaohui-zhang

Differential Revision: D36246167

Pulled By: carolineechen

fbshipit-source-id: 23042a1c393711864a18c9815d248c18d1d258b4

638120ca

26 Apr, 2022 1 commit

Fix LibriMix documentation (#2351) · 892d6d34

Zhaoheng Ni authored Apr 26, 2022

Summary:
The `LibriMix` dataset is missing on the [documentation webpage](https://pytorch.org/audio/stable/datasets.html).

Pull Request resolved: https://github.com/pytorch/audio/pull/2351

Reviewed By: carolineechen

Differential Revision: D35926695

Pulled By: nateanl

fbshipit-source-id: 168aed3bb15510d1b1ec57d77727932e481aca48

892d6d34

18 Apr, 2022 1 commit

Add QUESST14 dataset (#2290) · aebcf6af

Caroline Chen authored Apr 18, 2022

Summary:
implementation adapted from [s3prl](https://github.com/s3prl/s3prl/blob/master/s3prl/downstream/quesst14_dtw/dataset.py)

modifying the s3prl downstream expert to [this](https://github.com/carolineechen/s3prl/commit/adc91a53d581a604f495f3795a865d84aa17f1a5) using this dataset implementation produces the same results as using the original s3prl pipeline

Pull Request resolved: https://github.com/pytorch/audio/pull/2290

Reviewed By: nateanl

Differential Revision: D35692551

Pulled By: carolineechen

fbshipit-source-id: 035ad161d4cbbd2072411cfdf89984b73a89868c

aebcf6af

23 Nov, 2021 1 commit

Update datasets document (#2029) · 9c9aef88

moto authored Nov 23, 2021

Summary:
- Remove unnecessary content list
- Remove legacy description

Pull Request resolved: https://github.com/pytorch/audio/pull/2029

Reviewed By: carolineechen

Differential Revision: D32629917

Pulled By: mthrok

fbshipit-source-id: bc9a9366c681bcf8b74907c2a6459c73fb6a7424

9c9aef88

04 Nov, 2021 1 commit
- Add Sphinx-gallery to doc (#1967) · a3363539
  moto authored Nov 04, 2021
  
  a3363539
06 Oct, 2021 1 commit
- Add DR-VCTK dataset (#1819) · 9a34e7c0
  kingyiusuen authored Oct 06, 2021
  
  9a34e7c0
05 Oct, 2021 1 commit
- [BC-Breaking] Remove deprecated VCTK (#1825) · fc4f481b
  moto authored Oct 05, 2021
  
  fc4f481b
02 Aug, 2021 1 commit
- Add CMUDict dataset (#1627) · 077a5f4a
  yangarbiter authored Aug 02, 2021
  
  077a5f4a
04 Dec, 2020 1 commit

[Doc] Add missing modules and minor fixes (#1022) · 2a02d7f5

Krishna Kalyan authored Dec 04, 2020



* Add griffinlim and DB_to_amplitude
* Fix Dataset docstring
* Fix other formatting
Co-authored-by: krishnakalyan3 <skalyan@cloudera.com>

2a02d7f5

02 Oct, 2020 1 commit
- Update docstrings/documentations of all the datasets (#931) · e3d1d746
  moto authored Oct 02, 2020
  
  e3d1d746
15 Sep, 2020 1 commit
- Add tedlium dataset (#882) · 914a846d
  Jaime Ferrando Huertas authored Sep 15, 2020
  
  914a846d
20 Aug, 2020 1 commit

Update VCTK_092 interface and add tests (#875) · 2205cc9e

JianwuXu authored Aug 20, 2020

* Tweak docstring, audio_ext, load method signature and constructor of VCTK_092

* Add test for VCTK_092 dataset.

2205cc9e

19 Aug, 2020 1 commit

Add VCTK_092 dataset (#812) · 4bfebd85

Abhishek Dubey authored Aug 19, 2020



* Added version 0.92 of VCTK dataset
Signed-off-by: Abhishek Dubey <abhi.dubey011999@gmail.com>

4bfebd85

20 Jul, 2020 1 commit

Add LibriTTS dataset (#790) · 4b8aad7a

jimchen90 authored Jul 20, 2020



* Add libritts

Add LibriTTS dataset draft

* Add libritts

Use two separate ids for utterance_id.

* Update output form

Use full_id as utterance_id.

* Update format

Add space and test black format

* Update test method

* Add audio and text test

Generate audio and test files on-the-fly in test 

* Update format

* Fix test error and remove assets libritts

The test error is fixed by sorting the file in 4th element instead of 2nd element in samples. Since the files are generated on-the-fly, so the the libritts files in assets are removed.

* Add seed in `get_whitenoise` function

* Change utterance to text

Change `_utterance` to `_text`.
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

4b8aad7a

10 Jun, 2020 1 commit

Add cmu_arctic dataset (#710) · 55b5c80c

jimchen90 authored Jun 10, 2020



* Add cmu_arctic dataset

* add dataset name

* update audio test file with whitenoise.wav file

* add test text file

* update text method and file name

* update comment

* change datasets order in doc

* add line length
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

55b5c80c

02 Jun, 2020 1 commit

Added the popular GTZAN dataset: (#668) · b0367251

Emmanouil Theofanis Chourdakis authored Jun 03, 2020



* Added the popular GTZAN dataset:

* Added the GTZAN class in torchaudio.datasets using the same format as the rest of the datasets.
* Added the appropriate test function in test_datasets.py.
* Added the GTZAN class in the datasets.rst documentation file.

* Addressed review issues in PR #668

* Added dummy noise .wav in `test/assets/`
* Removed transforms of input and output from the dataset
  `__init__` function, as well as the corresponding methods.
* Replaced rendundant `filtered` and `subset` methods from
  class initialization and also changed the corresponding
  assertion message.

* Fixed E303: too many blank lines error

* Added GTZAN to __init__.__all__

* Fixed incorrectly not importing GTZAN

* removed duplicate warning

* lint
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

b0367251

27 Apr, 2020 1 commit
- Update documentation (#568) · 3012050d
  Vincent QB authored Apr 27, 2020
```
* formatting.

* update datasets.
```
  3012050d
18 Dec, 2017 1 commit
- improve README and add sphinx docs generator · 088d5674
  Soumith Chintala authored Dec 17, 2017
  
  088d5674