1. 19 Oct, 2022 1 commit
  2. 11 Oct, 2022 1 commit
  3. 10 Oct, 2022 1 commit
• Add unit test for LibriMix dataset (#2659) · c5b8e585
      Zhaoheng Ni authored
      Summary:
      Besides the unit test, the PR also addresses these issues:
- The original `LibriMix` dataset only supports the "min" mode, in which the audio length is the minimum length over all clean sources; this is the default for the source separation task. Users may also want the "max" mode, which allows for end-to-end separation and recognition. The PR adds a ``mode`` argument to let users decide which version of the dataset they want to use (a usage sketch follows below).
- If the task is ``"enh_both"``, the target should be the audio in ``mix_clean`` instead of the separate clean sources. The PR fixes the dataset to use ``mix_clean`` as the target.
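A minimal sketch of the new behavior (the exact keyword names and sample layout are assumptions based on this description, not the definitive API):

```python
from torchaudio.datasets import LibriMix

# "max" mode keeps full-length audio for end-to-end separation and recognition;
# "min" (the previous behavior) truncates to the shortest clean source.
dataset = LibriMix(root="/data/LibriMix", task="enh_both", mode="max")

# Assumed sample layout: (sample_rate, mixture, sources). With task="enh_both",
# the target comes from mix_clean rather than the separate clean sources.
sample_rate, mixture, sources = dataset[0]
```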
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2659
      
      Reviewed By: carolineechen
      
      Differential Revision: D40229227
      
      Pulled By: nateanl
      
      fbshipit-source-id: fc07e0d88a245e1367656d3767cf98168a799235
  4. 09 Oct, 2022 1 commit
  5. 06 Jul, 2022 1 commit
• Fix fluent test for windows (#2510) · 09daa438
      Caroline Chen authored
      Summary:
The fluent dataset test currently fails on Windows, due to newline generation in the CSV writer used in testing and incorrect path parsing in the dataset implementation.
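For reference, the usual way to avoid the extra blank rows Python's ``csv`` writer produces on Windows is to open the file with ``newline=""`` (a general sketch, not the exact change made in this PR):

```python
import csv

# Opening with newline="" stops the csv module from writing "\r\r\n" line
# endings on Windows, which would otherwise show up as blank rows.
with open("data.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["path", "transcription"])
```

Path handling is likewise more robust when built with ``os.path.join`` or ``pathlib`` instead of hard-coded "/" separators.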
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2510
      
      Reviewed By: carolineechen
      
      Differential Revision: D37573203
      
      Pulled By: mthrok
      
      fbshipit-source-id: 4868bc649690c7e596b002686c6128ce735d3564
  6. 27 Jun, 2022 1 commit
• Add VoxCeleb1 dataset (#2349) · 21b2d139
      Zhaoheng Ni authored
      Summary:
This PR adds two dataset classes for the VoxCeleb1 corpus (a brief usage sketch follows this list).
- `VoxCeleb1Identification`
Each data sample contains the waveform, sample rate, speaker id, and file id.
- `VoxCeleb1Verification`
Each data sample contains a pair of waveforms, the sample rate, a label indicating whether the two waveforms are from the same speaker, and the file ids.
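A minimal sketch of reading samples from the two classes, assuming tuple layouts that follow the descriptions above and a ``root``/``subset`` constructor (to be checked against the released documentation):

```python
from torchaudio.datasets import VoxCeleb1Identification, VoxCeleb1Verification

ident = VoxCeleb1Identification(root="/data/VoxCeleb1", subset="train")
waveform, sample_rate, speaker_id, file_id = ident[0]

verif = VoxCeleb1Verification(root="/data/VoxCeleb1")
# label indicates whether the two waveforms come from the same speaker (assumed 1/0).
wav1, wav2, sample_rate, label, file_id1, file_id2 = verif[0]
```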
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2349
      
      Reviewed By: carolineechen
      
      Differential Revision: D35927921
      
      Pulled By: nateanl
      
      fbshipit-source-id: 3e07ddd329178777698841565053eb59befe6449
  7. 23 Jun, 2022 1 commit
  8. 21 Jun, 2022 1 commit
• Create musdb handler and tests (#2484) · b92a8a09
      Sean Kim authored
      Summary:
Create the dataset handler and tests for the new MUSDB dataset. Manually tested and unit tested to verify validity. Pre-commit was run for style checks.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2484
      
      Reviewed By: carolineechen, nateanl
      
      Differential Revision: D37250556
      
      Pulled By: skim0514
      
      fbshipit-source-id: d2c8d73d22fd9d7282026265676f3eab1e178d51
  9. 20 Jun, 2022 1 commit
  10. 02 Jun, 2022 1 commit
  11. 23 May, 2022 1 commit
• Add LibriLightLimited dataset (#2302) · af9cab3b
      Zhaoheng Ni authored
      Summary:
The `LibriLightLimited` dataset is created for fine-tuning SSL models, such as Wav2Vec2 and HuBERT. It is a supervised subset of the [Libri-Light](https://github.com/facebookresearch/libri-light) dataset. To distinguish the unsupervised subset from the supervised one, it is clearer to put the latter in a separate dataset class for fine-tuning purposes.
It contains the "10 min", "1 hour", and "10 hour" splits.
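A minimal usage sketch, assuming the split is selected via a ``subset`` argument and that samples follow the LibriSpeech-style layout (both are assumptions to verify against the documentation):

```python
from torchaudio.datasets import LibriLightLimited

# Supervised 10-hour split for fine-tuning a pre-trained SSL model such as HuBERT.
dataset = LibriLightLimited(root="/data/librilight_limited", subset="10h")

waveform, sample_rate, transcript, speaker_id, chapter_id, utterance_id = dataset[0]
```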
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2302
      
      Reviewed By: mthrok
      
      Differential Revision: D36388188
      
      Pulled By: nateanl
      
      fbshipit-source-id: ba49f1c9996be17db5db41127d8ca96224c94249
  12. 20 May, 2022 1 commit
  13. 15 May, 2022 1 commit
• [codemod][usort] apply import merging for fbcode (8 of 11) · d62875cc
      John Reese authored
      Summary:
      Applies new import merging and sorting from µsort v1.0.
      
      When merging imports, µsort will make a best-effort to move associated
      comments to match merged elements, but there are known limitations due to
the dynamic nature of Python and developer tooling. These changes should
      not produce any dangerous runtime changes, but may require touch-ups to
      satisfy linters and other tooling.
      
      Note that µsort uses case-insensitive, lexicographical sorting, which
      results in a different ordering compared to isort. This provides a more
      consistent sorting order, matching the case-insensitive order used when
      sorting import statements by module name, and ensures that "frog", "FROG",
      and "Frog" always sort next to each other.
      
      For details on µsort's sorting and merging semantics, see the user guide:
      https://usort.readthedocs.io/en/stable/guide.html#sorting
      
      Reviewed By: lisroach
      
      Differential Revision: D36402214
      
      fbshipit-source-id: b641bfa9d46242188524d4ae2c44998922a62b4c
  14. 18 Apr, 2022 1 commit
  15. 30 Dec, 2021 1 commit
  16. 23 Dec, 2021 1 commit
  17. 08 Oct, 2021 1 commit
  18. 06 Oct, 2021 2 commits
  19. 05 Oct, 2021 1 commit
  20. 02 Aug, 2021 1 commit
  21. 02 Mar, 2021 1 commit
  22. 24 Feb, 2021 1 commit
  23. 08 Feb, 2021 1 commit
  24. 05 Jan, 2021 6 commits
  25. 30 Dec, 2020 4 commits
  26. 27 Dec, 2020 1 commit
  27. 21 Dec, 2020 1 commit
• Remove walk_files (#1111) · 8187dc0a
      Aziz authored
The use of `walk_files` made it ambiguous who is responsible for locating
the correct set of files (the Dataset class, or the utility?).
In fact, globbing everything is not the right approach when implementing a
Dataset: for a specific dataset, the directory structure and file locations
are already determined, so there is no need for an arbitrary number of recursive walks.
Each Dataset implementation should glob exactly the set of files it requires.
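As an illustration of that principle, a dataset with a known layout can glob exactly the files it needs in one pass (a sketch with a hypothetical directory layout, not code from this change):

```python
from pathlib import Path

from torch.utils.data import Dataset


class ExampleDataset(Dataset):
    """Hypothetical dataset whose layout is root/<speaker>/<utterance>.wav."""

    def __init__(self, root: str):
        # The directory structure is fixed, so one targeted glob replaces a
        # recursive walk over the whole tree.
        self._files = sorted(Path(root).glob("*/*.wav"))

    def __len__(self) -> int:
        return len(self._files)

    def __getitem__(self, index: int) -> Path:
        return self._files[index]
```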
  28. 18 Dec, 2020 1 commit
  29. 11 Dec, 2020 1 commit
  30. 03 Dec, 2020 1 commit
  31. 18 Nov, 2020 1 commit