- 18 Dec, 2020 1 commit
-
-
moto authored
* Removes code for download logics * [BC-breaking] Changes the meaning of `root` argument to the exact directory of the dataset * Deprecates the constructor arguments for download and subdirectory construction
-
- 15 Dec, 2020 1 commit
-
-
Krishna Kalyan authored
- yesno - librispeech - libritts - speechcommands Co-authored-by:
krishnakalyan3 <skalyan@cloudera.com> Co-authored-by:
Vincent Quenneville-Belair <vincentqb@gmail.com>
-
- 11 Dec, 2020 2 commits
-
-
moto authored
-
- 08 Dec, 2020 1 commit
-
-
moto authored
* Import torchaudio #1034 70f429a4 Summary: Import torchaudio #1027 0cf4b8a9 Reviewed By: vincentqb, cpuhrsch Differential Revision: D24958707 fbshipit-source-id: d06dd6b59197cc2c16bec5a9012cbf33a172b6b3 * Import torchaudio #1066 4406a6bb Summary: Import up to #1066 Reviewed By: cpuhrsch Differential Revision: D25373068 fbshipit-source-id: 890d36a25259b93428b3037c3123ff5a2cacfa04
-
- 03 Dec, 2020 1 commit
-
-
Vincent QB authored
no longer allow to download the dataset directly. deprecate: download and url. add language.
-
- 18 Nov, 2020 2 commits
-
-
Bhargav Kathivarapu authored
-
Bhargav Kathivarapu authored
-
- 17 Nov, 2020 1 commit
-
-
Bhargav Kathivarapu authored
-
- 16 Nov, 2020 2 commits
-
-
Kshiteej K authored
Co-authored-by:Vincent QB <vincentqb@users.noreply.github.com>
-
Bhargav Kathivarapu authored
-
- 13 Nov, 2020 3 commits
-
-
Kshiteej K authored
-
Bhargav Kathivarapu authored
Co-authored-by:Vincent QB <vincentqb@users.noreply.github.com>
-
Kshiteej K authored
-
- 27 Oct, 2020 1 commit
-
-
Vincent QB authored
-
- 13 Oct, 2020 1 commit
-
-
moto authored
-
- 09 Oct, 2020 1 commit
-
-
Vincent QB authored
and add test on other backend.
-
- 02 Oct, 2020 1 commit
-
-
moto authored
-
- 15 Sep, 2020 1 commit
-
-
Jaime Ferrando Huertas authored
-
- 20 Aug, 2020 1 commit
-
-
JianwuXu authored
* Tweak docstring, audio_ext, load method signature and constructor of VCTK_092 * Add test for VCTK_092 dataset.
-
- 19 Aug, 2020 1 commit
-
-
Abhishek Dubey authored
* Added version 0.92 of VCTK dataset Signed-off-by:Abhishek Dubey <abhi.dubey011999@gmail.com>
-
- 27 Jul, 2020 1 commit
-
-
Lawrence Chen authored
Co-authored-by:lawrencechen <lawrencechen@devvm3189.vll0.facebook.com>
-
- 23 Jul, 2020 2 commits
- 20 Jul, 2020 1 commit
-
-
jimchen90 authored
* Add libritts Add LibriTTS dataset draft * Add libritts Use two separate ids for utterance_id. * Update output form Use full_id as utterance_id. * Update format Add space and test black format * Update test method * Add audio and text test Generate audio and test files on-the-fly in test * Update format * Fix test error and remove assets libritts The test error is fixed by sorting the file in 4th element instead of 2nd element in samples. Since the files are generated on-the-fly, so the the libritts files in assets are removed. * Add seed in `get_whitenoise` function * Change utterance to text Change `_utterance` to `_text`. Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
- 17 Jul, 2020 1 commit
-
-
Emmanouil Theofanis Chourdakis authored
* Addressed review issues in PR #668 * Changed GTZAN so that it only traverses filenames belonging to the dataset Now, instead of walking the whole directory and subdirectories of the dataset GTZAN only looks for files under a `genre`/`genre`.`5 digit number`.wav format, where `genre` is an allowed GTZAN genre label. This allows moving or removing files from the dataset (e.g. for fixing duplication or mislabeling issues).
-
- 10 Jun, 2020 1 commit
-
-
jimchen90 authored
* Add cmu_arctic dataset * add dataset name * update audio test file with whitenoise.wav file * add test text file * update text method and file name * update comment * change datasets order in doc * add line length Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
- 02 Jun, 2020 1 commit
-
-
Emmanouil Theofanis Chourdakis authored
* Added the popular GTZAN dataset: * Added the GTZAN class in torchaudio.datasets using the same format as the rest of the datasets. * Added the appropriate test function in test_datasets.py. * Added the GTZAN class in the datasets.rst documentation file. * Addressed review issues in PR #668 * Added dummy noise .wav in `test/assets/` * Removed transforms of input and output from the dataset `__init__` function, as well as the corresponding methods. * Replaced rendundant `filtered` and `subset` methods from class initialization and also changed the corresponding assertion message. * Fixed E303: too many blank lines error * Added GTZAN to __init__.__all__ * Fixed incorrectly not importing GTZAN * removed duplicate warning * lint Co-authored-by:Vincent QB <vincentqb@users.noreply.github.com>
-
- 21 Apr, 2020 2 commits
-
-
Tomás Osório authored
* fix inline typing for mypy * fix flake8 * change check position * fix for py3.5 * fix for py3.5 * change to inline typing * add inline typing
-
Bhargav Kathivarapu authored
* add checksums * checksum function changes * function Docstring change * checksums moved to Dataset Modules
-
- 07 Apr, 2020 1 commit
-
-
Tomás Osório authored
* add inline typing to utils Dataset * add inline typing to common_utils * add missing inline typing * add typing to kwarg * add missing inline typing * update docstring * undo indentation Co-authored-by:Vincent QB <vincentqb@users.noreply.github.com>
-
- 06 Apr, 2020 1 commit
-
-
Tomás Osório authored
* add CommonDataset Inline typing * inline Typing librispeech * add inline typing ljspeech * add inline typing speechcommands * add inline typing to vctk * add inline typing yesno * apply type to __getitem__ Co-authored-by:Vincent QB <vincentqb@users.noreply.github.com>
-
- 03 Apr, 2020 1 commit
-
-
Tomás Osório authored
* fix download * fix reading tsv archive * add new languages * maintain same structure as other datasets * update CommonVoice Tests * fix * change directory name * remove extra line
-
- 02 Apr, 2020 1 commit
-
-
Tomás Osório authored
* fix issue with VCTK dataset * update docstring * filter out folder p315 * add hidden except_folder has hidden variable * maintain structure * lint * remove space Co-authored-by:Vincent QB <vincentqb@users.noreply.github.com>
-
- 01 Apr, 2020 1 commit
-
-
Bhargav Kathivarapu authored
-
- 22 Feb, 2020 1 commit
-
-
Tomás Osório authored
* add speechcommand dataset and test * prepend the full path to each result * add missing param on docstring in walk_files * add file to run tests on SpeechCommand Dataset * reduce logic * update test on SpeechCommands * correct the indentation on docstring walk_files * flake8 compliance * change tuple type returned. move path split logic in load item. * typo in name. * redundant file path. * filter background noise. Co-authored-by:Vincent QB <vincentqb@users.noreply.github.com>
-
- 20 Feb, 2020 1 commit
-
-
Taras Sereda authored
* LJ Speech dataset * refactoring as per @vincentqb's suggestions
-
- 12 Feb, 2020 1 commit
-
-
Vincent QB authored
-
- 13 Jan, 2020 1 commit
-
-
Vincent QB authored
-
- 27 Dec, 2019 1 commit
-
-
Karl Ostmo authored
* Declare file encoding to support special characters * fix missing utf_8_encoder error in Travis tests * Py 2.7 backwards-compat iterator * ensure integer argument to torch.nn.functional.pad * cast match.ceil result as integer
-