datasets.rst 1.01 KB
Newer Older
1
2
.. py:module:: torchaudio.datasets

3
4
5
6
torchaudio.datasets
====================

All datasets are subclasses of :class:`torch.utils.data.Dataset`
7
and have ``__getitem__`` and ``__len__`` methods implemented.
8

9
10
11
Hence, they can all be passed to a :class:`torch.utils.data.DataLoader`
which can load multiple samples parallelly using :mod:`torch.multiprocessing` workers.
For example:
12

13
.. code::
14

15
16
17
18
19
20
   yesno_data = torchaudio.datasets.YESNO('.', download=True)
   data_loader = torch.utils.data.DataLoader(
       yesno_data,
       batch_size=1,
       shuffle=True,
       num_workers=args.nThreads)
21

22
.. currentmodule:: torchaudio.datasets
23

24
25
26
27
28
29
30
31
32
33
34
.. autosummary::
   :toctree: generated
   :nosignatures:
   :template: autosummary/dataset_class.rst

   CMUARCTIC
   CMUDict
   COMMONVOICE
   DR_VCTK
   FluentSpeechCommands
   GTZAN
Caroline Chen's avatar
Caroline Chen committed
35
   IEMOCAP
36
37
38
39
40
41
42
   LibriMix
   LIBRISPEECH
   LibriLightLimited
   LIBRITTS
   LJSPEECH
   MUSDB_HQ
   QUESST14
Zhaoheng Ni's avatar
Zhaoheng Ni committed
43
   Snips
44
45
46
47
48
49
   SPEECHCOMMANDS
   TEDLIUM
   VCTK_092
   VoxCeleb1Identification
   VoxCeleb1Verification
   YESNO