datasets.rst 1.54 KB
Newer Older
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
torchaudio.datasets
====================

All datasets are subclasses of :class:`torch.utils.data.Dataset`
i.e, they have ``__getitem__`` and ``__len__`` methods implemented.
Hence, they can all be passed to a :class:`torch.utils.data.DataLoader`
which can load multiple samples parallelly using ``torch.multiprocessing`` workers. 
For example: ::
    
    yesno_data = torchaudio.datasets.YESNO('.', download=True)
    data_loader = torch.utils.data.DataLoader(yesno_data, 
                                              batch_size=1,
                                              shuffle=True,
                                              num_workers=args.nThreads)

The following datasets are available:

.. contents:: Datasets
    :local:

All the datasets have almost similar API. They all have two common arguments:
``transform`` and  ``target_transform`` to transform the input and target respectively.


.. currentmodule:: torchaudio.datasets 


Vincent QB's avatar
Vincent QB committed
28
29
30
31
32
33
34
35
COMMONVOICE
~~~~~~~~~~~

.. autoclass:: COMMONVOICE
  :members: __getitem__
  :special-members:


36
37
38
39
40
41
42
43
GTZAN
~~~~~

.. autoclass:: GTZAN
  :members: __getitem__
  :special-members:


Vincent QB's avatar
Vincent QB committed
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
LIBRISPEECH
~~~~~~~~~~~

.. autoclass:: LIBRISPEECH
  :members: __getitem__
  :special-members:


LJSPEECH
~~~~~~~~

.. autoclass:: LJSPEECH
  :members: __getitem__
  :special-members:


SPEECHCOMMANDS
~~~~~~~~~~~~~~

.. autoclass:: SPEECHCOMMANDS
  :members: __getitem__
  :special-members:


68
VCTK
Vincent QB's avatar
Vincent QB committed
69
~~~~
70
71
72
73
74
75

.. autoclass:: VCTK
  :members: __getitem__
  :special-members:

YESNO
Vincent QB's avatar
Vincent QB committed
76
~~~~~
77
78
79
80

.. autoclass:: YESNO
  :members: __getitem__
  :special-members: