torchaudio.datasets
====================

All datasets are subclasses of :class:`torch.utils.data.Dataset`,
i.e., they implement the ``__getitem__`` and ``__len__`` methods.
Hence, they can all be passed to a :class:`torch.utils.data.DataLoader`,
which can load multiple samples in parallel using ``torch.multiprocessing`` workers.
For example: ::
    
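    import torch
    import torchaudio

    # Download the YESNO dataset into the current directory if it is not there yet.
    yesno_data = torchaudio.datasets.YESNO('.', download=True)
    # num_workers sets how many worker processes load samples (2 is an arbitrary choice).
    data_loader = torch.utils.data.DataLoader(yesno_data,
                                              batch_size=1,
                                              shuffle=True,
                                              num_workers=2)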

The following datasets are available:

.. contents:: Datasets
    :local:

All the datasets share a similar API. They all accept two common arguments,
``transform`` and ``target_transform``, which transform the input and the target, respectively.
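
The role of these two arguments can be sketched without downloading any data.
The example below is only an illustration under stated assumptions: ``ToyDataset``,
``peak_normalize``, ``label_to_index`` and the sample values are hypothetical and
are not part of torchaudio. It shows a class that implements ``__getitem__`` and
``__len__`` and applies the optional ``transform`` and ``target_transform``
callables when a sample is fetched.

```python
class ToyDataset:
    """Hypothetical in-memory dataset; not part of torchaudio."""

    def __init__(self, samples, transform=None, target_transform=None):
        # samples: list of (input, target) pairs
        self.samples = samples
        self.transform = transform
        self.target_transform = target_transform

    def __getitem__(self, index):
        item, target = self.samples[index]
        # Apply each callable only if the caller supplied one.
        if self.transform is not None:
            item = self.transform(item)
        if self.target_transform is not None:
            target = self.target_transform(target)
        return item, target

    def __len__(self):
        return len(self.samples)


def peak_normalize(waveform):
    # Scale so the largest absolute value becomes 1 (silence is left as is).
    peak = max(abs(x) for x in waveform) or 1.0
    return [x / peak for x in waveform]


# Made-up data: short "waveforms" as lists of floats, labels as strings.
samples = [([0.5, -1.0, 0.25], "yes"), ([2.0, 0.0, -4.0], "no")]
label_to_index = {"no": 0, "yes": 1}

dataset = ToyDataset(samples,
                     transform=peak_normalize,
                     target_transform=label_to_index.get)
```

Here ``dataset[i]`` returns the normalized waveform together with an integer
label, while the stored samples stay untouched. The transforms run lazily, once
per fetched sample.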


.. currentmodule:: torchaudio.datasets 


COMMONVOICE
~~~~~~~~~~~

.. autoclass:: COMMONVOICE
  :members: __getitem__
  :special-members:


LIBRISPEECH
~~~~~~~~~~~

.. autoclass:: LIBRISPEECH
  :members: __getitem__
  :special-members:


LJSPEECH
~~~~~~~~

.. autoclass:: LJSPEECH
  :members: __getitem__
  :special-members:


SPEECHCOMMANDS
~~~~~~~~~~~~~~

.. autoclass:: SPEECHCOMMANDS
  :members: __getitem__
  :special-members:


VCTK
~~~~

.. autoclass:: VCTK
  :members: __getitem__
  :special-members:

YESNO
~~~~~

.. autoclass:: YESNO
  :members: __getitem__
  :special-members: