.. role:: hidden
    :class: hidden-section

torchaudio.models
=================

.. currentmodule:: torchaudio.models

The models subpackage contains definitions of models for addressing common audio tasks.


ConvTasNet
~~~~~~~~~~

.. autoclass:: ConvTasNet

  .. automethod:: forward


DeepSpeech
~~~~~~~~~~

.. autoclass:: DeepSpeech

  .. automethod:: forward


Tacotron2
~~~~~~~~~

Model
-----

Tacotron2
^^^^^^^^^

.. autoclass:: Tacotron2

  .. automethod:: forward

  .. automethod:: infer

Factory Functions
-----------------

tacotron2
^^^^^^^^^

.. autofunction:: tacotron2


Wav2Letter
~~~~~~~~~~

.. autoclass:: Wav2Letter

  .. automethod:: forward


Wav2Vec2.0 / HuBERT
~~~~~~~~~~~~~~~~~~~

Model
-----

Wav2Vec2Model
^^^^^^^^^^^^^

.. autoclass:: Wav2Vec2Model

  .. automethod:: extract_features

  .. automethod:: forward

Factory Functions
-----------------

wav2vec2_model
^^^^^^^^^^^^^^

.. autofunction:: wav2vec2_model


wav2vec2_base
^^^^^^^^^^^^^

.. autofunction:: wav2vec2_base

wav2vec2_large
^^^^^^^^^^^^^^

.. autofunction:: wav2vec2_large

wav2vec2_large_lv60k
^^^^^^^^^^^^^^^^^^^^

.. autofunction:: wav2vec2_large_lv60k

hubert_base
^^^^^^^^^^^

.. autofunction:: hubert_base

hubert_large
^^^^^^^^^^^^

.. autofunction:: hubert_large

hubert_xlarge
^^^^^^^^^^^^^

.. autofunction:: hubert_xlarge

Pre-trained Models
------------------

.. autoclass:: Wav2Vec2PretrainedModelBundle

   .. automethod:: get_model

   .. autoproperty:: labels


WAV2VEC2_BASE
^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_BASE
      :no-value:

WAV2VEC2_ASR_BASE_10M
^^^^^^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: torchaudio.models.WAV2VEC2_ASR_BASE_10M
      :no-value:

WAV2VEC2_ASR_BASE_100H
^^^^^^^^^^^^^^^^^^^^^^
      
.. container:: py attribute

   .. autodata:: WAV2VEC2_ASR_BASE_100H
      :no-value:

WAV2VEC2_ASR_BASE_960H
^^^^^^^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_ASR_BASE_960H
      :no-value:

WAV2VEC2_LARGE
^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_LARGE
      :no-value:

WAV2VEC2_ASR_LARGE_10M
^^^^^^^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_ASR_LARGE_10M
      :no-value:

WAV2VEC2_ASR_LARGE_100H
^^^^^^^^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_ASR_LARGE_100H
      :no-value:

WAV2VEC2_ASR_LARGE_960H
^^^^^^^^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_ASR_LARGE_960H
      :no-value:

WAV2VEC2_LARGE_LV60K
^^^^^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_LARGE_LV60K
      :no-value:

WAV2VEC2_ASR_LARGE_LV60K_10M
^^^^^^^^^^^^^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_ASR_LARGE_LV60K_10M
      :no-value:

WAV2VEC2_ASR_LARGE_LV60K_100H
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_ASR_LARGE_LV60K_100H
      :no-value:

WAV2VEC2_ASR_LARGE_LV60K_960H
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_ASR_LARGE_LV60K_960H
      :no-value:

WAV2VEC2_XLSR53
^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: WAV2VEC2_XLSR53
      :no-value:

HUBERT_BASE
^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: HUBERT_BASE
      :no-value:

HUBERT_LARGE
^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: HUBERT_LARGE
      :no-value:

HUBERT_XLARGE
^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: HUBERT_XLARGE
      :no-value:

HUBERT_ASR_LARGE
^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: HUBERT_ASR_LARGE
      :no-value:

HUBERT_ASR_XLARGE
^^^^^^^^^^^^^^^^^

.. container:: py attribute

   .. autodata:: HUBERT_ASR_XLARGE
      :no-value:

Utility Functions
-----------------

.. currentmodule:: torchaudio.models.wav2vec2.utils

import_huggingface_model
^^^^^^^^^^^^^^^^^^^^^^^^

.. autofunction:: import_huggingface_model

import_fairseq_model
^^^^^^^^^^^^^^^^^^^^

.. autofunction:: import_fairseq_model

.. currentmodule:: torchaudio.models

WaveRNN
~~~~~~~

Model
-----

WaveRNN
^^^^^^^

.. autoclass:: WaveRNN

  .. automethod:: forward

  .. automethod:: infer

Factory Functions
-----------------

wavernn
^^^^^^^

.. autofunction:: wavernn

References
~~~~~~~~~~

.. footbibliography::