models.rst 1.42 KB
Newer Older
1
.. py:module:: torchaudio.models
Tomás Osório's avatar
Tomás Osório committed
2
3

torchaudio.models
moto's avatar
moto committed
4
=================
Tomás Osório's avatar
Tomás Osório committed
5
6
7

.. currentmodule:: torchaudio.models

8
The ``torchaudio.models`` subpackage contains definitions of models for addressing common audio tasks.
moto's avatar
moto committed
9

10
For pre-trained models, please refer to :mod:`torchaudio.pipelines` module.
11

12
Model Definitions
13
14
-----------------

15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
Model defintions are responsible for constructing computation graphs and executing them.

Some models have complex structure and variations.
For such models, `Factory Functions`_ are provided.

.. autosummary::
   :toctree: generated
   :nosignatures:
   :template: autosummary/model_class.rst

   Conformer
   ConvTasNet
   DeepSpeech
   Emformer
   HDemucs
   HuBERTPretrainModel
   RNNT
   RNNTBeamSearch
   Tacotron2
   Wav2Letter
   Wav2Vec2Model
   WaveRNN
37

moto's avatar
moto committed
38
39
40
Factory Functions
-----------------

41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
.. autosummary::
   :toctree: generated
   :nosignatures:

   conv_tasnet_base
   emformer_rnnt_model
   emformer_rnnt_base
   wav2vec2_model
   wav2vec2_base
   wav2vec2_large
   wav2vec2_large_lv60k
   hubert_base
   hubert_large
   hubert_xlarge
   hubert_pretrain_model
   hubert_pretrain_base
   hubert_pretrain_large
   hubert_pretrain_xlarge
   hdemucs_low
   hdemucs_medium
   hdemucs_high
62
63
64
   wavlm_model
   wavlm_base
   wavlm_large
65

66
67
68
Utility Functions
-----------------

69
70
71
.. autosummary::
   :toctree: generated
   :nosignatures:
moto's avatar
moto committed
72

73
74
   ~wav2vec2.utils.import_fairseq_model
   ~wav2vec2.utils.import_huggingface_model