<!--Copyright 2023 The HuggingFace Team. All rights reserved.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
specific language governing permissions and limitations under the License.

⚠️ Note that this file is in Markdown but contains specific syntax for our doc-builder (similar to MDX) that may not be
rendered properly in your Markdown viewer.

-->

# Utilities for `FeatureExtractors`

This page lists the utility functions that an audio [`FeatureExtractor`] can use to compute features from raw audio with common algorithms such as the *Short-Time Fourier Transform* or the *log-mel spectrogram*.

Most of these are useful only if you are studying the code of the audio processors in the library.
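
As a rough illustration of the kind of computation involved, the sketch below converts between hertz and the mel scale using the widely used HTK-style formula. It is written in plain Python and does not call the library. The names `hz_to_mel_htk` and `mel_to_hz_htk` are hypothetical, and the library's own functions may support other mel-scale variants and array inputs.

```python
import math


def hz_to_mel_htk(freq_hz: float) -> float:
    """Convert a frequency in hertz to mels (HTK-style formula, assumed here)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz_htk(mels: float) -> float:
    """Invert hz_to_mel_htk: convert mels back to hertz."""
    return 700.0 * (10.0 ** (mels / 2595.0) - 1.0)


if __name__ == "__main__":
    # Round-trip a few frequencies to check that the two functions are inverses.
    for freq in (0.0, 440.0, 1000.0, 8000.0):
        mels = hz_to_mel_htk(freq)
        print(f"{freq:8.1f} Hz -> {mels:8.2f} mel -> {mel_to_hz_htk(mels):8.1f} Hz")
```

With this formula, 1000 Hz maps to roughly 1000 mel, which is the reference point the scale was designed around.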

## Audio Transformations

[[autodoc]] audio_utils.hertz_to_mel

[[autodoc]] audio_utils.mel_to_hertz

[[autodoc]] audio_utils.mel_filter_bank

[[autodoc]] audio_utils.optimal_fft_length

[[autodoc]] audio_utils.window_function

[[autodoc]] audio_utils.spectrogram

[[autodoc]] audio_utils.power_to_db

[[autodoc]] audio_utils.amplitude_to_db
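
The two decibel conversions listed last differ only in their scaling factor: power values use `10 * log10`, amplitude values use `20 * log10`. The plain-Python sketch below illustrates that relationship under stated assumptions. The names `power_to_decibels` and `amplitude_to_decibels` and the `reference` and `floor` parameters are hypothetical and chosen for illustration; they are not the library's signatures, which operate on arrays and may offer further options.

```python
import math
from typing import List, Sequence


def power_to_decibels(
    power: Sequence[float], reference: float = 1.0, floor: float = 1e-10
) -> List[float]:
    """Convert power values to decibels relative to `reference`.

    Values below `floor` are clamped first so that log10 never receives zero.
    """
    return [10.0 * math.log10(max(p, floor) / reference) for p in power]


def amplitude_to_decibels(
    amplitude: Sequence[float], reference: float = 1.0, floor: float = 1e-5
) -> List[float]:
    """Convert amplitude values to decibels; note the factor 20 instead of 10."""
    return [20.0 * math.log10(max(a, floor) / reference) for a in amplitude]


if __name__ == "__main__":
    print(power_to_decibels([1.0, 10.0, 100.0]))
    print(amplitude_to_decibels([1.0, 10.0, 100.0]))
```

Because power is proportional to amplitude squared, an amplitude ratio of 10 and a power ratio of 100 both correspond to 20 dB.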