Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenpangpang
transformers
Commits
9b42c68f
Unverified
Commit
9b42c68f
authored
Jan 19, 2023
by
Matthijs Hollemans
Committed by
GitHub
Jan 19, 2023
Browse files
hertz is already per second (#21188)
parent
4bc18e7a
Changes
6
Hide whitespace changes
Inline
Side-by-side
Showing
6 changed files
with
6 additions
and
6 deletions
+6
-6
src/transformers/feature_extraction_sequence_utils.py
src/transformers/feature_extraction_sequence_utils.py
+1
-1
src/transformers/models/audio_spectrogram_transformer/feature_extraction_audio_spectrogram_transformer.py
...ormer/feature_extraction_audio_spectrogram_transformer.py
+1
-1
src/transformers/models/mctct/feature_extraction_mctct.py
src/transformers/models/mctct/feature_extraction_mctct.py
+1
-1
src/transformers/models/speech_to_text/feature_extraction_speech_to_text.py
...odels/speech_to_text/feature_extraction_speech_to_text.py
+1
-1
src/transformers/models/wav2vec2/feature_extraction_wav2vec2.py
...ansformers/models/wav2vec2/feature_extraction_wav2vec2.py
+1
-1
src/transformers/models/whisper/feature_extraction_whisper.py
...transformers/models/whisper/feature_extraction_whisper.py
+1
-1
No files found.
src/transformers/feature_extraction_sequence_utils.py
View file @
9b42c68f
...
@@ -34,7 +34,7 @@ class SequenceFeatureExtractor(FeatureExtractionMixin):
...
@@ -34,7 +34,7 @@ class SequenceFeatureExtractor(FeatureExtractionMixin):
feature_size (`int`):
feature_size (`int`):
The feature dimension of the extracted features.
The feature dimension of the extracted features.
sampling_rate (`int`):
sampling_rate (`int`):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
padding_value (`float`):
padding_value (`float`):
The value that is used to fill the padding values / vectors.
The value that is used to fill the padding values / vectors.
"""
"""
...
...
src/transformers/models/audio_spectrogram_transformer/feature_extraction_audio_spectrogram_transformer.py
View file @
9b42c68f
...
@@ -44,7 +44,7 @@ class ASTFeatureExtractor(SequenceFeatureExtractor):
...
@@ -44,7 +44,7 @@ class ASTFeatureExtractor(SequenceFeatureExtractor):
feature_size (`int`, *optional*, defaults to 1):
feature_size (`int`, *optional*, defaults to 1):
The feature dimension of the extracted features.
The feature dimension of the extracted features.
sampling_rate (`int`, *optional*, defaults to 16000):
sampling_rate (`int`, *optional*, defaults to 16000):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
num_mel_bins (`int`, *optional*, defaults to 128):
num_mel_bins (`int`, *optional*, defaults to 128):
Number of Mel-frequency bins.
Number of Mel-frequency bins.
max_length (`int`, *optional*, defaults to 1024):
max_length (`int`, *optional*, defaults to 1024):
...
...
src/transformers/models/mctct/feature_extraction_mctct.py
View file @
9b42c68f
...
@@ -53,7 +53,7 @@ class MCTCTFeatureExtractor(SequenceFeatureExtractor):
...
@@ -53,7 +53,7 @@ class MCTCTFeatureExtractor(SequenceFeatureExtractor):
feature_size (`int`, defaults to 80):
feature_size (`int`, defaults to 80):
The feature dimension of the extracted features. This is the number of mel_frequency
The feature dimension of the extracted features. This is the number of mel_frequency
sampling_rate (`int`, defaults to 16000):
sampling_rate (`int`, defaults to 16000):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
padding_value (`float`, defaults to 0.0):
padding_value (`float`, defaults to 0.0):
The value that is used to fill the padding values.
The value that is used to fill the padding values.
hop_length (`int`, defaults to 10):
hop_length (`int`, defaults to 10):
...
...
src/transformers/models/speech_to_text/feature_extraction_speech_to_text.py
View file @
9b42c68f
...
@@ -44,7 +44,7 @@ class Speech2TextFeatureExtractor(SequenceFeatureExtractor):
...
@@ -44,7 +44,7 @@ class Speech2TextFeatureExtractor(SequenceFeatureExtractor):
feature_size (`int`, defaults to 80):
feature_size (`int`, defaults to 80):
The feature dimension of the extracted features.
The feature dimension of the extracted features.
sampling_rate (`int`, defaults to 16000):
sampling_rate (`int`, defaults to 16000):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
num_mel_bins (`int`, defaults to 80):
num_mel_bins (`int`, defaults to 80):
Number of Mel-frequency bins.
Number of Mel-frequency bins.
padding_value (`float`, defaults to 0.0):
padding_value (`float`, defaults to 0.0):
...
...
src/transformers/models/wav2vec2/feature_extraction_wav2vec2.py
View file @
9b42c68f
...
@@ -39,7 +39,7 @@ class Wav2Vec2FeatureExtractor(SequenceFeatureExtractor):
...
@@ -39,7 +39,7 @@ class Wav2Vec2FeatureExtractor(SequenceFeatureExtractor):
feature_size (`int`, defaults to 1):
feature_size (`int`, defaults to 1):
The feature dimension of the extracted features.
The feature dimension of the extracted features.
sampling_rate (`int`, defaults to 16000):
sampling_rate (`int`, defaults to 16000):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
padding_value (`float`, defaults to 0.0):
padding_value (`float`, defaults to 0.0):
The value that is used to fill the padding values.
The value that is used to fill the padding values.
do_normalize (`bool`, *optional*, defaults to `True`):
do_normalize (`bool`, *optional*, defaults to `True`):
...
...
src/transformers/models/whisper/feature_extraction_whisper.py
View file @
9b42c68f
...
@@ -43,7 +43,7 @@ class WhisperFeatureExtractor(SequenceFeatureExtractor):
...
@@ -43,7 +43,7 @@ class WhisperFeatureExtractor(SequenceFeatureExtractor):
feature_size (`int`, defaults to 80):
feature_size (`int`, defaults to 80):
The feature dimension of the extracted features.
The feature dimension of the extracted features.
sampling_rate (`int`, defaults to 16000):
sampling_rate (`int`, defaults to 16000):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
hop_length (`int`, defaults to 160):
hop_length (`int`, defaults to 160):
Length of the overlaping windows for the STFT used to obtain the Mel Frequency coefficients.
Length of the overlaping windows for the STFT used to obtain the Mel Frequency coefficients.
chunk_length (`int`, defaults to 30):
chunk_length (`int`, defaults to 30):
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment