Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenpangpang
transformers
Commits
9b42c68f
"...git@developer.sourcefind.cn:OpenDAS/torchaudio.git" did not exist on "d1f747fb639235cfcf5e6828038218e72dfb8e0f"
Unverified
Commit
9b42c68f
authored
Jan 19, 2023
by
Matthijs Hollemans
Committed by
GitHub
Jan 19, 2023
Browse files
hertz is already per second (#21188)
parent
4bc18e7a
Changes
6
Hide whitespace changes
Inline
Side-by-side
Showing
6 changed files
with
6 additions
and
6 deletions
+6
-6
src/transformers/feature_extraction_sequence_utils.py
src/transformers/feature_extraction_sequence_utils.py
+1
-1
src/transformers/models/audio_spectrogram_transformer/feature_extraction_audio_spectrogram_transformer.py
...ormer/feature_extraction_audio_spectrogram_transformer.py
+1
-1
src/transformers/models/mctct/feature_extraction_mctct.py
src/transformers/models/mctct/feature_extraction_mctct.py
+1
-1
src/transformers/models/speech_to_text/feature_extraction_speech_to_text.py
...odels/speech_to_text/feature_extraction_speech_to_text.py
+1
-1
src/transformers/models/wav2vec2/feature_extraction_wav2vec2.py
...ansformers/models/wav2vec2/feature_extraction_wav2vec2.py
+1
-1
src/transformers/models/whisper/feature_extraction_whisper.py
...transformers/models/whisper/feature_extraction_whisper.py
+1
-1
No files found.
src/transformers/feature_extraction_sequence_utils.py
View file @
9b42c68f
...
...
@@ -34,7 +34,7 @@ class SequenceFeatureExtractor(FeatureExtractionMixin):
feature_size (`int`):
The feature dimension of the extracted features.
sampling_rate (`int`):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
padding_value (`float`):
The value that is used to fill the padding values / vectors.
"""
...
...
src/transformers/models/audio_spectrogram_transformer/feature_extraction_audio_spectrogram_transformer.py
View file @
9b42c68f
...
...
@@ -44,7 +44,7 @@ class ASTFeatureExtractor(SequenceFeatureExtractor):
feature_size (`int`, *optional*, defaults to 1):
The feature dimension of the extracted features.
sampling_rate (`int`, *optional*, defaults to 16000):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
num_mel_bins (`int`, *optional*, defaults to 128):
Number of Mel-frequency bins.
max_length (`int`, *optional*, defaults to 1024):
...
...
src/transformers/models/mctct/feature_extraction_mctct.py
View file @
9b42c68f
...
...
@@ -53,7 +53,7 @@ class MCTCTFeatureExtractor(SequenceFeatureExtractor):
feature_size (`int`, defaults to 80):
The feature dimension of the extracted features. This is the number of mel_frequency
sampling_rate (`int`, defaults to 16000):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
padding_value (`float`, defaults to 0.0):
The value that is used to fill the padding values.
hop_length (`int`, defaults to 10):
...
...
src/transformers/models/speech_to_text/feature_extraction_speech_to_text.py
View file @
9b42c68f
...
...
@@ -44,7 +44,7 @@ class Speech2TextFeatureExtractor(SequenceFeatureExtractor):
feature_size (`int`, defaults to 80):
The feature dimension of the extracted features.
sampling_rate (`int`, defaults to 16000):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
num_mel_bins (`int`, defaults to 80):
Number of Mel-frequency bins.
padding_value (`float`, defaults to 0.0):
...
...
src/transformers/models/wav2vec2/feature_extraction_wav2vec2.py
View file @
9b42c68f
...
...
@@ -39,7 +39,7 @@ class Wav2Vec2FeatureExtractor(SequenceFeatureExtractor):
feature_size (`int`, defaults to 1):
The feature dimension of the extracted features.
sampling_rate (`int`, defaults to 16000):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
padding_value (`float`, defaults to 0.0):
The value that is used to fill the padding values.
do_normalize (`bool`, *optional*, defaults to `True`):
...
...
src/transformers/models/whisper/feature_extraction_whisper.py
View file @
9b42c68f
...
...
@@ -43,7 +43,7 @@ class WhisperFeatureExtractor(SequenceFeatureExtractor):
feature_size (`int`, defaults to 80):
The feature dimension of the extracted features.
sampling_rate (`int`, defaults to 16000):
The sampling rate at which the audio files should be digitalized expressed in
H
ertz
per second
(Hz).
The sampling rate at which the audio files should be digitalized expressed in
h
ertz (Hz).
hop_length (`int`, defaults to 160):
Length of the overlaping windows for the STFT used to obtain the Mel Frequency coefficients.
chunk_length (`int`, defaults to 30):
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment