Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenpangpang
transformers
Commits
9799f4e1
"vscode:/vscode.git/clone" did not exist on "c673727e0efff7f63674abe10685e9b7adef3174"
Unverified
Commit
9799f4e1
authored
Oct 26, 2021
by
Patrick von Platen
Committed by
GitHub
Oct 26, 2021
Browse files
Update README.md
parent
bfd81766
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
0 deletions
+4
-0
examples/pytorch/speech-recognition/README.md
examples/pytorch/speech-recognition/README.md
+4
-0
No files found.
examples/pytorch/speech-recognition/README.md
View file @
9799f4e1
...
@@ -118,6 +118,7 @@ It has been verified that the script works for the following datasets:
...
@@ -118,6 +118,7 @@ It has been verified that the script works for the following datasets:
-
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
-
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
-
[
Librispeech
](
https://huggingface.co/datasets/librispeech_asr
)
-
[
Librispeech
](
https://huggingface.co/datasets/librispeech_asr
)
-
[
TIMIT
](
https://huggingface.co/datasets/timit_asr
)
| Dataset | Dataset Config | Pretrained Model | Word error rate on eval | GPU setup | Training time | Fine-tuned Model & Logs |
| Dataset | Dataset Config | Pretrained Model | Word error rate on eval | GPU setup | Training time | Fine-tuned Model & Logs |
|-------|------------------------------|-------------|---------------|---------------|----------------------|-------------|
|-------|------------------------------|-------------|---------------|---------------|----------------------|-------------|
...
@@ -125,3 +126,6 @@ It has been verified that the script works for the following datasets:
...
@@ -125,3 +126,6 @@ It has been verified that the script works for the following datasets:
|
[
Librispeech
](
https://huggingface.co/datasets/librispeech_asr
)
|
`"clean"`
-
`"train.100"`
|
[
facebook/hubert-large-ll60k
](
https://huggingface.co/facebook/hubert-large-ll60k
)
| 0.088 | 8 GPU V100 | 1h30min |
[
here
](
https://huggingface.co/patrickvonplaten/hubert-librispeech-clean-100h-demo-dist
)
|
|
[
Librispeech
](
https://huggingface.co/datasets/librispeech_asr
)
|
`"clean"`
-
`"train.100"`
|
[
facebook/hubert-large-ll60k
](
https://huggingface.co/facebook/hubert-large-ll60k
)
| 0.088 | 8 GPU V100 | 1h30min |
[
here
](
https://huggingface.co/patrickvonplaten/hubert-librispeech-clean-100h-demo-dist
)
|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-large-xlsr-53
](
https://huggingface.co/facebook/wav2vec2-large-xlsr-53
)
| 0.36 | 8 GPU V100 | 18min |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo-dist
)
|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-large-xlsr-53
](
https://huggingface.co/facebook/wav2vec2-large-xlsr-53
)
| 0.36 | 8 GPU V100 | 18min |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo-dist
)
|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-large-xlsr-53
](
https://huggingface.co/facebook/wav2vec2-large-xlsr-53
)
| 0.35 | 1 GPU V100 | 1h20min |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo
)
|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-large-xlsr-53
](
https://huggingface.co/facebook/wav2vec2-large-xlsr-53
)
| 0.35 | 1 GPU V100 | 1h20min |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo
)
|
|
[
TIMIT
](
https://huggingface.co/datasets/timit_asr
)
| - |
[
wav2vec2-base
](
https://huggingface.co/facebook/wav2vec2-base
)
| 0.21 | 1 GPU TITAN RTX | 32min |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-base-timit-fine-tuned
)
|
|
[
TIMIT
](
https://huggingface.co/datasets/timit_asr
)
| - |
[
unispeech-large-1500h-cv
](
https://huggingface.co/microsoft/unispeech-large-1500h-cv
)
| 0.22 | 1 GPU TITAN RTX | 35min |
[
here
](
https://huggingface.co/patrickvonplaten/unispeech-large-1500h-cv-timit
)
|
|
[
TIMIT
](
https://huggingface.co/datasets/timit_asr
)
| - |
[
unispeech-sat-base
](
https://huggingface.co/microsoft/unispeech-sat-base
)
| 0.41 | 1 GPU TITAN RTX | 32min |
[
here
](
https://huggingface.co/patrickvonplaten/unispeech-sat-base-timit-ft
)
|
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment