Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenpangpang
transformers
Commits
efea0f86
Unverified
Commit
efea0f86
authored
Nov 18, 2021
by
Patrick von Platen
Committed by
GitHub
Nov 18, 2021
Browse files
[Speech Recognition] More examples
Add more XLS-R training runs to the official examples
parent
72a6bf33
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
10 additions
and
0 deletions
+10
-0
examples/pytorch/speech-recognition/README.md
examples/pytorch/speech-recognition/README.md
+10
-0
No files found.
examples/pytorch/speech-recognition/README.md
View file @
efea0f86
...
@@ -142,4 +142,14 @@ they can serve as a baseline to improve upon.
...
@@ -142,4 +142,14 @@ they can serve as a baseline to improve upon.
| Dataset | Dataset Config | Pretrained Model | Word error rate on eval | GPU setup | Training time | Fine-tuned Model & Logs | Command to reproduce |
| Dataset | Dataset Config | Pretrained Model | Word error rate on eval | GPU setup | Training time | Fine-tuned Model & Logs | Command to reproduce |
|-------|------------------------------|-------------|---------------|---------------|----------------------|-------------| -------------|
|-------|------------------------------|-------------|---------------|---------------|----------------------|-------------| -------------|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-large-xlsr-53
](
https://huggingface.co/facebook/wav2vec2-large-xlsr-53
)
| 0.36 | 8 GPU V100 | 18min |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo-dist
)
|
[
run.sh
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo-dist/blob/main/run_dist.sh
)
|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-large-xlsr-53
](
https://huggingface.co/facebook/wav2vec2-large-xlsr-53
)
| 0.36 | 8 GPU V100 | 18min |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo-dist
)
|
[
run.sh
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo-dist/blob/main/run_dist.sh
)
|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-large-xlsr-53
](
https://huggingface.co/facebook/wav2vec2-large-xlsr-53
)
| 0.31 | 8 GPU V100 | 1h05 |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-large-xlsr-53-common_voice-tr-ft
)
|
[
run.sh
](
https://huggingface.co/patrickvonplaten/wav2vec2-large-xlsr-53-common_voice-tr-ft/blob/main/run.sh
)
|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-large-xlsr-53
](
https://huggingface.co/facebook/wav2vec2-large-xlsr-53
)
| 0.35 | 1 GPU V100 | 1h20min |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo
)
|
[
run.sh
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo/blob/main/run.sh
)
|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-large-xlsr-53
](
https://huggingface.co/facebook/wav2vec2-large-xlsr-53
)
| 0.35 | 1 GPU V100 | 1h20min |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo
)
|
[
run.sh
](
https://huggingface.co/patrickvonplaten/wav2vec2-common_voice-tr-demo/blob/main/run.sh
)
|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-xls-r-300m
](
https://huggingface.co/facebook/wav2vec2-xls-r-300m
)
| 0.31 | 8 GPU V100 | 1h05 |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-large-xls-r-300m-common_voice-tr-ft
)
|
[
run.sh
](
https://huggingface.co/patrickvonplaten/wav2vec2-large-xls-r-300m-common_voice-tr-ft/blob/main/run.sh
)
|
|
[
Common Voice
](
https://huggingface.co/datasets/common_voice
)
|
`"tr"`
|
[
facebook/wav2vec2-xls-r-1b
](
https://huggingface.co/facebook/wav2vec2-xls-r-1b
)
| 0.21 | 2 GPU Titan 24 GB RAM | 15h10 |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-xls-r-1b-common_voice-tr-ft
)
|
[
run.sh
](
https://huggingface.co/patrickvonplaten/wav2vec2-large-xls-r-1b-common_voice-tr-ft/blob/main/run.sh
)
|
-
[
Multilingual Librispeech
](
https://huggingface.co/datasets/multilingual_librispeech
)
| Dataset | Dataset Config | Pretrained Model | Word error rate on eval | GPU setup | Training time | Fine-tuned Model & Logs | Command to reproduce |
|-------|------------------------------|-------------|---------------|---------------|----------------------|-------------| -------------|
|
[
Multilingual Librispeech
](
https://huggingface.co/datasets/multilingual_librispeech
)
|
`"german"`
|
[
facebook/wav2vec2-large-xlsr-53
](
https://huggingface.co/facebook/wav2vec2-large-xlsr-53
)
| 0.13 | 1 GPU Titan 24 GB RAM | 15h04 |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-xlsr-53-300m-mls-german-ft
)
|
[
run.sh
](
https://huggingface.co/patrickvonplaten/wav2vec2-xlsr-53-300m-mls-german-ft/blob/main/run.sh
)
|
|
[
Multilingual Librispeech
](
https://huggingface.co/datasets/multilingual_librispeech
)
|
`"german"`
|
[
facebook/wav2vec2-xls-r-300m
](
https://huggingface.co/facebook/wav2vec2-xls-r-300m
)
| 0.15 | 1 GPU Titan 24 GB RAM | 15h04 |
[
here
](
https://huggingface.co/patrickvonplaten/wav2vec2-300m-mls-german-ft
)
|
[
run.sh
](
https://huggingface.co/patrickvonplaten/wav2vec2-300m-mls-german-ft/blob/main/run.sh
)
|
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment