chenpangpang / transformers / Commits / e8968bd0
Unverified
Commit e8968bd0 authored Mar 19, 2021 by Patrick von Platen
Committed by GitHub Mar 19, 2021
[XLSR-Wav2Vec2 Info doc] Add a couple of lines (#10806)
* finish * fix * fix * fix * fix
parent 117dba99
Showing 1 changed file with 13 additions and 9 deletions
examples/research_projects/wav2vec2/FINE_TUNE_XLSR_WAV2VEC2.md
...
...
@@ -25,12 +25,13 @@ It is very much possible that prizes will be given to groups of people instead o
- [Google colab setup](#google-colab-setup)
- [Local machine](#local-machine)
- [How to upload my trained checkpoint](#how-to-upload-my-trained-checkpoint)
- [How to create the README](#how-to-create-the-README)
- [How to create the README](#how-to-create-the-readme)
- [How to evaluate my trained checkpoint](#how-to-evaluate-my-trained-checkpoint)
- [Rules of training and evaluation](#rules-of-training-and-evaluation)
- [Tips and tricks for training](#tips-and-tricks-for-training)
- [Tips and tricks](#tips-and-tricks)
- [How to combine multiple datasests into one](#how-to-combine-multiple-datasets-into-one)
- [How to effectively preprocess the data](#how-to-effectively-preprocess-the-data)
- [How to efficiently preproces the data](#how-to-do-efficiently-load-datasets-with-limited-ram-and-hard-drive-space)
- [How to do hyperparameter tuning](#how-to-do-hyperparameter-tuning)
- [How to preprocess and evaluate character based languages](#how-to-preprocess-and-evaluate-character-based-languages)
- [Further reading material](#further-reading-material)
...
...
@@ -284,7 +285,7 @@ result = test_dataset.map(evaluate, batched=True, batch_size=8)
print("WER: {:2f}".format(100 * wer.compute(predictions=result["pred_strings"], references=result["sentence"])))
```
**Result**: XX.XX % # TODO: write output of print here
**Test Result**: XX.XX % # TODO: write output of print here
## Training
...
...
@@ -325,21 +326,24 @@ done, *e.g.* [here](https://discuss.huggingface.co/t/spanish-asr-fine-tuning-wav
## Tips and tricks
TODO...
This section summarizes a couple of tips and tricks across various topics. It will continously be updated during the week.
### How to combine multiple datasets into one
Check out [this](https://discuss.huggingface.co/t/how-to-combine-local-data-files-with-an-official-dataset/4685) post.
### How to effectively preprocess the data
### How to do hyperparameter turing for my language
### How to do efficiently load datasets with limited ram and hard drive space
Check out [this](https://discuss.huggingface.co/t/german-asr-fine-tuning-wav2vec2/4558/8?u=patrickvonplaten) post.
### How to preprocess and evaluate character based languages
### How to do hyperparameter tuning
### How to do lazy data loading
### How to preprocess and evaluate character based languages
## Further reading material
...
...