chenpangpang
parler-tts
Commit 92ad4bd8, authored Feb 23, 2024 by sanchit-gandhi
concat with norm labels
Parent: 74688124
Showing 1 changed file with 12 additions and 2 deletions
dataset_concatenation_scripts/run_dataset_concatenation.sh
 #!/usr/bin/env bash
 python run_dataset_concatenation.py \
-  --dataset_name "sanchit-gandhi/vctk+facebook/voxpopuli+sanchit-gandhi/edacc" \
+  --dataset_name "sanchit-gandhi/vctk+facebook/voxpopuli+sanchit-gandhi/edacc-normalized" \
   --dataset_config_name "default+en_accented+default" \
   --dataset_split_name "train+test+validation" \
   --label_column_name "accent+accent+accent" \
   --text_column_name "text+normalized_text+text" \
   --speaker_column_name "speaker_id+speaker_id+speaker" \
-  --batch_size 250 \
+  --batch_size 500 \
   --output_dir "./concatenated-dataset"
+
+python run_dataset_concatenation.py \
+  --dataset_name "sanchit-gandhi/edacc-normalized" \
+  --dataset_config_name "default" \
+  --dataset_split_name "test" \
+  --label_column_name "accent" \
+  --text_column_name "text" \
+  --speaker_column_name "speaker" \
+  --batch_size 500 \
+  --output_dir "./concatenated-dataset-test"
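The first command in the script passes several `+`-separated arguments that appear to line up position by position, one entry per dataset (for example, the second dataset would pair `facebook/voxpopuli` with config `en_accented`, split `test`, and text column `normalized_text`). That pairing is an assumption inferred from the argument values, not something the commit states. Under that assumption, a minimal pre-flight sketch can confirm that every argument has the same number of entries before launching a long run. The helper names `count_fields` and `check_aligned` are hypothetical and not part of the repository.

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight check (not part of the commit): verify that every
# "+"-separated argument lists the same number of entries.

# Print the number of "+"-separated fields in the given string.
count_fields() {
  local IFS='+'
  local -a parts
  read -r -a parts <<< "$1"
  echo "${#parts[@]}"
}

# Return 0 if all arguments have the same field count as the first, else 1.
check_aligned() {
  local expected arg
  expected=$(count_fields "$1")
  for arg in "$@"; do
    if [ "$(count_fields "$arg")" -ne "$expected" ]; then
      echo "mismatch: '$arg' does not have $expected entries" >&2
      return 1
    fi
  done
  return 0
}

# Values copied from the first command in the script.
check_aligned \
  "sanchit-gandhi/vctk+facebook/voxpopuli+sanchit-gandhi/edacc-normalized" \
  "default+en_accented+default" \
  "train+test+validation" \
  "accent+accent+accent" \
  "text+normalized_text+text" \
  "speaker_id+speaker_id+speaker" \
  && echo "all arguments aligned"
```

Running the check before the Python command catches a dropped or extra `+` entry early, which is the kind of slip that is easy to make when adding a dataset to every argument by hand.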