"Mid-atlantic united states english,philadelphia, pennsylvania, united states english,united states english,philadelphia style united states english":"American",
"Mid-atlantic,england english,united states english":"American",
"Midatlantic,england english":"American",
"Midwestern states (michigan),united states english":"American",
"Mild northern england english":"English",
"Minor french accent":"French",
"Mix of american and british ,native polish":"Polish",
"Mix of american and british accent":"Unknown",# Combination not clearly mapped
"Mostly american with some british and australian inflections":"Unknown",# Combination not clearly mapped
"My accent is influenced by the phones of all letters within a sentence.,southern african (south africa, zimbabwe, namibia)":"South african",
"New zealand english":"New Zealand English",
"Nigeria english":"Nigerian",# Note: added new
"Non native speaker from france":"French",
"Non-native":"Unknown",# Too vague
"Non-native,german accent":"German",
"North european english":"Unknown",# Too broad
"Norwegian":"Norwegian",# Note: added new
"Ontario,canadian english":"Canadian",# Note: added new
"Polish english":"Polish",
"Rhode island new england accent":"American",
"Singaporean english":"Singaporean",# Note: added new
"Slavic":"Eastern european",
"Slighty southern affected by decades in the midwest, 4 years in spain and germany, speak some german, spanish, polish. have lived in nine states.":"Unknown",# Complex blend
"South african":"South african",
"South atlantic (falkland islands, saint helena)":"Unknown",# Specific regions not listed
"South australia":"Australian",
"South indian":"Indian",
"Southern drawl":"American",
"Southern texas accent,united states english":"American",
"Southern united states,united states english":"American",
"Spanish bilingual":"Spanish",
"Spanish,foreign,non-native":"Spanish",
"Strong latvian accent":"Latvian",
"Swedish accent":"Swedish",# Note: added new
"Transnational englishes blend":"Unknown",# Too vague
"U.k. english":"English",
"Very slight russian accent,standard american english,boston influence":"American",
"Welsh english":"Welsh",
"West african":"Unknown",# No specific West African category
"West indian":"Unknown",# Caribbean, but no specific match
PROMPT=""" We have seven keywords that describe different attributes of an audio sample spoken by a given speaker: the speaker's gender, the speaker's accent, the amount of reverberation in the sample (high or low reverberation), the amount of noise in the sample (how clear or noisy), how monotone or animated the sample is, the speaker's pitch (high or low voice), the speaker's speed (how fast or slow the speaker is speaking).
Given these keywords, form a coherent sentence that summarises the seven attributes in a meaningful way. You can change the order of the keywords in the sentence and use common synonyms for these words, provided that the sentence summarises the attributes clearly. Keep the sentence simple - don't introduce additional information other than the keywords provided. Only return the generated sentence, not any other assistant remarks.
For example, given the following descriptors: 'female', 'Hungarian', 'slightly roomy sounding', 'fairly noisy', 'quite monotone', 'fairly low pitch', 'very slowly', a valid sentence would be: 'a woman with a deep voice speaking slowly and somewhat monotonously with a Hungarian accent in an echoey room with background noise'. Note how the seven attributes have been combined together in a simple sentence, with the ordering changed but no additional information added.
For the descriptors: {gender}, {accent}, {reverberation}, {noise}, {speech_monotony}, {pitch}, {speaking_rate}, the corresponding sentence is:"""

SUBSET_PROMPT=""" We have six keywords that describe different attributes of an audio sample spoken by a given speaker: the speaker's gender, the amount of reverberation in the sample (high or low reverberation), the amount of noise in the sample (how clear or noisy), how monotone or animated the sample is, the speaker's pitch (high or low voice), the speaker's speed (how fast or slow the speaker is speaking).
Given these keywords, form a coherent sentence that summarises the six attributes in a meaningful way. You can change the order of the keywords in the sentence and use common synonyms for these words, provided that the sentence summarises the attributes clearly. Keep the sentence simple - don't introduce additional information other than the keywords provided. Only return the generated sentence, not any other assistant remarks.
For example, given the following descriptors: 'female', 'slightly roomy sounding', 'fairly noisy', 'quite monotone', 'fairly low pitch', 'very slowly', a valid sentence would be: 'a woman with a deep voice speaking slowly and somewhat monotonously in an echoey room with background noise'. Note how the six attributes have been combined together in a simple sentence, with the ordering changed but no additional information added.
For the descriptors: {gender}, {reverberation}, {noise}, {speech_monotony}, {pitch}, {speaking_rate}, the corresponding sentence is:"""

PROMPT=(
"You will be given six descriptive keywords related to an audio sample of a person's speech. These keywords include:\n"
"1. The gender (e.g., male, female)\n"
"2. The level of reverberation (e.g., very roomy sounding, quite roomy sounding, slightly roomy sounding, moderate reverberation, slightly confined sounding, quite confined sounding, very confined sounding)\n"
"3. The amount of noise in the sample (e.g., very noisy, quite noisy, slightly noisy, moderate ambient sound, slightly clear, quite clear, very clear)\n"
"4. The tone of the speaker's voice (e.g., very monotone, quite monotone, slightly monotone, moderate intonation, slightly expressive, quite expressive, very expressive)\n"
"5. The pace of the speaker's delivery (e.g., very slowly, quite slowly, slightly slowly, moderate speed, slightly fast, quite fast, very fast)\n"
"6. The pitch of the speaker's voice (e.g., very low pitch, quite low pitch, slightly low pitch, moderate pitch, slightly high pitch, quite high pitch, very high pitch)\n"
"Your task is to create a text description using these keywords that accurately describes the speech sample while ensuring the description remains grammatically correct and easy to understand. You can rearrange the keyword order as necessary, and substitute synonymous terms where appropriate. If the amount of noise is 'very noisy' and the level of reverberation is 'very roomy sounding', include the term 'very bad recording' in the description. Likewise, if the amount of noise is 'very clear' and the level of reverberation is 'very confined sounding', include the term 'very good recording' in the description. Otherwise, do not add extra details beyond what has been provided, and only return the generated description.\n"
"For example, given the following keywords: 'female', 'slightly roomy sounding', 'slightly noisy', 'quite monotone', 'slightly low pitch', 'very slowly', a valid description would be: 'a woman with a deep voice speaking slowly and somewhat monotonously in an echoey room with background noise'.\n"
"For the keywords: '[gender]', '[reverberation]', '[noise]', '[speech_monotony]', '[pitch]', '[speaking_rate]', the corresponding description is:"