Accented English Scripted Monologue
Generic
Scripted Speech
Spanish
Arabic
French
English
About the Dataset
434 hours
This audio dataset contains 434 hours of English speech recorded by native speakers of French, Arabic, Spanish, and other languages:
- 74.73 hours of English recorded by native French speakers
- 50 hours recorded by native Arabic speakers
- 40 hours recorded by native Spanish speakers
- 269.8 hours recorded by speakers of other languages
Speakers are presented with a prompt (script) and asked to read it aloud while recording. Clients receive the audio recording, the prompt, and information about the speaker. The audio is recorded on-device, typically at 16 kHz, 16-bit. We also provide information on the device used for each recording.
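As a rough illustration of how a delivered recording could be checked against the typical 16 kHz, 16-bit format described above, here is a minimal Python sketch. It assumes the audio arrives as uncompressed WAV files, which this page does not state; the function names `describe_wav` and `matches_typical_format` are hypothetical helpers, not part of any Defined.ai tooling.

```python
import wave

# Typical format stated in the dataset description.
EXPECTED_RATE_HZ = 16_000      # 16 kHz sample rate
EXPECTED_BITS_PER_SAMPLE = 16  # 16-bit samples


def describe_wav(path):
    """Return (sample_rate_hz, bits_per_sample, channels, duration_seconds).

    Assumes `path` points to an uncompressed PCM WAV file (an assumption;
    the delivery format is not specified in the description).
    """
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        bits = wav.getsampwidth() * 8  # sample width is reported in bytes
        channels = wav.getnchannels()
        duration = wav.getnframes() / rate
    return rate, bits, channels, duration


def matches_typical_format(path):
    """True if the file uses the typical 16 kHz, 16-bit format."""
    rate, bits, _, _ = describe_wav(path)
    return rate == EXPECTED_RATE_HZ and bits == EXPECTED_BITS_PER_SAMPLE
```

Because the description says recordings are "typically" in this format, a check like this is better suited to flagging files for resampling than to rejecting them outright.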
The dataset is covered by Defined.ai's standard license agreement. The license agreement is perpetual and allows for the commercialization of all models built on the data.
Metadata Distribution
Arabic Accent
Samples
- 5-minute sample: English Global Accents. A transcription of the sample is also available.
- 5-minute sample: English French Accented. A transcription of the sample is also available.
- 5-minute sample: English Arabic. A transcription of the sample is also available.