Accented English Scripted Monologue

Scripted Speech

About the Dataset

434 hours

This audio dataset contains about 434 hours of accented English, recorded by native speakers of French, Arabic, Spanish, and other languages:

  • 74.73 hours of English recorded by native French speakers
  • 50 hours recorded by native Arabic speakers
  • 40 hours recorded by native Spanish speakers
  • 269.8 hours recorded by speakers of other languages

Speakers are presented with a prompt (script) and asked to read it aloud while recording. Clients receive the audio recording, the prompt, and information about the speaker. The audio is recorded on-device, typically at 16 kHz, 16-bit. We also provide information on which device each recording was made with.
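Because the audio is described as "typically" 16 kHz, 16-bit, it can help to check each delivered file against that format before training. The following is a minimal sketch, assuming the recordings are delivered as uncompressed WAV files (the page does not state the container format). The function names are hypothetical, and only Python's standard `wave` module is used.

```python
import wave


def matches_expected_format(path, expected_rate_hz=16000, expected_bits=16):
    """Return True if the WAV file has the expected sample rate and bit depth.

    Assumption: the file is an uncompressed PCM WAV, which the `wave`
    module can read. Other containers would need a different reader.
    """
    with wave.open(path, "rb") as wav:
        rate_hz = wav.getframerate()
        bits = wav.getsampwidth() * 8  # getsampwidth() returns bytes per sample
    return rate_hz == expected_rate_hz and bits == expected_bits


def duration_seconds(path):
    """Return the duration of the WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

Files for which `matches_expected_format` returns False could be resampled or set aside, and summing `duration_seconds` over a delivery gives a rough check against the stated hour counts.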

The dataset is covered by our standard license agreement. The license is perpetual and allows commercialization of all models built on the data.

Metadata Distribution

Arabic Accent

[Figures: age distribution and gender distribution of the Arabic-accented speakers]




© 2023 DefinedCrowd. All rights reserved.