Portuguese Scripted Monologue

Generic
Portuguese
Banking
Insurance
Retail
Telecommunication
Scripted Speech

About the Dataset

1091 hours

This audio dataset contains 720 hours of Portuguese speech data recorded by native Portuguese speakers from Portugal, and 371 hours recorded by speakers from Brazil.

The Portuguese (Portugal) subset consists of 720 hours of scripted monologue in the generic domain.

The Portuguese (Brazil) subset consists of 371 hours of scripted monologue, distributed across domains as follows:

  • 56.58 hours of Banking
  • 150.52 hours of Generic domain
  • 69.52 hours of Insurance
  • 43.12 hours of Retail
  • 51.92 hours of Telecommunication
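
As a quick sanity check on the figures listed above, the following minimal sketch totals the Brazilian domain hours and reports each domain's share. The dictionary and function names are illustrative only and are not part of any Defined.ai tooling.

```python
# Hours per domain for the Portuguese (Brazil) subset, copied from the list above.
BRAZIL_DOMAIN_HOURS = {
    "Banking": 56.58,
    "Generic": 150.52,
    "Insurance": 69.52,
    "Retail": 43.12,
    "Telecommunication": 51.92,
}


def domain_shares(hours_by_domain):
    """Return each domain's share of the total hours, as a percentage."""
    total = sum(hours_by_domain.values())
    return {name: 100.0 * hours / total for name, hours in hours_by_domain.items()}


total_hours = sum(BRAZIL_DOMAIN_HOURS.values())
shares = domain_shares(BRAZIL_DOMAIN_HOURS)

print(f"Total: {total_hours:.2f} hours")
# List domains from largest to smallest share.
for name, pct in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<18} {pct:5.1f}%")
```

The listed figures sum to 371.66 hours, consistent with the stated 371 hours, and the generic domain accounts for roughly 40% of the Brazilian subset.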

Speakers are presented with a prompt (script) and asked to read it aloud while recording. Clients receive the audio recording, the prompt, and information about the speaker. The audio is recorded on-device, typically at 16 kHz, 16-bit. We also provide information on the device used for each recording.
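
For storage planning, the typical recording format translates into a rough size estimate. The sketch below assumes uncompressed PCM, a single (mono) channel, and every recording at 16 kHz, 16-bit. The listing states only the sample rate and bit depth, and only as typical, so the channel count, the encoding, and the function name are assumptions made for illustration.

```python
SAMPLE_RATE_HZ = 16_000   # from the listing ("typically" 16 kHz)
BITS_PER_SAMPLE = 16      # from the listing ("typically" 16-bit)
CHANNELS = 1              # assumption: mono; the listing does not state this


def pcm_bytes(hours, sample_rate_hz=SAMPLE_RATE_HZ,
              bits_per_sample=BITS_PER_SAMPLE, channels=CHANNELS):
    """Estimate raw PCM payload size in bytes, ignoring container headers."""
    seconds = hours * 3600
    bytes_per_sample = bits_per_sample // 8
    return int(seconds * sample_rate_hz * bytes_per_sample * channels)


per_hour = pcm_bytes(1)
full_dataset = pcm_bytes(1091)

print(f"Per hour:     {per_hour / 1e6:.1f} MB")
print(f"Full dataset: {full_dataset / 1e9:.2f} GB")
```

Under these assumptions one hour of audio is about 115.2 MB and the full 1091 hours about 125.68 GB (decimal units). Actual delivered sizes may differ if files are compressed or recorded in another format.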

The dataset is covered by Defined.ai's standard license agreement. The license agreement is perpetual and allows for the commercialization of all models built on the data.

Metadata Distribution

Portuguese (Portugal)

[Figures: speaker gender distribution and speaker age distribution, Portuguese (Portugal)]

Portuguese (Brazil)

[Figures: speaker gender distribution and speaker age distribution, Portuguese (Brazil)]

© 2024 DefinedCrowd. All rights reserved.