2025 in Review: 65% Revenue Growth & 1,200% Marketplace Expansion— Get the Full Story!

Become a partnerGet in touch
Get in touch
  • Browse Marketplace
  • Data Annotation

    Human-led labeling for text, audio, image and video

    Machine Translation

    High-quality multilingual content for global AI systems

    Data Collection

    Global, diverse datasets for AI training at scale

    Conversational AI

    Natural, bias-free voice and chat experiences worldwide

    Data & Model Evaluation

    Rigorous testing to ensure accuracy, fairness and quality

    Accelerat.ai

    Smarter multilingual AI agent support for global businesses


    Industries

Find the right datasets for you

Suggested filters

Healthcareimage

Dataset title

Domain

Type

Locale

Amount

Filipino accented English Podcasts

10000 hours of Filipino accented English live, recorded by real podcasters in our partner network.

General

EN

10K hours

Japanese accented English Podcasts

10000 hours of Japanese accented English live, non-simulated podcasts, recorded by real podcasters in our partner network.

General

EN

10K hours

Chinese accented English Podcasts

10000 hours of Chinese accented English live, non-simulated podcasts, recorded by real podcasters in our partner network.

General

EN

10K hours

Malasian accented English Podcasts

10000 hours of Malasian accented English live, non-simulated podcasts, recorded by real podcasters in our partner network.

General

EN

10K hours

Thai accented English Podcasts

10000 hours of Thai accented English live, non-simulated podcasts, recorded by real podcasters in our partner network.

General

TH,

EN

10K hours

Indian English Podcasts

250 hours of Indian English live, non-simulated podcasts, recorded by real podcasters in our partner network.

Narration

EN,

en-IN

250 hours

Emirati Arabic Podcasts

1 hours of Emirati Arabic live, non-simulated podcasts, recorded by real podcasters in our partner network.

General

AR,

ar-AE

1 hours

Saudi Arabic Podcasts

3 hours of Moroccan Arabicsa live, non-simulated podcasts, recorded by real podcasters in our partner network.

AR

3 hours

Sudanian Arabic Podcasts

7 hours of Moroccan Arabicsd live, non-simulated podcasts, recorded by real podcasters in our partner network.

ar-SD,

AR

7 hours

Modern Standard Arabic Podcasts

51 hours of Modern Standard Arabic live, non-simulated podcasts, recorded by real podcasters in our partner network.

General

MS,

AR,

ar-MSA

116 hours

Showing 10 of 152 datasets

...

Datasets per page

Filipino accented English Podcasts

Domain:

General

Amount:

10K hours

Locale:

EN

Japanese accented English Podcasts

Amount:

10K hours

Locale:

EN

Chinese accented English Podcasts

Amount:

10K hours

Locale:

EN

Malasian accented English Podcasts

Amount:

10K hours

Locale:

EN

Thai accented English Podcasts

Amount:

10K hours

Locale:

TH, EN

Indian English Podcasts

Domain:

Narration

Amount:

250 hours

Locale:

EN, en-IN

Emirati Arabic Podcasts

Amount:

1 hours

Locale:

AR, ar-AE

Saudi Arabic Podcasts

Domain:

Amount:

3 hours

Locale:

AR

Sudanian Arabic Podcasts

Domain:

Amount:

7 hours

Locale:

ar-SD, AR

Modern Standard Arabic Podcasts

Amount:

116 hours

Locale:

MS, AR, ar-MSA

Showing 10 of 152 datasets

1/16

New datasets

Medical Claims Data for AI Model Training

Healthcare

Longitudinal Data in Oncology for AI Model Development

Healthcare

Wearable Health Data for AI Model Training

Healthcare

Couldn’t find the right dataset for you?

Get in touch

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo

Datasets

Marketplace

Solutions

Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier Program
Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier ProgramCCPA Privacy StatementWhistleblowing ChannelCandidate Privacy Statement

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo