2025 in Review: 65% Revenue Growth & 1,200% Marketplace Expansion— Get the Full Story!

Become a partnerGet in touch
Get in touch
  • Browse Marketplace
  • Data Annotation

    Human-led labeling for text, audio, image and video

    Machine Translation

    High-quality multilingual content for global AI systems

    Data Collection

    Global, diverse datasets for AI training at scale

    Conversational AI

    Natural, bias-free voice and chat experiences worldwide

    Data & Model Evaluation

    Rigorous testing to ensure accuracy, fairness and quality

    Accelerat.ai

    Smarter multilingual AI agent support for global businesses


    Industries

Find the right datasets for you

Suggested filters

Healthcareimage

Dataset title

Domain

Type

Locale

Amount

Italian books

42K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

it-IT

42.6K

Italian Spontaneous Dialogue, insurance

600 hours of Italian simulated call center conversations between an agent and a client, recorded over telephony in the insurance domain.

Insurance

it-IT

600 hours

Italian Spontaneous Dialogue, retail

350 hours of Italian simulated call center conversations between an agent and a client, recorded over telephony in the retail domain.

Retail

it-IT

350 hours

Italian Spontaneous Dialogue, banking

496 hours of Italian simulated call center conversations between an agent and a client, recorded over telephony in the banking domain.

Banking

it-IT

496 hours

Italian Spontaneous Dialogue, telco

351 hours of Italian simulated call center conversations between an agent and a client, recorded over telephony in the telco domain.

Telco

it-IT

351 hours

Italian IVR, retail

53 hours of Italian queries between an client and an IVR system, recorded over telephony in the retail domain.

Retail

it-IT

53 hours

Italian IVR, insurance

47 hours of Italian queries between an client and an IVR system, recorded over telephony in the insurance domain.

Insurance

it-IT

47 hours

Italian IVR, banking

46 hours of Italian queries between an client and an IVR system, recorded over telephony in the banking domain.

Banking

it-IT

46 hours

Italian IVR, telco

44 hours of Italian queries between an client and an IVR system, recorded over telephony in the telco domain.

Telco

it-IT

44 hours

Italian Podcasts

293 hours of Italian simulated podcasts, recorded with studio quality.

Various
General

it-IT

22.3K hours

Showing 10 of 21 datasets

Datasets per page

Italian books

Domain:

Various
General

Amount:

42.6K

Locale:

it-IT

Italian Spontaneous Dialogue, insurance

Amount:

600 hours

Locale:

it-IT

Italian Spontaneous Dialogue, retail

Domain:

Retail

Amount:

350 hours

Locale:

it-IT

Italian Spontaneous Dialogue, banking

Domain:

Banking

Amount:

496 hours

Locale:

it-IT

Italian Spontaneous Dialogue, telco

Amount:

351 hours

Locale:

it-IT

Italian IVR, retail

Amount:

53 hours

Locale:

it-IT

Italian IVR, insurance

Amount:

47 hours

Locale:

it-IT

Italian IVR, banking

Amount:

46 hours

Locale:

it-IT

Italian IVR, telco

Amount:

44 hours

Locale:

it-IT

Italian Podcasts

Amount:

22.3K hours

Locale:

it-IT

Showing 10 of 21 datasets

1/3

New datasets

Medical Claims Data for AI Model Training

Healthcare

Longitudinal Data in Oncology for AI Model Development

Healthcare

Wearable Health Data for AI Model Training

Healthcare

Couldn’t find the right dataset for you?

Get in touch

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo

Datasets

Marketplace

Solutions

Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier Program
Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier ProgramCCPA Privacy StatementWhistleblowing ChannelCandidate Privacy Statement

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo