Scam Alert: We’ve detected unauthorized use of the Defined.ai name.Read the notice

Become a partnerGet in touch
Get in touch
  • Browse Marketplace
  • Data Annotation

    Model-in-the-loop, expert-verified labeling for text, audio, image and video

    Machine Translation

    High-quality multilingual content for global AI systems

    Data Collection

    Global, diverse datasets for AI training at scale

    Conversational AI

    Natural, bias-free voice and chat experiences worldwide

    Data & Model Evaluation

    Rigorous testing to ensure accuracy, fairness and quality

    Accelerat.ai

    Smarter multilingual AI agent support for global businesses


    Industries

Find the right datasets for you

Suggested filters

Healthcareimage

Dataset title

Domain

Type

Locale

Amount

Hindi Spontaneous Dialogue, insurance

206 hours of Hindi simulated call center conversations between an agent and a client, recorded over telephony in the insurance domain.

Insurance
Conversational

hi-IN

206 hours

Hindi Spontaneous Dialogue, banking

199 hours of Hindi simulated call center conversations between an agent and a client, recorded over telephony in the banking domain.

Banking
Conversational

hi-IN

199 hours

Hindi Spontaneous Dialogue, telco

192 hours of Hindi simulated call center conversations between an agent and a client, recorded over telephony in the telco domain.

Telco
Conversational

hi-IN

192 hours

Hindi Spontaneous Dialogue, retail

192 hours of Hindi simulated call center conversations between an agent and a client, recorded over telephony in the retail domain.

Retail
Conversational

hi-IN

192 hours

Hindi IVR, banking

75 hours of Hindi queries between an client and an IVR system, recorded over telephony in the banking domain.

Banking
IVR

hi-IN

75 hours

Hindi IVR, insurance

78 hours of Hindi queries between an client and an IVR system, recorded over telephony in the insurance domain.

Insurance
IVR

hi-IN

78 hours

Hindi IVR, retail

70 hours of Hindi queries between an client and an IVR system, recorded over telephony in the retail domain.

Retail
IVR

hi-IN

70 hours

Hindi IVR, telco

76 hours of Hindi queries between an client and an IVR system, recorded over telephony in the telco domain.

Telco
IVR

hi-IN

76 hours

Hindi Podcasts

336 hours of Hindi simulated podcasts, recorded with studio quality.

Various
Podcast

hi-IN

52.8K hours

Music - Instrumental

6000 tracks in the Instrumental genre, ready for AI training.

Music

NO,

EL,

NE,

la,

si-lk,

brx-in,

nl-NL,

tl-PH,

or-in,

et-ee,

haz-af,

ca-es,

gjr-in,

ro-ro,

tcy-in,

bto-ph,

qaz-ir,

eu-es,

haw-us,

he-IL,

nb-NO,

ml-in,

ZH,

zh-CN,

en-AU,

dhd-in,

wuu-cn,

mni-in,

gl-es,

ahr-in,

pt-PT,

af-za,

hu-hu,

fi-fi,

gon-in,

bn-IN,

LV,

ar-LAV,

ar-SD,

KO,

MS,

AR,

pa-IN,

ta-IN,

te-IN,

es-ES,

it-IT,

en-GB,

fr-MX,

de-DE,

en-US,

en-IN,

fr-FR,

TH,

HE,

SO,

ZU,

TL,

SR,

EN,

DA,

VI,

mr-IN,

hi-IN,

ID,

pl-PL,

kn-IN,

FA,

UR,

fr-CA,

TR,

YUE,

es-MX,

CZ,

es-AR,

JA,

sv-SE,

DE,

RU,

FR,

pt-BR,

es-VE,

ar-MA,

ar-LB,

hi-US,

fr-MA,

ar-TN,

es-US,

ar-JO,

ar-JS,

ar-IQ,

ar-YE,

ar-DZ,

ar-AR,

de-US,

ar-EG,

ja-JP,

ar-SA,

ar-AE,

es-BO,

ar-TR,

ja-US,

es-PE,

ar-Kw,

es-EC,

es-LA,

es-CO,

es-CL,

fr-US,

en-CA,

ko-KR,

da-DK,

ru-RU,

nl-BE,

cs-CZ,

vi-VN,

gu-IN,

ar-MSA,

fa-IR,

en-IE,

is-is,

sk-sk,

lt-lt,

uk-ua,

rmn-ro,

cy-gb,

kxu-in,

sgs-lt,

la-latn

6K

Showing 10 of 26 datasets

Datasets per page

Hindi Spontaneous Dialogue, insurance

Domain:

Insurance
Conversational

Amount:

206 hours

Locale:

hi-IN

Hindi Spontaneous Dialogue, banking

Domain:

Banking
Conversational

Amount:

199 hours

Locale:

hi-IN

Hindi Spontaneous Dialogue, telco

Domain:

Telco
Conversational

Amount:

192 hours

Locale:

hi-IN

Hindi Spontaneous Dialogue, retail

Amount:

192 hours

Locale:

hi-IN

Hindi IVR, banking

Amount:

75 hours

Locale:

hi-IN

Hindi IVR, insurance

Amount:

78 hours

Locale:

hi-IN

Hindi IVR, retail

Amount:

70 hours

Locale:

hi-IN

Hindi IVR, telco

Amount:

76 hours

Locale:

hi-IN

Hindi Podcasts

Domain:

Various
Podcast

Amount:

52.8K hours

Locale:

hi-IN

Music - Instrumental

Domain:

Music

Amount:

6K

Locale:

NO, EL, NE, la, si-lk, brx-in, nl-NL, tl-PH, or-in, et-ee, haz-af, ca-es, gjr-in, ro-ro, tcy-in, bto-ph, qaz-ir, eu-es, haw-us, he-IL, nb-NO, ml-in, ZH, zh-CN, en-AU, dhd-in, wuu-cn, mni-in, gl-es, ahr-in, pt-PT, af-za, hu-hu, fi-fi, gon-in, bn-IN, LV, ar-LAV, ar-SD, KO, MS, AR, pa-IN, ta-IN, te-IN, es-ES, it-IT, en-GB, fr-MX, de-DE, en-US, en-IN, fr-FR, TH, HE, SO, ZU, TL, SR, EN, DA, VI, mr-IN, hi-IN, ID, pl-PL, kn-IN, FA, UR, fr-CA, TR, YUE, es-MX, CZ, es-AR, JA, sv-SE, DE, RU, FR, pt-BR, es-VE, ar-MA, ar-LB, hi-US, fr-MA, ar-TN, es-US, ar-JO, ar-JS, ar-IQ, ar-YE, ar-DZ, ar-AR, de-US, ar-EG, ja-JP, ar-SA, ar-AE, es-BO, ar-TR, ja-US, es-PE, ar-Kw, es-EC, es-LA, es-CO, es-CL, fr-US, en-CA, ko-KR, da-DK, ru-RU, nl-BE, cs-CZ, vi-VN, gu-IN, ar-MSA, fa-IR, en-IE, is-is, sk-sk, lt-lt, uk-ua, rmn-ro, cy-gb, kxu-in, sgs-lt, la-latn

Showing 10 of 26 datasets

1/3

New datasets

Medical Claims Data for AI Model Training

Healthcare

Longitudinal Data in Oncology for AI Model Development

Healthcare

Wearable Health Data for AI Model Training

Healthcare

Couldn’t find the right dataset for you?

Get in touch

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo

Datasets

Marketplace

Solutions

Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier Program
Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier ProgramCCPA Privacy StatementWhistleblowing ChannelCandidate Privacy Statement

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo