Accented English Scripted Monologue

Generic
Scripted Speech
Spanish
Arabic
French
English

About the Dataset

434 hours

This audio dataset contains 434 hours recorded by speakers of French, Arabic, Spanish, and other languages:

  • 74.73 hours of English recorded by native French speakers
  • 50 hours recorded by native Arabic speakers
  • 40 hours recorded by native Spanish speakers
  • 269.8 hours recorded by speakers of other languages

The speakers are presented with a prompt (script) and asked to read it out loud and record. Our clients will receive an audio recording, the prompt and information about the speaker. The audio is recorded on-device, typically in 16kHz 16 bit. We also provide information on which device each record was recorded.

The dataset is covered by Defined.ai's standard license agreement. The license agreement is perpetual and allows for the commercialization of all models built on the data.

Metadata Distribution

Arabic Accent

Scripted_English_Accented_Arabic_Age.png Scripted_English_Accented_Arabic_Gender.png

Samples

Download Free 30-minute Sample

All fields are required

By downloading, installing, accessing, and/or using this data sample, you consent to receive communications from Defined.ai and affirm your acceptance of our Privacy Policy, Terms of Use, and Data License Agreement. Consent can be revoked at your discretion.

You might also be interested in these audio datasets:

Arabic Spontaneous Dialogue

308 hours recorded by speakers from Egypt, Jordan, MSA, and Yemen
Insurance
Retail
Telecommunication
+3

© 2025 DefinedCrowd. All rights reserved.