## About the Dataset
### 102 hours

This audio dataset contains 102 hours of Japanese speech data in the generic domain.

Speakers are presented with a prompt (script) and asked to read it aloud while recording themselves. Clients receive the audio recording, the prompt, and information about the speaker. The audio is recorded on-device, typically at 16 kHz, 16-bit. We also provide information on the device used for each recording.
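Because the recordings are only *typically* 16 kHz, 16-bit, it can be worth checking each file's header before feeding it to a pipeline that assumes a fixed format. The following is a minimal sketch using Python's standard `wave` module. The helper names `describe_wav`, `matches_typical_format`, and `make_silent_wav` are hypothetical and not part of any Defined.ai tooling, and the sketch assumes the audio files are uncompressed PCM WAV.

```python
import io
import wave


def describe_wav(source):
    """Return basic format facts for a PCM WAV file.

    `source` may be a file path or a binary file-like object.
    """
    with wave.open(source, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "sample_rate_hz": rate,
            "bit_depth": wav.getsampwidth() * 8,  # bytes per sample -> bits
            "channels": wav.getnchannels(),
            "duration_s": frames / rate,
        }


def matches_typical_format(info, sample_rate_hz=16000, bit_depth=16):
    """Check a file against the typical format stated for this dataset."""
    return (
        info["sample_rate_hz"] == sample_rate_hz
        and info["bit_depth"] == bit_depth
    )


def make_silent_wav(seconds=1, sample_rate_hz=16000):
    """Build an in-memory mono 16-bit WAV of silence (for demonstration only)."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes = 16 bits
        wav.setframerate(sample_rate_hz)
        wav.writeframes(b"\x00\x00" * sample_rate_hz * seconds)
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    # Synthetic stand-in; replace with a path to a real file from the dataset.
    info = describe_wav(make_silent_wav(seconds=2))
    print(info, matches_typical_format(info))
```

Summing `duration_s` across all files is also a simple way to confirm the total hours delivered.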

The dataset is covered by [Defined.ai's standard license agreement](https://www.defined.ai/dataset/data-license-agreement). The license agreement is perpetual and allows for the commercialization of all models built on the data.

### Metadata Distribution

![Scripted_Japanese_Age.png](https://prdstrapimediastorage.blob.core.windows.net/prdstrapimediastorage/assets/Scripted_Japanese_Age_2b6944514f.png)
![Scripted_Japanese_Gender.png](https://prdstrapimediastorage.blob.core.windows.net/prdstrapimediastorage/assets/Scripted_Japanese_Gender_544dd602e6.png)

### Samples
[Single file sample](https://defineddata.blob.core.windows.net/samples/DDM_ja-jp_single-scripted_generic_30m_v01_Sample/Audio/192602942.wav?se=2024-06-15T16%3A08%3A40Z&sp=r&sv=2020-06-12&ss=b&srt=o&sig=KqPr2WL6vqVVHAZgrfal%2BkUyX2i8KFsE%2BxBfDJSLGuo%3D). A transcription of the sample is also [available](https://prdstrapimediastorage.blob.core.windows.net/prdstrapimediastorage/assets/Scripted_Japanese_Japan_Generic_Short_Transcription_eed49d002b.tsv).

[DDM_ja-jp_single-scripted_generic_30m_v01_Sample.zip](https://defineddata.blob.core.windows.net/samples/DDM_ja-jp_single-scripted_generic_30m_v01_Sample/DDM_ja-jp_single-scripted_generic_30m_v01_Sample.zip?se=2024-06-15T16%3A08%3A40Z&sp=r&sv=2020-06-12&ss=b&srt=o&sig=KqPr2WL6vqVVHAZgrfal%2BkUyX2i8KFsE%2BxBfDJSLGuo%3D)

