VoxForge - Persian | Mozilla Data Collective

Voice data contributed by volunteers who read prompts out loud. For ‭فارسی‬ (Persian), there are 16 minutes of recorded speech.

The following is a breakdown of the number of utterances per speaker (at least 4 speakers):

Name	Count
anonymous	127
PCRider	10
Sina	10
spin313	10

Dataset format

The top-level directory contains a number of subdirectories corresponding to speaker/session recorded. Each of these subdirectories is structured as follows:

├── wav/
│   ├── file1.wav
│   ├── file2.wav
│   ├── ...
├── etc/
│   ├── GPL_license.txt  
│   ├── PROMPTS  
│   ├── prompts-original  
│   ├── README

where PROMPTS and prompts-original contain an audio id followed by a space and the prompt text (transcript).

See https://www.voxforge.org/home/about for more details about the project and dataset.

VoxForge - Persian

Description

Specifics

Considerations

Processes

Metadata

Dataset format