LibriVox Italian TTS Female Voice

License icon

License:

CC0-1.0

Shield icon

Steward:

MDC Curators

Task: TTS

Release Date: 4/16/2026

Format: MP3, TSV

Size: 61.74 MB


Share

Description

4 hours of sentence-aligned speech/text from "Le avventure di Pinocchio" by Carlo Collodi, on LibriVox, containing 2,175 utterances and 41,642 tokens.

Specifics

Licensing

Creative Commons Zero v1.0 Universal (CC0-1.0)

https://spdx.org/licenses/CC0-1.0.html

Considerations

Restrictions/Special Constraints

n/a

Forbidden Usage

You agree not to attempt to determine the identity of the speaker in this dataset.

Processes

Intended Use

Training neural TTS acoustic models Fine-tuning pre-trained multilingual TTS models for Italian Benchmarking Italian speech synthesis quality Linguistic research on Italian prosody and phonetics

Metadata

Datasheet: Le Avventure di Pinocchio — Italian TTS Dataset

Dataset Overview

LanguageItalian (it)
Source TextLe Avventure di Pinocchio by Carlo Collodi
Source AudioLibriVox public domain recording (https://librivox.org/le-avventure-di-pinocchio-by-c-collodi/)
SpeakerSingle female speaker
AlignmentSentence-level
LicenseCC-0

The Source Text

Title: Le Avventure di Pinocchio (English: The Adventures of Pinocchio)

Author: Carlo Collodi (1826 - 1890)

First Published: 1883

The text is in the public domain worldwide, as the author died in 1890 and more than 70 years have elapsed since his death.

The Source Audio

The audio was sourced from LibriVox, a volunteer-driven project founded in 2005 with the goal of recording all books in the public domain and making them freely available as audiobooks. All recordings are released into the public domain under the LibriVox license, meaning they may be freely used, distributed, and adapted for any purpose, including the creation of speech datasets. The recording used for this dataset features a single female reader.

Dataset Construction

Alignment Method

The dataset was constructed by sentence-aligning the source text with the LibriVox audio recording of Le Avventure di Pinocchio. In this context, "sentence" is a best approximation using sentence-final punctuation. In order to obtain sentence-level alignments, the Montreal Forced Aligner was used to produce word-level alignments, which were then rolled up to the sentence level.

Preprocessing

  • The original audio contains a LibriVox introduction that is not represented in the source text. This was removed for each chapter.

  • The original mp3 files used a variable bitrate. To ensure compatibility and simplify data validation, they were converted to a constant bitrate (128 kb/s).

Manual inspection

The alignments were manually inspected and small typos in the source text were fixed.