RAPNIC - Catalan Impaired Speech (Example Set)

Description

RAPNIC (Reconeixement Automatic de la Parla No Intel-ligible en Catala) is the first Catalan speech corpus collected from individuals with speech disorders, primarily cerebral palsy and Down syndrome. It was built to develop and evaluate Automatic Speech Recognition (ASR) systems accessible to Catalan speakers with impaired speech. This package is an EXAMPLE SET ONLY: it contains 1,000 clean recordings (10 per speaker) from 100 speakers, totalling about 1.33 hours of 16 kHz mono WAV audio in a single train split, with an accompanying metadata TSV (transcriptions, durations, task IDs, etc.). To protect speaker privacy, no per-recording demographic metadata is included. The full RAPNIC corpus is NOT yet publicly released and has no planned release date. It currently comprises around 100 participants and over 22,000 recordings, collected under controlled conditions via a read-speech protocol following participatory and ethical protocols. Recordings were collected with a web platform adapted from Project Euphonia, with participants supported by speech therapists and caregivers. (Complete-set demographics are given under Other information below.) If you are interested in the full dataset, please get in touch (see Contact information). This example set was produced by iSocial in collaboration with the CLiC research group at the Universitat de Barcelona (UB).

This is an EXAMPLE SET of the RAPNIC corpus, the first impaired-speech corpus for Catalan, covering two of the most frequent developmental speech disorders: Down syndrome and cerebral palsy. The full corpus is not yet published (no planned date); please contact iSocial for access or more information.

Contents of this package: 1,000 clean recordings (10 per speaker) from 100 speakers, ~1.33 hours of 16 kHz mono WAV audio, single train split, plus a metadata TSV. No per-recording demographic metadata is shipped, to prevent re-identification.

Full corpus (unreleased): ~100 participants and over 22,000 recordings collected via a read-speech protocol. Demographics: Down syndrome 51%, cerebral palsy 35%, no response 10%, other 4%; female 54% / male 44%; age mostly 31-45; dialect predominantly Central Catalan (Barcelona-Tarragona), with Girona, Septentrional and Nord-Occidental varieties represented.

Data collection: Recordings were made with a web-based platform, a fork of Google's Project Euphonia Audio Tool. Participants read prompts on screen, supported as needed by speech therapists and caregivers. Each recording has 2 s trimmed from the end; only clean (non-duplicate, within-threshold) recordings are included here.

Example set on Hugging Face: https://huggingface.co/datasets/CLiC-UB/rapnic-example

Citation

A formal citation is not yet available. A methodology paper with experimental results on ASR is in preparation.

Please check README.md for more information.

RAPNIC - Catalan Impaired Speech (Example Set)

Description

Specifics

Considerations

Processes

Metadata

Citation