Sample Medumba-TTS-Dataset

Description

Medumba-TTS-Dataset is a scripted speech dataset dedicated to the documentation and technological development of Medumba (ISO 639-3: byv), a Grassfields Bantu language spoken in the Ndé Division of the West Region of Cameroon. The dataset was compiled in the framework of the Mozilla Data Collective initiative (2026), in addition to the existing Common Voice Scripted Speech 25.0 – Medumba dataset (https://mozilladatacollective.com/datasets/cmn2chivm01cbo107vqvbgn2i). The dataset comprises 994 high-quality MP3 audio recordings of Medumba sentences read by a native speaker across 10 recording sessions, together with per-session sentence-to-audio mapping files enabling precise alignment between textual and acoustic data. Sentences were drawn from a scripted speech prompt list and read in a controlled environment. The transcription of all sentences follows the General Alphabet of Cameroon's Languages (AGLC; French acronym: Alphabet Général des Langues Camerounaises), the reference standard for Cameroonian national languages. The Medumba AGLC orthography is distinguished by an extended vowel inventory — including the low back unrounded vowel ɑ, the open-mid front unrounded vowel ɛ, the open-mid back rounded vowel ɔ, the high central rounded vowel ʉ, and the mid central schwa ə — as well as a set of labialized consonants written by appending w to the base consonant (e.g., kw, gw, sw, bw), a series of pre-nasalized consonants written as digraphs or trigraphs (e.g., mb, nd, ŋg, ns, nsw), a two-level tone-marking system using grave (low) and contour diacritics (caron for rising LH; circumflex for falling HL) applied to vowels — high tone being unmarked — and the modifier letter apostrophe (ʼ) for the glottal stop. The parallel availability of AGLC-transcribed text and aligned speech makes the dataset suitable for a wide range of applications, including text-to-speech (TTS) synthesis, automatic speech recognition (ASR), forced alignment, pronunciation modelling, and language learning tools. It also directly supports efforts to standardise and normalise the digital representation of Medumba in language technology contexts.

audio file	sentence (Medumba, AGLC)
bf3b017ea63a09123b36c0acb58aab43.mp3	ŋgàmbándá àʔ- nɛ̀ɛ́n ʧwɛ̀t ndàáʔndʒʉ á gə̀- lɛ̀ɛ́n mbə́zə̄ mbàŋ gə́- lú
421886434692cfc9d071901cc0d81bac.mp3	lɛ̂n sə bə α̂ Fʉ̀'nkə'ə à bə a zə nunga
a240324809aa0f990a72e2f42dc6c649.mp3	mə ghʉ̌ tà' nshun mɛnmαndùm mbὰ tà' nshun mɛn mɛ̀nnzwi
566d2baa7091db5d067acf13e99931a7.mp3	àbâ tɔ̌ tûɁndá zə̄ Númí lù - ngǝ́ bɛ́tǝ́ càŋ ŋkɔ̀k tə̀tswə́
bf03399bfe1b63c531dca78b33147275.mp3	Nya bùnte ndù

audio file

sentence (Medumba, AGLC)

bf3b017ea63a09123b36c0acb58aab43.mp3

ŋgàmbándá àʔ- nɛ̀ɛ́n ʧwɛ̀t ndàáʔndʒʉ á gə̀- lɛ̀ɛ́n mbə́zə̄ mbàŋ gə́- lú

421886434692cfc9d071901cc0d81bac.mp3

lɛ̂n sə bə α̂ Fʉ̀'nkə'ə à bə a zə nunga

a240324809aa0f990a72e2f42dc6c649.mp3

mə ghʉ̌ tà' nshun mɛnmαndùm mbὰ tà' nshun mɛn mɛ̀nnzwi

566d2baa7091db5d067acf13e99931a7.mp3

àbâ tɔ̌ tûɁndá zə̄ Númí lù - ngǝ́ bɛ́tǝ́ càŋ ŋkɔ̀k tə̀tswə́

bf03399bfe1b63c531dca78b33147275.mp3

Nya bùnte ndù

Description

Specifics

Considerations

Processes

Metadata

Language

Variants

Writing System

1. Vowels

2. Consonants

3. Tone system

4. Orthographic variation across sessions

Source

Domain

Size

Structure

Description of columns (mapping.tsv)

Sample