License:
NOODL-1.0
Steward:
Institute of African Digital HumanitiesDataset ID:
cmqj784dc01klnr07nz7msurn
Task: OTH
Release Date: 6/18/2026
Format: WAV, XML
Size: 500.98 MB
Share
EMAC-Dataset-DRC is a digitised ethnomusicological audio collection dedicated to the documentation and preservation of traditional vocal music from the Democratic Republic of Congo (DRC), formerly known as Zaïre. The dataset was constituted in the framework of the Ethnomusicologie d'Afrique Centrale (EMAC) project, initiated by CERDOTOLA (Centre de Recherche et de Documentation sur les Traditions Orales et pour le Développement des Langues Africaines) following the founding meeting held in Sangmélima, Cameroon, from 17 to 22 September 1979. The dataset comprises 5 high-quality WAV audio recordings representing three distinct oral music genres: the berceuse (lullaby), the chante-fable, and the musique épique (epic music). The epic music session (MEP_002) is distributed across three consecutive audio files. The recordings were originally captured on magnetic tape and digitised in 2015 at CERDOTOLA in Yaoundé, Cameroon, under the coordination of Dr. Emmanuel Ngue Um, using Adobe Audition 3.0. At the time of the Sangmélima meeting (1979), the DRC (then Zaïre) was described as having one of the most advanced ethnomusicological infrastructures in the region: the Institut des Musées Nationaux du Zaïre had conducted 13 field collection missions totalling 300 hours of tape recordings, and the Institut National des Arts in Kinshasa had an active musicology programme. The EMAC sub-corpus for DRC preserved here reflects a selection from the broader collection assembled in that context. As an ethnomusicological rather than a linguistic dataset, EMAC-Dataset-DRC does not provide transcriptions or sentence-level alignments.
Licensing
Nwulite Obodo Open Data Licence 1.0 (NOODL-1.0)
https://licensingafricandatasets.com/nwulite-obodo-licenseRestrictions/Special Constraints
By downloading this dataset, you agree: - To use it for research, education, and cultural heritage purposes only - That you will not re-host or re-share this dataset without the explicit permission of the legal owner
Forbidden Usage
You agree not to use the data for: determining the identity of any performer in the dataset; attempting to clone any voice or train models that imitate any performer in this dataset; Generative AI; reproduction; duplication; modification; augmentation; copying; distribution; transmission; display; sale; transfer; publication or creation of derivative works without the explicit permission of the legal owner of the dataset.
Intended Use
(a) Ethnomusicological and cultural heritage tasks: - Ethnomusicological research: The dataset provides authentic recordings of DRC lullaby, chante-fable, and epic music traditions and is suited for the study of vocal style, narrative structure, melodic contour, and genre conventions in Central African oral music. - Cultural heritage archiving: The dataset contributes to the digital preservation of intangible cultural heritage from the DRC, supporting institutions involved in the documentation of African oral traditions. - Cross-cultural comparative analysis: Used alongside the other six EMAC country datasets, these recordings enable comparative studies of shared and divergent oral music traditions across Central Africa. (b) Computational and technological tasks: - Computational ethnomusicology: Audio signal analysis, genre classification, melodic transcription, and acoustic modelling using traditional Central African vocal music as data. Ethical Review Process All recordings were collected with the knowledge and participation of the performers and their affiliated institutions. The dataset is made available for research and educational use in the spirit of the EMAC project's mission of documenting and promoting Central African musical heritage.
The musical heritage of the Democratic Republic of Congo is one of the most diverse and internationally recognised on the African continent, encompassing hundreds of ethnic musical traditions across its vast territory. The EMAC sub-corpus for DRC draws on traditions of vocal domestic music (berceuse), narrative oral music (chante-fable), and epic musical performance (musique épique). These genres belong to the sphere of community transmission and oral literature, where music functions as a vehicle for memory, narrative, and social identity. The diversity of DRC musical traditions is reflected in the country's ethnomusicological infrastructure, which by 1979 had already produced substantial archives of recorded traditional music.
Berceuse (BER): A traditional lullaby genre, performed vocally to soothe infants. One session is represented (BER_001).
Chante-fable (CHF): A hybrid narrative genre combining sung passages with spoken prose. One session is represented (CHF_001).
Musique épique (MEP): An epic music tradition in which extended narrative performances are delivered in a melodic or semi-melodic vocal style, often accompanied by instrumental music. One session is represented across three audio files (MEP_002, parts 1–3 of 3).
The musical traditions represented in this dataset are transmitted orally, through community performance, family apprenticeship, and participation in ceremonial and social events. No standardised written notation system exists for these genres. Epic music (musique épique) in particular is associated with specialist oral performers who acquire their repertoire through prolonged apprenticeship with master performers. The recordings in this dataset document elicited performances in a controlled environment and serve as primary archival evidence of these traditions as practised by DRC performers affiliated with the Institut des Musées Nationaux and the Institut National des Arts.
The recordings in this dataset were collected by researchers and technicians of the EMAC project in the Democratic Republic of Congo (then Zaïre), approximately between 1979 and 1982. The project engaged the Institut des Musées Nationaux du Zaïre and the Institut National des Arts de l'Université du Zaïre (Kinshasa campus) as the principal institutional partners. Emmanuel Gatete, chef du département de musicologie at the Institut National des Arts, was the country representative at the Sangmélima founding meeting. The original recordings were stored on magnetic tape at CERDOTOLA and digitised in 2015 under the coordination of Dr. Emmanuel Ngue Um.
The recordings represent three genres of traditional vocal music from the DRC: a lullaby, a song-fable, and an extended epic music performance in three parts. All performances were elicited in a controlled recording environment at the request of the EMAC project.
Total audio duration: approximately 1 hour 25 minutes (5,094 seconds), distributed across 5 WAV audio files. IMDI 3.03 XML metadata files: 1 (corpus-level only), in IMDI/ subfolder.
The dataset comprises 5 WAV audio files organised as follows:
EMAC_CD_BER_001-1-1.wav — Berceuse 001 (1 part of 1)
EMAC_CD_CHF_001-1-1.wav — Chante-fable 001 (1 part of 1)
EMAC_CD_MEP_002-1-3.wav — Musique épique 002 (part 1 of 3)
EMAC_CD_MEP_002-2-3.wav — Musique épique 002 (part 2 of 3)
EMAC_CD_MEP_002-3-3.wav — Musique épique 002 (part 3 of 3)
File naming convention: EMAC_{CC}{GENRE}{NNN}-{part}-{total}.wav
CC = ISO country code (CD = Democratic Republic of Congo)
GENRE = genre code (BER, CHF, MEP)
NNN = zero-padded session number
part / total = part index and total number of parts for multi-file sessions
IMDI metadata files (IMDI/ subfolder):
EMAC_Republique_Democratique_du_Congo_Zaire_.imdi — Corpus-level (Ethnomusicologie du Congo Démocratique)
| Audio file | Genre | Session | Parts |
|---|---|---|---|
| EMAC_CD_BER_001-1-1.wav | Berceuse | 001 | 1/1 |
| EMAC_CD_CHF_001-1-1.wav | Chante-fable | 001 | 1/1 |
| EMAC_CD_MEP_002-1-3.wav | Musique épique | 002 | 1/3 |
| EMAC_CD_MEP_002-2-3.wav | Musique épique | 002 | 2/3 |
| EMAC_CD_MEP_002-3-3.wav | Musique épique | 002 | 3/3 |