MDC Curators

About us

MDC Curators create and curate datasets from public domain and openly licensed sources.

Datasets

10 Datasets

Dataset | License | Locale | Task | Formats | Size
Corpus de llenguatge ofensiu en català | CC-BY-SA-4.0 | ca | NLP | TSV | 57.35 KB
Cuentos en Kʼicheʼ leídos en voz alta | CC-BY-SA-4.0 | quc | ASR | MP3, TSV | 152.62 MB
Cuentos en Mam leídos en voz alta | CC-BY-SA-4.0 | mam | ASR | MP3, TSV | 110.28 MB
Finance Sentences - North American Spanish | CC0-1.0 | es-US | NLP | TSV, JSON | 18.35 MB
LibriVox Croatian TTS Male Voice | CC0-1.0 | hr | TTS | MP3, TXT, TSV | 377.60 MB
LibriVox Czech TTS Female Voice | CC0-1.0 | cs | TTS | MP3, TXT, TSV | 178.58 MB
LibriVox Italian TTS Female Voice | CC0-1.0 | it | TTS | MP3, TSV | 61.74 MB
Sentence translation difficulty in Spanish - BOUQuET | CC-BY-SA-4.0 | es | MT | TSV | 81.48 KB
UK Sort Codes - ASR Evaluation | CC-BY-4.0 | en-GB | ASR | WEBM, TSV | 23.76 MB
ViQua² — Visual Question-answering about Quantities | CC-BY-SA-4.0 | en-US | CV | JSON, JPEG | 281.05 MB