MDC Curators
- United Kingdom
- datacollective.mozillafoundation.org/
- Private Company
About us
MDC Curators create and curate datasets from public domain and openly licensed sources.
Datasets
10 Datasets
| Dataset | License | Language | Task | Formats | Size |
| --- | --- | --- | --- | --- | --- |
| Corpus de llenguatge ofensiu en català | CC-BY-SA-4.0 | ca | NLP | TSV | 57.35 KB |
| Cuentos en Kʼicheʼ leídos en voz alta | CC-BY-SA-4.0 | quc | ASR | MP3, TSV | 152.62 MB |
| Cuentos en Mam leídos en voz alta | CC-BY-SA-4.0 | mam | ASR | MP3, TSV | 110.28 MB |
| Finance Sentences - North American Spanish | CC0-1.0 | es-US | NLP | TSV, JSON | 18.35 MB |
| LibriVox Croatian TTS Male Voice | CC0-1.0 | hr | TTS | MP3, TXT, TSV | 377.60 MB |
| LibriVox Czech TTS Female Voice | CC0-1.0 | cs | TTS | MP3, TXT, TSV | 178.58 MB |
| LibriVox Italian TTS Female Voice | CC0-1.0 | it | TTS | MP3, TSV | 61.74 MB |
| Sentence translation difficulty in Spanish - BOUQuET | CC-BY-SA-4.0 | es | MT | TSV | 81.48 KB |
| UK Sort Codes - ASR Evaluation | CC-BY-4.0 | en-GB | ASR | WEBM, TSV | 23.76 MB |
| ViQua² — Visual Question-answering about Quantities | CC-BY-SA-4.0 | en-US | CV | JSON, JPEG | 281.05 MB |