MDC Curators
- United Kingdom
- datacollective.mozillafoundation.org/
- Private Company
About us
MDC Curators create and curate datasets from public domain and openly licensed sources.
Datasets
30 Datasets
| Corpus de llenguatge ofensiu en català | CC-BY-SA-4.0 | ca | NLP | TSV | 57.35 KB |
| CoVoST 2 Arabic-English | CC-BY-NC-4.0 | ar, en | MT | MP3, TSV | 148.21 MB |
| CoVoST 2 Catalan - English | CC-BY-NC-4.0 | ca, en | MT | MP3, TSV | 4.50 GB |
| CoVoST 2 Chinese (China) - English | CC-BY-NC-4.0 | zh-CN, en | MT | MP3, TSV | 713.86 MB |
| CoVoST 2 Dutch - English | CC-BY-NC-4.0 | nl, en | MT | MP3, TSV | 296.60 MB |
| CoVoST 2 English - Arabic | CC-BY-NC-4.0 | en, ar | MT | MP3, TSV | 12.56 GB |
| CoVoST 2 English-Slovenian | CC-BY-NC-4.0 | en, sl | MT | MP3, TSV | 12.56 GB |
| CoVoST 2 Estonian - English | CC-BY-NC-4.0 | et, en | MT | MP3, TSV | 239.07 MB |
| CoVoST 2 German-English | CC-BY-NC-4.0 | de, en | MT | MP3, TSV | 5.90 GB |
| CoVoST 2 Indonesian - English | CC-BY-NC-4.0 | id, en | MT | MP3, TSV | 79.77 MB |
| CoVoST 2 Italian - English | CC-BY-NC-4.0 | it, en | MT | MP3, TSV | 1.93 GB |
| CoVoST 2 Japanese - English | CC-BY-NC-4.0 | ja, en | MT | MP3, TSV | 82.06 MB |
| CoVoST 2 Latvian - English | CC-BY-NC-4.0 | lv, en | MT | MP3, TSV | 127.28 MB |
| CoVoST 2 Mongolian - English | CC-BY-NC-4.0 | mn, en | MT | MP3, TSV | 225.93 MB |
| CoVoST 2 Persian - English | CC-BY-NC-4.0 | fa, en | MT | MP3, TSV | 1.52 GB |
| CoVoST 2 Portuguese - English | CC-BY-NC-4.0 | pt, en | MT | MP3, TSV | 500.02 MB |
| CoVoST 2 Russian - English | CC-BY-NC-4.0 | ru, en | MT | MP3, TSV | 1.01 GB |
| CoVoST 2 Spanish - English | CC-BY-NC-4.0 | es, en | MT | MP3, TSV | 4.12 GB |
| CoVoST 2 Tamil - English | CC-BY-NC-4.0 | ta, en | MT | MP3, TSV | 81.45 MB |
| CoVoST 2 Turkish - English | CC-BY-NC-4.0 | tr, en | MT | MP3, TSV | 207.40 MB |
| CoVoST 2 Welsh - English | CC-BY-NC-4.0 | cy, en | MT | MP3, TSV | 95.16 MB |
| Cuentos en Kʼicheʼ leídos en voz alta | CC-BY-SA-4.0 | quc | ASR | MP3. TSV | 152.62 MB |
| Cuentos en Mam leídos en voz alta | CC-BY-SA-4.0 | mam | ASR | MP3, TSV | 110.28 MB |
| Finance Sentences - North American Spanish | CC0-1.0 | es-US | NLP | TSV, JSON | 18.35 MB |
| LibriVox Croatian TTS Male Voice | CC0-1.0 | hr | TTS | MP3, TXT, TSV | 377.60 MB |
| LibriVox Czech TTS Female Voice | CC0-1.0 | cs | TTS | MP3, TXT, TSV | 178.58 MB |
| LibriVox Italian TTS Female Voice | CC0-1.0 | it | TTS | MP3, TSV | 61.74 MB |
| Sentence translation difficulty in Spanish - BOUQuET | CC-BY-SA-4.0 | es | MT | TSV | 55.83 KB |
| UK Sort Codes - ASR Evaluation | CC-BY-4.0 | en-GB | ASR | WEBM, TSV | 23.76 MB |
| ViQua² — Visual Question-answering about Quantities | CC-BY-SA-4.0 | en-US | CV | JSON, JPEG | 281.05 MB |