CLEAR Global
- United States
- clearglobal.org
- NGO
About us
CLEAR Global helps people get vital information and be heard, whatever language they speak.
We accomplish this mission through innovative global research and programs, language technology, language service platforms, an experienced, professional staff, and a community of over 200,000 linguists in 148 countries.
Datasets
15 Datasets
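| Dataset | License | Language code(s) | Task | Format | Size |
| --- | --- | --- | --- | --- | --- |
| Kanuri Books Corpus | CC-BY-4.0 | kr | LM | TXT | 545.68 KB |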
| Marma Text Corpus | CC-BY-NC-SA-4.0 | rmz | LM | TSV | 188.92 KB |
| Read Speech in Kenyan Swahili (6h) | CC-BY-NC-4.0 | sw | ASR | WAV, TSV | 1.59 GB |
| Synthetic Text Corpus for African Language ASR | CC-BY-NC-4.0 | bm,ny,ha,kr,luo | NLP | TSV | 746.63 KB |
| TWB Parallel Sentence kits - Congo Swahili (25k) | CC-BY-4.0 | swc | MT | TSV | 2.18 MB |
| TWB Parallel Sentence kits - Hausa (30k) | CC-BY-4.0 | hau | MT | TSV | 1.68 MB |
| TWB Parallel Sentence kits - Kanuri (5k) | CC-BY-4.0 | kau | MT | TSV | 358.46 KB |
| TWB Parallel Sentence kits - Lingala (5k) | CC-BY-4.0 | lin | MT | TSV | 494.43 KB |
| TWB Parallel Sentence kits - Nande (15k) | CC-BY-4.0 | nnb | MT | TSV | 1.26 MB |
| TWB Parallel Sentence kits - Rohingya (5k) | CC-BY-4.0 | rhg | MT | TSV | 358.88 KB |
| TWB Parallel Sentence kits - Swahili (5k) | CC-BY-4.0 | swa | MT | TSV | 347.61 KB |
| TWB Parallel Sentence kits - Tigrinya (5k) | CC-BY-4.0 | tig | MT | TSV | 404.75 KB |
| TWB Voice 1.0 - Hausa | CC-BY-NC-4.0 | hau | ASR | WAV, TSV | 11.88 GB |
| TWB Voice 1.0 - Kanuri | CC-BY-NC-4.0 | kau | ASR | WAV, TSV | 10.98 GB |
| TWB Voice 1.0 - Shuwa Arabic | CC-BY-NC-4.0 | shu, ar | ASR | WAV, TSV | 3.17 GB |