CLEAR Global

About us

CLEAR Global helps people get vital information and be heard, whatever language they speak.

We accomplish this mission through innovative global research and programs, language technology, language service platforms, an experienced, professional staff, and a community of over 200,000 linguists in 148 countries.

Datasets

15 Datasets

Kanuri Books CorpusCC-BY-4.0krLMTXT545.68 KB
Marma Text CorpusCC-BY-NC-SA-4.0rmzLMTSV188.92 KB
Read Speech in Kenyan Swahili (6h)CC-BY-NC-4.0swASRWAV, TSV1.59 GB
Synthetic Text Corpus for African Language ASRCC-BY-NC-4.0bm,ny,ha,kr,luoNLPTSV746.63 KB
TWB Parallel Sentence kits - Congo Swahili (25k)CC-BY-4.0swcMTTSV2.18 MB
TWB Parallel Sentence kits - Hausa (30k)CC-BY-4.0hauMTTSV1.68 MB
TWB Parallel Sentence kits - Kanuri (5k)CC-BY-4.0kauMTTSV358.46 KB
TWB Parallel Sentence kits - Lingala (5k)CC-BY-4.0linMTTSV494.43 KB
TWB Parallel Sentence kits - Nande (15k)CC-BY-4.0nnbMTTSV1.26 MB
TWB Parallel Sentence kits - Rohingya (5k)CC-BY-4.0rhgMTTSV358.88 KB
TWB Parallel Sentence kits - Swahili (5k)CC-BY-4.0swaMTTSV347.61 KB
TWB Parallel Sentence kits - Tigrinya (5k)CC-BY-4.0tigMTTSV404.75 KB
TWB Voice 1.0 - HausaCC-BY-NC-4.0hauASRWAV, TSV11.88 GB
TWB Voice 1.0 - KanuriCC-BY-NC-4.0kauASRWAV, TSV10.98 GB
TWB Voice 1.0 - Shuwa ArabicCC-BY-NC-4.0shu, arASRWAV, TSV3.17 GB