RFE/RL

About us

RFE/RL is sharing on Mozilla Data Collective to support linguistic research and the development of natural language processing tools for the many languages RFE/RL broadcasts in (or has broadcast in historically).

Datasets

21 Datasets

RFE/RL Azerbaijani News Text CorpusCC-BY-NC-SA-4.0az,ruNLPTXT211.65 MB
RFE/RL Belarusian News Text CorpusCC-BY-NC-SA-4.0beNLPTXT486.55 MB
RFE/RL Bulgarian News Text CorpusCC-BY-NC-SA-4.0bgNLPTXT49.82 MB
RFE/RL Chechen News Text CorpusCC-BY-NC-SA-4.0ceNLPTXT28.29 MB
RFE/RL Crimean Tatar News Text CorpusCC-BY-NC-SA-4.0crhNLPTXT18.35 MB
RFE/RL Georgian News Text CorpusCC-BY-NC-SA-4.0kaNLPTXT257.53 MB
RFE/RL Hungarian News Text CorpusCC-BY-NC-SA-4.0huNLPTXT36.64 MB
RFE/RL Kazakh News Text CorpusCC-BY-NC-SA-4.0kkNLPTXT126.81 MB
RFE/RL Kyrgyz News Text CorpusCC-BY-NC-SA-4.0ky,ru,enNLPTXT282.41 MB
RFE/RL Macedonian News Text CorpusCC-BY-NC-SA-4.0mkNLPTXT133.95 MB
RFE/RL Pashto (Pakistani) News Text CorpusCC-BY-NC-SA-4.0psNLPTXT39.26 MB
RFE/RL Persian News Text CorpusCC-BY-NC-SA-4.0faNLPTXT307.78 MB
RFE/RL Romanian (Moldova) News Text CorpusCC-BY-NC-SA-4.0ro,ru,enNLPTXT311.87 MB
RFE/RL Romanian (Romania) News Text CorpusCC-BY-NC-SA-4.0roNLPTXT77.95 MB
RFE/RL Serbian, Bosnian, and Montenegrin (Balkan) News Text CorpusCC-BY-NC-SA-4.0hbsNLPTXT310.39 MB
RFE/RL Tajik News Text CorpusCC-BY-NC-SA-4.0tg,ruNLPTXT145.27 MB
RFE/RL Tatar-Bashkir News Text CorpusCC-BY-NC-SA-4.0tt,ba,ruNLPTXT102.44 MB
RFE/RL Turkmen News Text CorpusCC-BY-NC-SA-4.0tk,ruNLPTXT48.28 MB
RFE/RL Ukrainian (Crimea) News Text CorpusCC-BY-NC-SA-4.0ukNLPTXT180.13 MB
RFE/RL Ukrainian News Text CorpusCC-BY-NC-SA-4.0uk,ruNLPTXT591.97 MB
RFE/RL Uzbek News Text CorpusCC-BY-NC-SA-4.0uzNLPTXT154.21 MB