ANTCorpus stands for "Arabic News Texts Corpus". It is a research project that aims to collect texts from different sources of the web by incrementing the amount of data progressively.
The acronym ANT can remind the ants' work:
"Every ant should contribute to build the nest progressively".
From RSS feeds of news websites
Filter and Extract categorized data
Generate corpus documents
The files of ANT Corpus are subject to the following citation license:
By downloading ANT Corpus, you agree to cite at least one of our papers describing ANT Corpus (refer to the section below) and/or refer the project's main page in any kind of material you produce where ANT Corpus was used to conduct search or experimentation, whether be it a research paper, dissertation, article, poster, presentation, or documentation.
✅ By using this data, you have agreed to the citation licence.