We continue to improve our language-processing tools. Greek corpora now support lemmatization and part-of-speech tagging, and their tokenization has been improved.
We have already reprocessed our main Greek corpus, the Greek Web 2014 corpus (elTenTen14). The improved tools are also available for all user corpora. To benefit from the improvements, you need to recompile your existing Greek corpora.