EurCom

The EurCom corpus is a collection of publications issued by the European Union from 2000 to 2010. The corpus was created by prof. Giuditta Caliendo (Université de Lille) in collaboration with the University of Antwerp. In addition to the plain utf-8 encoded format, there exists a POS-tagged version created by means of the Treetagger software. You can obtain both these versions by clicking on the following download link:

EurCom