Publishing Partner: Cambridge University Press CUP Extra Publisher Login

Software Details

Title: ELRA - Language Resources Catalogue - Update
Submitter: Helene Mazo
Description: ELRA is happy to announce that 1 new Written Corpus is now available in its

ELRA-W0050 The CINTIL Corpus – International Corpus of Portuguese
CINTIL-Corpus Internacional do Português is a linguistically interpreted
written and spoken corpus of European Portuguese. It is composed of one
million annotated tokens, each one of which verified by human expert
annotators. The annotation comprises information on part-of-speech, open
class lemma and inflection, multi-word expressions pertaining to the class
of adverbs and to the closed POS classes, and multi-word proper names (for
named entity recognition). The corpus is developed over raw textual
materials of several types, of which 30% are spoken materials.

For more information, see:

For more information on the catalogue, please contact Valérie Mapelli

Visit our On-line Catalogue:
Visit the Universal Catalogue:
Archives of ELRA Language Resources Catalogue Updates:
Linguistic Field(s): Computational Linguistics
Text/Corpus Linguistics

LL Issue: 20.2136
Date Posted: 11-Jun-2009

Search Again

Back to Software Index