Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info

Software Details

Title: ELRA - Language Resources Catalogue - Update
Submitter: Helene Mazo
Description: ELRA is happy to announce that 1 new Written Corpus is now available in its
catalogue:

ELRA-W0050 The CINTIL Corpus – International Corpus of Portuguese
CINTIL-Corpus Internacional do Português is a linguistically interpreted
written and spoken corpus of European Portuguese. It is composed of one
million annotated tokens, each one of which verified by human expert
annotators. The annotation comprises information on part-of-speech, open
class lemma and inflection, multi-word expressions pertaining to the class
of adverbs and to the closed POS classes, and multi-word proper names (for
named entity recognition). The corpus is developed over raw textual
materials of several types, of which 30% are spoken materials.

For more information, see:
http://catalog.elra.info/product_info.php?products_id=1102

For more information on the catalogue, please contact Valérie Mapelli
mailto:mapelli@elda.org

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates:
http://www.elra.info/LRs-Announcements.html
Linguistic Field(s): Computational Linguistics
Text/Corpus Linguistics

LL Issue: 20.2136
Date Posted: 11-Jun-2009

Search Again

Back to Software Index