* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
LINGUIST List 21.3904

Mon Oct 04 2010

FYI: Release of SemEval2010-Task1 Datasets

Editor for this issue: Brent Miller <brentlinguistlist.org>

To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.cfm.
        1.    Mariona Taulé, Release of SemEval2010-Task1 Datasets

Message 1: Release of SemEval2010-Task1 Datasets
Date: 04-Oct-2010
From: Mariona Taulé <mtauleub.edu>
Subject: Release of SemEval2010-Task1 Datasets
E-mail this message to a friend

We are pleased to announce the release of the SemEval2010-Task1 datasets
for coreference resolution in multiple languages including Catalan, Dutch,
German, Italian, and Spanish. The English data will be released by LDC
early 2011.

The data is now freely available for downloading from the task website, at

General formatting is shared by all languages and is inspired by the
previous CoNLL shared tasks (2008/2009 editions:
http://ufal.mff.cuni.cz/conll2009-st). The data are displayed in a uniform
column-based format with information about coreference, lemma, PoS,
morphological features, head, dependency relations, NEs, and semantic
dependencies. Both gold-standard and automatically predicted information
are provided (availability depending on the language). The existence of
gold and automatic preprocessing of already published results makes it an
ideal resource for training and testing coreference resolution systems,
especially when cross-language portability is to be achieved.

Marta Recasens, Lluís Màrquez, Emili Sapena, M. Antònia Martí, Mariona
Taulé, Véronique Hoste, Massimo Poesio, and Yannick Versley. 2010.
SemEval-2010 Task 1: Coreference Resolution in Multiple Languages. In
Proceedings of the 5th International Workshop on Semantic Evaluation
(SemEval-2010), ACL 2010, pages 1-8, Uppsala, Sweden.

We will be happy to publish your results or articles related to any of
these datasets on our website: http://stel.ub.edu/semeval2010-coref/posttask.

Please feel free to let us know.

Marta Recasens, M. Antònia Martí, Mariona Taulé
University of Barcelona
{mrecasens, amarti, mtaule}ub.edu

Lluís Màrquez, Emili Sapena
Technical University of Catalonia
{lluism, esapena}lsi.upc.edu

Massimo Poesio
University of Essex / University of Trento

Véronique Hoste
University College Ghent

Yannick Versley
University of Tübingen

Linguistic Field(s): General Linguistics

Read more issues|LINGUIST home page|Top of issue

Page Updated: 04-Oct-2010

Supported in part by the National Science Foundation       About LINGUIST    |   Contact Us       ILIT Logo
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed on its pages, it cannot vouch for their contents.