LINGUIST List 32.171

Mon Jan 11 2021

Calls: Slavic Subgroup; Comp Ling/Online

Editor for this issue: Lauren Perkins <laurenlinguistlist.org>



Date: 11-Jan-2021
From: Roman Yangarber <roman.yangarberhelsinki.fi>
Subject: Shared Task on Slav-NER: Recognition, Normalization, Classification and Cross-lingual linking of Named Entities in Slavic languages
E-mail this message to a friend

Full Title: Shared Task on Slav-NER: Recognition, Normalization, Classification and Cross-lingual linking of Named Entities in Slavic languages
Short Title: Slav-NER-3

Date: 19-Apr-2021 - 19-Apr-2021
Location: Kyiv (Online), Ukraine
Contact Person: Roman Yangarber
Meeting Email: < click here to access email >
Web Site: http://bsnlp.cs.helsinki.fi/shared-task.html

Linguistic Field(s): Computational Linguistics

Language Family(ies): Slavic Subgroup

Call Deadline: 01-Mar-2021

Meeting Description:

The 3rd edition of the SlavNER Shared Task focuses on the analysis of Named Entities in multilingual Web documents in Slavic languages.

Due to rich inflection, free word order, derivation, and other phenomena present in the the Slavic languages, work on Named Entities poses a challenging task. Fostering research & development on the problems of Named Entities — detecting mentions of names, lemmatization (normalization), classification, and cross-lingual matching — is crucial for cross-lingual information access and wider use of NLP in Slavic languages.

Second Call for Participation:

The 3rd edition of the shared task covers six languages:
- Bulgarian,
- Czech,
- Polish,
- Russian,
- Slovene,
- Ukrainian

and five types of named entities:
- persons,
- locations,
- organizations,
- events,
- products.

The Shared Task focuses on cross-lingual, document-level extraction of named entities — the systems should recognize, classify, and extract all named entity mentions in a document; detecting the position of each named entity mention is not required. Named-entity mentions should be lemmatized, and mentions referring to the same real-world object should be linked across documents and languages. The input text collection consists of sets of documents retrieved from the Web, each set being about a certain entity or event. The corpus was obtained by crawling the Web and parsing the HTML of documents.

IMPORTANT: it is NOT mandatory to participate in the full task, e.g., monolingual responses, without lemmatization of the extracted named entities, can be evaluated also.

See the details about the 1st edition (2017) and the 2nd edition (2019) of this shared task.

Participation:
Teams that intend to participate should register by sending an email to: bsnlpcs.helsinki.fi, which includes the following information:
- name of team,
- names of team members,
- contact person,
- contact email




Page Updated: 11-Jan-2021