LINGUIST List 26.3205

Wed Jul 08 2015

Calls: Computational Linguistics; Historical Linguistics; Text/Corpus Linguistics/ Language Resources and Evaluation (Jrnl)

Editor for this issue: Andrew Lamont <alamontlinguistlist.org>


Date: 08-Jul-2015
From: Eszter Simon <simon.eszternytud.mta.hu>
Subject: Computational Linguistics; Historical Linguistics; Text/Corpus Linguistics/ Language Resources and Evaluation (Jrnl)
E-mail this message to a friend

Full Title: Language Resources and Evaluation


Linguistic Field(s): Computational Linguistics; Historical Linguistics; Text/Corpus Linguistics

Call Deadline: 31-Aug-2015

We are inviting submissions for a Special Issue of the Language Resources and Evaluation Journal, entitled ''Converging Corpora: How to standardize historical corpora of typologically and genetically different languages''.

Call for Papers

The availability of annotated language resources is becoming an increasingly important factor in more and more domains of linguistic research, since high-quality linguistic databases can provide a fertile ground for theoretical investigations. Historical corpora represent a rich source of data, but only if the relevant information is specified in a computationally retrievable and interpretable way.

Several databases of historical texts enriched with some kind of linguistic information and metadata have recently been created for various Indo-European languages, such as the Penn Corpora of Historical English, the Tycho Brahe Parsed Corpus of Historical Portuguese, or the Welsh Prose corpus and for non-Indo-European languages as well, cf. the Old Hungarian Corpus.

With the recent increase in the number of annotated historical corpora, it seems advisable to move towards a harmonized common framework and methodology. An important goal of the special issue is to highlight the issues we encounter when annotating languages with rich morphology.

Questions we would like to be addressed include:

- To what extent should the existing annotation schemes be extended for the incorporation of highly inflected languages?
- How can existing schemes be extended to accomplish this?
- How can the linguistic annotation of historical corpora be standardized to serve an easy-to-use data access for linguists?

We invite submissions of articles describing annotation schemes of historical corpora, attempts to standardization, and harmonized annotation frameworks.

To provide a possibility of collaboration, we organized a special workshop of the 16th Diachronic Generative Syntax conference on ''Converging Corpora: How to standardize historical corpora of typologically and genetically different languages''. A natural candidate for this call is an extended paper from the workshop presentations. However, we do not limit the contributions to DiGS-related works. Instead, other works presenting standardization efforts of annotation schemes of historical corpora are also welcome.

Finally, papers describing concrete historical corpora or tools adapted to old language varieties are also welcome, provided they highlight important properties of the problem of standardization and present relevant solutions.

Important Dates

Call for papers issued: 31 March 2015
Submissions due: 31 August 2015
Author notification of acceptance: 30 November 2015
Final manuscripts submitted: 31 March 2016

Submission of Works

To prepare the papers, please follow the style guidelines provided by the LRE journal.

To submit papers:

- Go to http://www.editorialmanager.com/lrev/
- Register and login as an author.
- Select ''S.I. : Converging Corpora'' as article type.
- Follow the instructions and submit your paper.

Guest Editors

- Tamás Váradi - Research Institute for Linguistics, Hungarian Academy of Sciences (varadi.tamasnytud.mta.hu)
- Eszter Simon - Research Institute for Linguistics, Hungarian Academy of Sciences (simon.eszternytud.mta.hu)


Page Updated: 08-Jul-2015