LINGUIST List 26.3205
Wed Jul 08 2015
Calls: Computational Linguistics; Historical Linguistics; Text/Corpus Linguistics/ Language Resources and Evaluation (Jrnl)
Editor for this issue: Andrew Lamont <alamontlinguistlist.org>
Date: 08-Jul-2015
From: Eszter Simon <simon.eszter
nytud.mta.hu>
Subject: Computational Linguistics; Historical Linguistics; Text/Corpus Linguistics/ Language Resources and Evaluation (Jrnl)
E-mail this message to a friend Full Title: Language Resources and Evaluation
Linguistic Field(s): Computational Linguistics; Historical Linguistics; Text/Corpus Linguistics
Call Deadline: 31-Aug-2015
We are inviting submissions for a Special Issue of the Language Resources and Evaluation Journal, entitled ''Converging Corpora: How to standardize historical corpora of typologically and genetically different languages''.
Call for Papers
The availability of annotated language resources is becoming an increasingly important factor in more and more domains of linguistic research, since high-quality linguistic databases can provide a fertile ground for theoretical investigations. Historical corpora represent a rich source of data, but only if the relevant information is specified in a computationally retrievable and interpretable way.
Several databases of historical texts enriched with some kind of linguistic information and metadata have recently been created for various Indo-European languages, such as the Penn Corpora of Historical English, the Tycho Brahe Parsed Corpus of Historical Portuguese, or the Welsh Prose corpus and for non-Indo-European languages as well, cf. the Old Hungarian Corpus.
With the recent increase in the number of annotated historical corpora, it seems advisable to move towards a harmonized common framework and methodology. An important goal of the special issue is to highlight the issues we encounter when annotating languages with rich morphology.
Questions we would like to be addressed include:
- To what extent should the existing annotation schemes be extended for the incorporation of highly inflected languages?
- How can existing schemes be extended to accomplish this?
- How can the linguistic annotation of historical corpora be standardized to serve an easy-to-use data access for linguists?
We invite submissions of articles describing annotation schemes of historical corpora, attempts to standardization, and harmonized annotation frameworks.
To provide a possibility of collaboration, we organized a special workshop of the 16th Diachronic Generative Syntax conference on ''Converging Corpora: How to standardize historical corpora of typologically and genetically different languages''. A natural candidate for this call is an extended paper from the workshop presentations. However, we do not limit the contributions to DiGS-related works. Instead, other works presenting standardization efforts of annotation schemes of historical corpora are also welcome.
Finally, papers describing concrete historical corpora or tools adapted to old language varieties are also welcome, provided they highlight important properties of the problem of standardization and present relevant solutions.
Important Dates
Call for papers issued: 31 March 2015
Submissions due: 31 August 2015
Author notification of acceptance: 30 November 2015
Final manuscripts submitted: 31 March 2016
Submission of Works
To prepare the papers, please follow the style guidelines provided by the LRE journal.
To submit papers:
- Go to
http://www.editorialmanager.com/lrev/ - Register and login as an author.
- Select ''S.I. : Converging Corpora'' as article type.
- Follow the instructions and submit your paper.
Guest Editors
- Tamás Váradi - Research Institute for Linguistics, Hungarian Academy of Sciences (varadi.tamas
nytud.mta.hu)
- Eszter Simon - Research Institute for Linguistics, Hungarian Academy of Sciences (simon.eszter
nytud.mta.hu)
Page Updated: 08-Jul-2015