
LINGUIST List 24.416

Wed Jan 23 2013

Calls: Computational Linguistics, Historical Linguistics/Norway

Editor for this issue: Alison Zaharee <alisonlinguistlist.org>

Date: 22-Jan-2013
From: Thorhallur Eythorsson <tollihi.is>
Subject: Computational Historical Linguistics

Full Title: Computational Historical Linguistics

Date: 22-May-2013 - 22-May-2013
Location: Oslo, Norway
Contact Person: Thorhallur Eythorsson
Web Site: http://spraakbanken.gu.se/eng/nodalida-chl-ws-2013

Linguistic Field(s): Computational Linguistics; Historical Linguistics

Call Deadline: 18-Mar-2013

Meeting Description:

Recent years have seen a surge of interest in the application of computational methods to problems in historical linguistics. To date, much of this work has been based on the application of simple similarity measures to short lists of lexical items or grammatical features for achieving large-scale genetic grouping of languages. While highly publicized and demonstrably useful, such approaches are inherently limited both by the narrow range of linguistic features examined and by the low-level processing methods used.

At the same time, language technology for dealing with modern languages has developed apace, with automatic language tools now achieving a degree of accuracy that has enabled both popular online services such as Google Translate and the rapid accumulation of linguistically annotated monolingual and multilingual corpora for many languages. Much less has been done on historical texts: there is little commercial interest in these language varieties, there are often limited amounts of data (making purely data-driven annotation approaches unfeasible), and they are less well-behaved than modern print corpora, due to lack of standardization on all linguistic levels, starting with orthography. Digitized older texts also often suffer from OCR errors.

The basic premise of the workshop is that historical linguistics can benefit greatly from having access to historical and diachronic corpora with rich linguistic annotations, but this is a field where researchers have barely scratched the surface of what is possible. However, because of the nature of the material and of the research questions, interesting questions of theory and method arise in connection with this work, which are often relevant to work on modern data as well (e.g., linguistic variation in spoken language or in web genres). The workshop aims at providing a forum where these questions can be discussed. The target audience of the workshop is researchers - linguists and computational linguists - involved in the creation and utilization of richly annotated historical and diachronic text corpora, in the context of historical-comparative (diachronic, genetic) linguistic research.

Call for Papers:

We invite papers presenting original research relating to computational historical linguistics, on topics such as:

1. Theoretical and methodological aspects of automatic annotation for historical linguistic research, e.g.:
- The influence and significance of annotation errors
- Which kinds of annotation are needed and useful for historical linguistics
- How to deal with variation and multilinguality
- Annotation transfer between diachronic language stages or between languages
- Issues of standardization, interoperability and data sharing
2. Innovative user interfaces for computational historical linguistics (including search and visualization solutions)
3. Design of optimal annotation workflows with manual and automatic components for creating historical and diachronic corpora
4. Linguistic processing of annotated historical and diachronic corpora for historical linguistic research, e.g.:
- Methods for tracking change in vocabulary and grammar in diachronic corpora
- Grammar extraction and comparison on historical and diachronic treebanks

Papers should conform to the main Nodalida stylesheet.

Submissions must be anonymous, i.e. not reveal author(s) on the title page or through self-references. Papers must be submitted digitally, in PDF, and uploaded through the on-line conference system. Paper submissions that violate either of these requirements will be returned without review.

The page limits for submissions are: up to fourteen pages for regular papers (for oral presentations), and up to eight pages for short papers (to be presented as posters/demos). For both submission types, these page limits do not include additional pages with bibliographic references. Please note that NoDaLiDa 2013 adopts a single-column, smaller page format, optimized for on-screen reading. In terms of actual word counts, the above page numbers correspond to approximately eight and four pages, respectively, in a ‘classic’, two-column conference proceedings layout.

All submissions to the workshop must be uploaded electronically, following the above requirements. All submissions will be reviewed by the program committee. All accepted papers will be collected into a proceedings volume to be submitted for publication in the NEALT Proceedings Series (Linköping Electronic Conference Proceedings).

Important Dates:

18 March: Paper submission to EasyChair
11 April: Notification of acceptance
25 April: Camera-ready papers for publication. You are also required to submit the NEALT transfer of copyright agreement together with your final submission.
22 May: Workshop


Page Updated: 23-Jan-2013

Supported in part by the National Science Foundation