Publishing Partner: Cambridge University Press CUP Extra Publisher Login

The LINGUIST List is dedicated to providing information on language and language analysis, and to providing the discipline of linguistics with the infrastructure necessary to function in the digital world. LINGUIST is a free resource, run by linguistics students and faculty, and supported by your donations. Please support LINGUIST List during the 2017 Fund Drive.

E-mail this page

Conference Information

Full Title: Encoding Language and Linguistic Information in Historical Corpora

Location: Saarbrücken, Germany
Start Date: 08-Mar-2017 - 10-Mar-2017
Contact: Kerstin Eckart
Meeting Email: click here to access email
Meeting URL:
Meeting Description: Historical corpora have been established as an empirical digital base for various types of linguistic studies. The corpora are based on texts (sometimes images) and often require special information encodings, e.g. transcription and normalization. With respect to corpus linguistics as a method, annotating a (historical) corpus is always a matter of interpretation, either of its structure or of its content, and need not be universally consensual. Additionally, annotations have to balance between a diplomatic representation of historical texts and its linguistic analysis. This requires a linguistic modelling of annotations to develop (i) annotation guidelines, standardized and customized ones, (ii) annotation concepts, such as spans, trees or graphs, (iii) annotation assignment methods, and (iv) corpus architectures. This working group would like to ask which methods of annotation have proven successful in order to address the balancing of historical diplomatic representation and linguistic analyses in historical, corpus-linguistic studies. Additionally, we would like to learn from cases, where common linguistic annotations are not sufficient for the structured exploration of the historical corpus data, and where new approaches address these requirements.

This workshop would like to bring together linguists interested in and using historical corpora, corpus linguists, and computational linguists.
Linguistic Subfield: Computational Linguistics; Historical Linguistics; Text/Corpus Linguistics
LL Issue: 27.2337

This is a session of the following meeting:
39th Annual Meeting of the DGfS (Deutsche Gesellschaft für Sprachwissenschaft)

Calls and Conferences main page