LINGUIST List 27.2337
Tue May 24 2016
Calls: Computational Ling, Historical Ling, Text/Corpus Ling/Germany
Editor for this issue: Anna White <awhitelinguistlist.org>
Carolin Odebrecht <carolin.odebrecht
Encoding Language and Linguistic Information in Historical Corpora E-mail this message to a friend
Full Title: Encoding Language and Linguistic Information in Historical Corpora
Date: 08-Mar-2017 - 10-Mar-2017
Location: Saarbrücken, Germany
Contact Person: Kerstin Eckart
Meeting Email: < click here to access email >
Web Site: https://hu.berlin/DGfS2017AGOdebrechtEckart
Linguistic Field(s): Computational Linguistics; Historical Linguistics; Text/Corpus Linguistics
Call Deadline: 16-Aug-2016
Historical corpora have been established as an empirical digital base for various types of linguistic studies. The corpora are based on texts (sometimes images) and often require special information encodings, e.g. transcription and normalization. With respect to corpus linguistics as a method, annotating a (historical) corpus is always a matter of interpretation, either of its structure or of its content, and need not be universally consensual. Additionally, annotations have to balance between a diplomatic representation of historical texts and its linguistic analysis. This requires a linguistic modelling of annotations to develop (i) annotation guidelines, standardized and customized ones, (ii) annotation concepts, such as spans, trees or graphs, (iii) annotation assignment methods, and (iv) corpus architectures. This working group would like to ask which methods of annotation have proven successful in order to address the balancing of historical diplomatic representation and linguistic analyses in historical, corpus-linguistic studies. Additionally, we would like to learn from cases, where common linguistic annotations are not sufficient for the structured exploration of the historical corpus data, and where new approaches address these requirements.
This workshop would like to bring together linguists interested in and using historical corpora, corpus linguists, and computational linguists.
Call for Papers:
We invite submissions for 30 minutes talks (20 min presentation + 10 min discussion) on these aspects but are not limited to:
- The interplay between corpus architecture, annotation decisions and reusability of the research data.
- The applicability of manual or automatic annotation: effects of corpus size, tag sets, error types, and methods of information retrieval.
- Methods for the exploration of annotated corpora (e.g. combination of annotation concepts, search for and interpretation of annotations).
Submissions for the working group (DGfS Kurz-AG) should comprise an abstract of max. 1 DIN A4 page in Times New Roman, pt. 12, 1,5 line spacing, including references which should conform to the guidelines of the ‘Zeitschrift für Sprachwissenschaft’.
Please send the abstracts in Word-format (doc, docx) or in PDF to kerstin.eckart
Deadline for submission: 16.8.2016
Notification of acceptance: 1.9.2016
We are looking forward to seeing you at the DGfS 2017!
Kerstin Eckart and Carolin Odebrecht
Page Updated: 24-May-2016