LINGUIST List 27.933

Mon Feb 22 2016

Confs: Computational Ling, Text/Corpus Ling/Germany

Editor for this issue: Ashley Parker <ashleylinguistlist.org>


Date: 22-Feb-2016
From: Armin Hoenen <hoenenem.uni-frankfurt.de>
Subject: 1st GSCL/CEDIFOR-Workshop on Corpora and Resources for Low Resource Languages with a Special Focus on Historical Languages
E-mail this message to a friend

1st GSCL/CEDIFOR-Workshop on Corpora and Resources for Low Resource Languages with a Special Focus on Historical Languages
Short Title: CRiLL-HL


Date: 29-Feb-2016 - 29-Feb-2016
Location: Frankfurt am Main, Germany
Contact: Armin Hoenen
Contact Email: < click here to access email >
Meeting URL: http://gscl-ak-korpuslinguistik-devel.hucompute.org/cfp-cedifor-workshop/

Linguistic Field(s): Computational Linguistics; Text/Corpus Linguistics

Meeting Description:

1st GSCL/CEDIFOR-Workshop on Corpora and Resources for Low Resource Languages with a Special Focus on Historical Languages
Short title: CRiLL-HL
Location: Goethe University Frankfurt, Germany
Web Site: http://gscl-ak-korpuslinguistik-devel.hucompute.org/cfp-cedifor-workshop/

Organizing commitee: Alexander Mehler, Jost Gippert, Armin Hoenen
Program commitee: tba

Keynote Speakers: Prof. Dr. Anke Lüdeling

CRiLL-HL aims at researchers working with Low Resource Languages - a special focus is on historical languages.

In recent years, the interest in language resources and computational models for the study of low resource languages (LRLs) has been constantly growing. This concerns especially historical LRLs for which authentic digitized material is hardly available. Since for most historical languages data is sparse one may consider Wikipedia and related projects (on Old English, Gothic, Old Church Slavonic, Latin etc.) to overcome data sparseness. This approach investigates to what extent such data can be useful for modeling historical languages. The LRL workshop addresses scholars working on these and related topics using LRL corpora for NLP of low-resourced languages.

Program Commitee:

Prof. Dr. Tara Andrews
Prof. Dr. Chris Biemann
Dr. Cristopher Brown
Dr. Marco Büchler
Prof. Dr. Christian Chiarcos
Prof. Dr. Stefanie Dipper
Dr. Steffen Eger
Prof. Dr. Christiane Fellbaum
Prof. Dr. Kurt Gärtner
Prof. Dr. Jolanta Gelumbeckaitė
Prof. Dr. Dafydd Gibbon
Prof. Dr. Jost Gippert
Dr. Erla Hallsteinsdóttir
Prof. Dr. Gerhard Heyer
Armin Hoenen, M.A.
Prof. Dr. Brian Joseph
Prof. Dr. Caroline Macé
Prof. Dr. Alexander Mehler
Prof. Dr. Marco Passarotti
Prof. Dr. Andrea Rapp
Prof. Dr. Joachim Scharloth
Prof. Dr. Bernhard Schröder
Prof. Dr. Anders Søgaard

Keynote Speakers:

Prof. Dr. Anke Lüdeling

Program:

8:45-9:00
Welcoming reception

9:00-09:20
Speaker: Stefanie Dipper
Authors: Florian Petran, Thomas Klein, Stefanie Dipper and Marcel Bollmann
Title: REM: A reference corpus of Middle High German — corpus compilation, annotation, visualization and searching

09:20-09:40
Speaker: Roland Mittmann
Author: Roland Mittmann
Title: Automatisierter Abgleich des Lautstandes althochdeutscher Wörter

09:40-10:00
Speaker:
Author: Lars Döhling, Manuel Burghardt and Christian Wolff
Title: Introducing PaLaFra – A Project on the Creation and Analysis of an Electronic Corpus of Historical Texts of Old French and Late Latin

10:00-10:30 Coffee Break

10:30-10:50
Speaker: Marija Lazar
Author: Marija Lazar
Title: Crossing “Bridges”: Modelling Inter-Slavic Language Continua in the Corpora of Medieval Legal Texts

10:50-11:10
Speaker: Oleg Borisenko
Author: Julia Normanskaja and Oleg Borisenko
Title: «The analysis of new archival and field data on the Samoyed languages»

11:10-11:30
Speaker: Patrick Brendon Pearson
Authors: Patrick Brendon Pearson, Kristian Heal and Deryle W. Lonsdale
Title: Corpus support for variable second-position clitics in Syriac

11:30-11:50
Speaker: Ergin Öpengin
Author: Ergin Öpengin
Title: Corpus of pre-1800 texts in Kurdish: Inferences on early-modern Kurmanji

12:00-13:30 Lunch Break

13:30-13:50
Speaker: Maria Molina
Author: Maria Molina
Title: SYNTACTICAL ANNOTATION OF HITTITE AND DEVELOPING OF A CORPUS: PROBLEMS AND PRINCIPLES

13:50-14:10
Speaker: Wahed Hemati, Tolga Uslu
Author: Wahed Hemati, Tolga Uslu and Alexander Mehler
Title: TextImager: a UIMA-based System for Processing Documents

14:10-14:30
Speaker: Armin Hoenen
Author: Armin Hoenen
Title: Reconstructing small epigraphic gaps assessing the use of recently created text in historical languages to overcome data sparsity

14:30-15:30
Working Groups I

15:30-16:00 Coffee break

16:00-16:30
Working Groups II

16:30-17:00
Presentation of Results and Plenar Discussion

17:00-18:00
Keynote Talk
Prof. Dr. Anke Lüdeling
A Resource for the Study of Register Development: The RIDGES Corpus of Herbal Texts

18:00-18:15
Closing remarks and farewell


Page Updated: 22-Feb-2016