Publishing Partner: Cambridge University Press CUP Extra Publisher Login

The LINGUIST List is dedicated to providing information on language and language analysis, and to providing the discipline of linguistics with the infrastructure necessary to function in the digital world. LINGUIST is a free resource, run by linguistics students and faculty, and supported by your donations. Please support LINGUIST List during the 2017 Fund Drive.

E-mail this page

Conference Information

Full Title: Language Resources and Technologies for Processing and Linking Historical Documents

Short Title: LRT4HDA
Location: Reykjavik, Iceland
Start Date: 26-May-2014 - 26-May-2014
Contact: Cristina Vertan
Meeting Email: click here to access email
Meeting URL:
Meeting Description: Recently, collaboration between the NLP community and specialists in various areas of the Humanities has become more efficient and
fruitful due to the common aim of exploring and preserving cultural heritage data. It is worth mentioning the efforts made during the
digitisation campaigns in the last years and within a series of initiatives in the Digital Humanities, especially in making old manuscripts
available through Digital Libraries.

Given the number of contemporary languages and their historical variants, it is practically impossible to develop brand new language
resources and tools for processing older texts. Therefore, the real challenge is to adapt existing language resources and tools, as well as
to provide (where necessary) training material in the form of corpora or lexicons for a certain period of time in history.

Another issue regarding historical documents is their usage after they are stored in digital libraries. Historical documents are not only
browsed but together with adequate tools they may serve as basis for re-interpretation of historical facts, discovery of new connections,
causal relations between events etc. In order to be able to make such analysis, historical documents should be linked among
themselves, on the one hand, and with modern knowledge bases, on the other. Activities in the area of Linked Open Data (LOD) play a
major role in this respect.

A particular type of historical documents are the newspaper collections and archives. Newspapers reflect what is going on in society, and
constitute a rich data collection for many types of humanities research, ranging from history, political and social sciences to linguistics,
both synchronic and diachronic, and both national and cross-national. They represent an important resource for analysis of changes at all
levels which emerged in Europe with begin of the industrialization period.

The aim of this workshop is to bring together researchers working in the interdisciplinary domain of cultural heritage, specialists in natural
language and speech processing working with less-resourced languages as well as key players among Linked Open Data initiatives.
They are expected to analyse problems and brainstorm solutions in the automatic analysis of historical documents, uni- or multimedia,
their deep annotation and interlinking.

The workshop is organised in collaboration with CLARIN (
Linguistic Subfield: Computational Linguistics; Historical Linguistics; Ling & Literature; Text/Corpus Linguistics
LL Issue: 25.670

This is a session of the following meeting:
Language Resources and Evaluation Conference

Calls and Conferences main page