* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *


LINGUIST List 23.1640

Sat Mar 31 2012

Calls: Computational Ling, Text/Corpus Ling, Ling & Lit/Portugal

Editor for this issue: Alison Zaharee <alisonlinguistlist.org>


LINGUIST is pleased to announce an exciting service: Easy Abstracts! Easy Abs is a free abstract submission and review facility designed to help conference organizers and reviewers accept and process abstracts online. Just go to: http://www.linguistlist.org/confcustom, and begin your conference customization process today! With Easy Abstracts, submission and review will be as easy as 1-2-3!
Date: 30-Mar-2012
From: Marco Passarotti <marco.passarottiunicatt.it>
Subject: Annotation of Corpora for Research in the Humanities
E-mail this message to a friend

Full Title: Annotation of Corpora for Research in the Humanities
Short Title: ACRH-2

Date: 29-Nov-2012 - 29-Nov-2012
Location: Lisbon, Portugal
Contact Person: Marco Passarotti
Meeting Email: < click here to access email >
Web Site: http://alfclul.clul.ul.pt/crpc/acrh2/index.html

Linguistic Field(s): Computational Linguistics; Ling & Literature; Text/Corpus Linguistics

Call Deadline: 02-Sep-2012

Meeting Description:

The second edition of the workshop on 'Annotation of Corpora for Research in the Humanities' (ACRH-2) will be held on November 29, 2012 at the University of Lisbon (Portugal) (http://alfclul.clul.ul.pt/crpc/acrh2/index.html).

The workshop will be co-located with the 11th International Workshop on Treebanks and Linguistic Theories (TLT-11), which will be held on November 30 - December 1, 2012 (http://tlt11.clul.ul.pt/).

Like in its first edition (held in Heidelberg on 5 January, 2012: proceedings available here: http://www.jlcl.org/index.php?modus=aktuelle_ausgabe&language=en), the ACRH workshop aims at building a tighter collaboration between people working in various areas of the Humanities (such as literature, philology, history etc.) and the research community involved in developing, using and making accessible annotated corpora.

Addressing topics related to annotated corpora for research in the Humanities is an interdisciplinary task, which involves corpus and computational linguists (mostly those working in literary computing), philologists, scholars in the Humanities and computer scientists. However, this interdisciplinarity is not fully realised yet. Indeed, philologists and scholars are not used to exploit NLP tools and language resources such as annotated corpora; in turn, computational linguists are more prone to develop language resources for NLP purposes only.

For instance, although many corpora that play a relevant role for research in Humanities are today available in digital format (theatrical plays, contemporary novels, critical literature, literary reviews etc.), only a few of them are linguistically tagged, while most still lack linguistic tagging at all. Historical corpora are also a case of special interest, since their creation demands a strong interplay between computational linguistics and more traditional scholarship. Over the past few years a number of historical annotated corpora have been started, among which are treebanks for Middle, Early Modern and Old English, Early New High German, Medieval Portuguese, Ugaritic, Latin, Ancient Greek and several translations of the New Testament into Indo-European languages. The experience of these ever-growing groups of projects can provide many suggestions on the methodology as well as on the practice of interaction between literary studies, philology and corpus linguistics.

We believe that a tighter collaboration between people working in the Humanities and the research community involved in developing annotated corpora is now needed because, while annotating a corpus from scratch still remains a labor-intensive and time-consuming task, today this is simplified by intensively exploiting prior experience in the field. Indeed, such a collaboration is still quite far from being achieved, as a gap still holds between computational linguists (who sometimes do not involve humanists in developing and exploiting annotated corpora for the Humanities) and humanists (who sometimes just ignore that such corpora do exist and that automatic methods and standards to build them are today available).

Invited Speaker:

Martin Wynne (University of Oxford, UK)

Call for Papers:

Submissions are invited for oral presentations and posters (with or without demonstrations) featuring high quality and previously unpublished research on the topics described below. Contributions should focus on results from completed as well as ongoing research, with an emphasis on novel approaches, methods, ideas, and perspectives, whether descriptive, theoretical, formal or computational.

Proceedings will be published in time for the workshop by the Centro de Linguística da Universidade de Lisboa (CLUL). Publication will be online only.

Topics:

To overcome the above mentioned issues, ACRH-2 aims at covering a wide range of topics related to the annotation of corpora for research in the Humanities.

The topics to be addressed in the workshop include (but are not limited to) the following:

- Specific issues related to the annotation of corpora for research in the Humanities
- Annotated corpora as a basis for research in the Humanities
- Diachronic, historical and literary annotated corpora
- Use of annotated corpora for stylometrics and authorship attribution
- Philological issues, like different readings, textual variants, apparatus, non-standard orthography and spelling variation
- Annotation principles and schemes of corpora for research in the Humanities
- Adaptation of NLP tools for older language varieties. Specific features of tools for accessing and retrieving annotated corpora to address various research topics in the Humanities
- Examples of fruitful collaboration between Computational Linguistics and Humanities in building and exploiting annotated corpora

Important Dates:

Deadlines: always midnight, UTC ('Coordinated Universal Time'), ignoring DST ('Daylight Saving Time'):

Deadline for paper submission: September 2, 2012
Notification of acceptance: October 7, 2012
Final version of paper for workshop proceedings: October 28, 2012
Workshop: November 29, 2012

Instructions for Submission:

We invite to submit full papers describing original, unpublished research related to the topics of the workshop. Papers should not exceed 12 pages.

The language of the workshop is English. All papers must be submitted in well-checked English.

Papers should be submitted in PDF format only. Submissions have to be made via the EasyChair page of the workshop at https://www.easychair.org/conferences/?conf=acrh2. Please, first register at EasyChair if you do not have an EasyChair account.

The style guidelines follow the specifications required by TLT. They can be found here:

http://alfclul.clul.ul.pt/crpc/acrh2/submission.html

Please, note that as reviewing will be double-blind, the papers should not include the authors' names and affiliations or any references to web-sites, project names etc. revealing the authors' identity. Furthermore, any self-reference should be avoided. For instance, instead of 'We previously showed (Brown, 2001)...', use citations such as 'Brown previously showed (Brown, 2001)...'. Each submitted paper will be reviewed by three members of the program committee.

Submitted papers can be for oral or poster presentations (with or without demo). There is no difference between the different kinds of presentation both in terms of reviewing process and publication in the proceedings (the limit of 12 pages holds for both oral and poster presentations).

Oral Presentation:

The oral presentations at the workshop will be 30 minutes long (25 minutes for presentation and 5 minutes for questions and discussion).

Program Committee Chairs:

Francesco Mambrini (University of Cologne, Germany)
Marco Passarotti (Università Cattolica del Sacro Cuore, Milan, Italy)
Caroline Sporleder (Saarland University, Saarbrücken, Germany)

Program Committee Members:

David Bamman (USA)
Gabriel Bodard (UK)
Lars Borin (Sweden)
Antonio Branco (Portugal)
Helma Dik (USA)
Milena Dobreva (Malta)
Anette Frank (Germany)
Dag Haug (Norway)
Erhard Hinrichs (Germany)
Beáta Megyesi (Sweden)
Martha Nell Smith (USA)
Petya Osenova (Bulgaria)
Martin Reynaert (the Netherlands)
Victoria Rosén (Norway)
Jeff Rydberg Cox (USA)
Melissa Terras (UK)
Manfred Thaller (Germany)
Martin Volk (Switzerland)

Local Organization:

Amalia Mendes
Iris Hendrickx
Sandra Antunes
Aida Cardoso
Sandra Pereira

All CLUL, University of Lisbon, Portugal



Read more issues|LINGUIST home page|Top of issue



Page Updated: 31-Mar-2012

Supported in part by the National Science Foundation       About LINGUIST    |   Contact Us       ILIT Logo
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed on its pages, it cannot vouch for their contents.