LINGUIST List 24.1218

Tue Mar 12 2013

Confs: Phonology, Syntax, Text/Corpus Ling, Discipline of Ling/Germany

Editor for this issue: Xiyan Wang <>

Date: 12-Mar-2013
From: Claudia Wegener <>
Subject: Information Structure in Spoken Language Corpora
E-mail this message to a friend

Information Structure in Spoken Language Corpora Short Title: ISSLaC

Date: 10-Jun-2013 - 12-Jun-2013 Location: Bielefeld, Germany Contact: Claudia Wegener Contact Email: < click here to access email > Meeting URL:

Linguistic Field(s): Discipline of Linguistics; Phonology; Syntax; Text/Corpus Linguistics

Meeting Description:

Recent developments in technology have made it possible for linguists to create spoken language corpora on a hitherto unprecedented range of languages, including several lesser-studied languages. For languages without a written tradition, spoken corpora assume an even greater value since they document the only mode of communication. Data obtained from corpora are increasingly used in linguistic research, reflecting a more usage-based orientation on the part of linguists on the one hand, and making analyses verifiable on the other.

Spoken language corpora are promising to be particularly useful to the study of information structure (IS). IS often involves complex correspondences between communicative goals and marking strategies, encompassing prosody, morphology, and syntactic structure, the full range of which can best be observed in naturally occurring data (Brunetti et al: 2011). However, the investigation of IS in spoken corpora still has many methodological obstacles to overcome, ranging from those related to the prosodic analysis of spontaneous speech to those relating to the very identification of IS categories in such spontaneous data. These challenges explain why much research on IS continues to rely on introspection or on experimental research. These techniques are rarely available to linguists working with lesser-known languages: they are usually not native speakers, making introspection impossible; further, many types of experiments are not applicable in non-literate and/or non-western cultural contexts. Thus analysing spoken corpora is the only means to get insights into the encoding of IS in these languages, and indeed it is only through the study of spontaneous data that it is possible to gather inventories of the full range of IS categories and understand how they are employed in discourse.

The goal of this workshop is to discuss both research findings on information structure based on spoken corpora, and methodological issues arising in such investigations, in a cross-linguistic perspective.

This workshop is part of the project ‘Discourse and prosody across language family boundaries: two corpus-based case studies on contact-induced syntactic and prosodic convergence in the encoding of information structure’, funded by DoBeS (Volkswagen-Stiftung Funding Initiative ‘Documentation of Endangered Languages’).

Invited Speakers:

Evangelia Adamou (CNRS - LACITO, Villejuif, France)
Lisa Brunetti (CNRS - LLF & Université Paris-Diderot, France)
Yiya Chen (Leiden University, The Netherlands)
Dejan Matic (Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands)
Amina Mettouchi (CNRS - LLACAN, Villejuif, France)
Antje Muntendam (Radboud University Nijmegen, The Netherlands)
Patrizia Paggio (University of Copenhagen, Denmark; University of Malta)
Arndt Riester (Universität Stuttgart, Germany)
Dina El Zarka (Karl-Franzens-Universität Graz, Austria)
Sabine Zerbian (Universität Potsdam, Germany)


Claudia Wegener, University of Bielefeld
Candide Simard, SOAS, London
Eva Schultze-Berndt, University of Manchester

A preliminary program is now online on the workshop website:

Anyone interested in the topic is welcome to attend. Attendance is free, but a registration before May 27th is required. The registration form can be found at:

You can also find information on how to get to Bielefeld and where to stay on the workshop website:

Page Updated: 12-Mar-2013