LINGUIST List 26.3238

Thu Jul 09 2015

Confs: Cog Science, Computational Ling, Psycholing, Text/Corpus Ling, Translation/Romania

Editor for this issue: Erin Arnold <>

Date: 09-Jul-2015
From: Verginica Mititelu <>
Subject: Errors by Humans and Machines in Multimedia, Multimodal and Multilingual Data Processing
E-mail this message to a friend

Errors by Humans and Machines in Multimedia, Multimodal and Multilingual Data Processing
Short Title: ERRARE 2015

Date: 12-Sep-2015 - 13-Sep-2015
Location: Sinaia, Romania
Contact: Ioana Vasilescu
Contact Email: < click here to access email >
Meeting URL:

Linguistic Field(s): Cognitive Science; Computational Linguistics; Psycholinguistics; Text/Corpus Linguistics; Translation

Meeting Description:

The workshop will be organized around the topic of errors produced and processed by humans and machines in multimedia, multimodal and multilingual data with a particular focus on spoken language. It distinguishes itself from other conferences addressing these issues by providing a forum for dialogue and exchange between researchers working in linguistics, including psycho- and neurolinguistics, on the one hand, and researchers in computer science, machine learning and multimedia speech and language processing, on the other hand.

For this interdisciplinary workshop, we would like to gather these different communities around the issues of variation, ambiguity and errors in speech and language. The purpose of this workshop is to share interdisciplinary expertise on a heterogeneous phenomenon referred to as “variation” and “ambiguity” in some domains and as “errors” in others. Researchers are invited to share their thoughts and observations through case studies run in the context of various initiatives.

A large panel of research areas shares a common object of study: human language. These areas encompass historically well-established research communities: classical humanities and social sciences (phonetics, phonology, psycholinguistics, etc.), and more recent domains of the sciences (brain and computer science). Research objectives include analyzing, modeling, understanding and theorizing the human processing of speech variation. For linguists and psycholinguists variation in speech involves some matching process between variable surface forms and stable underlying forms: in such a framework errors may naturally arise as mismatches occurring at the interface of surface and underlying representations. Yet by which mechanisms errors may arise and how to interpret the patterning of errors within theoretical models of speech production and perception has been a matter of controversy. Speech error research in recent years has particularly highlighted the fuzzy boundary between the concepts of 'variability', ambiguity' and 'error'.

Research activities most often include corpora consisting of various types of recorded speech from controlled (laboratory) speech to large scale data. Such corpora may be a result of a variety of capturing techniques from standard audio recordings to multi-sensor capturing of either articulation gestures or brain activities. Errors can also be envisioned as a result of noisy data capturing conditions.

Sharing experience with errors, variation and ambiguity is expected to produce beneficial insights for the different communities:

Concerning humanities, variation and ambiguity are central to the different branches of linguistics. Furthermore, human production and perception errors challenge the existing language acquisition, production and perception models.

For automatic speech and language processing, residual errors indicate regions which escape current modeling capacities. In-depth analyses in collaboration with linguists, psycholinguists and speech scientists may contribute to a better understanding of these phenomena and to the proposal of innovative strategies.

Brain sciences, a recent rapidly evolving research area, open new opportunities and the study of errors can contribute to reveal the hidden organization of the brain.


Tentative Program Workshop ERRARE 2015, 11-14 September 2015

Friday, September 11

Afternoon and evening arrival of participants at the Mara hotel in Sinaia (1h30 from Bucharest-Otopeni Airport)

Welcome Dinner at the Mara Hotel

Day 1: Saturday, September 12

Welcome/Presentation of the Workshop

Session #1
Variation and speech: analysis and modeling of errors in speech-to-text applications

Keynote C. Burileanu
“The Romanian ASR System developed at “Speech & Dialogue” (“SpeeD”) Research Laboratory - University POLITEHNICA of Bucharest. Under-resourced languages challenges for an ASR system”

10:15-10:30 Break

D. Telaar, J. Weiner, T. Schulz
“Error signatures to identify Errors in ASR in an unsupervised fashion”

M. A. Ben Jannet, M. Adda-Decker, O. Galibert, S. Rosset
“How to evaluate ASR errors impact on NER?”

H. Cucu, A. Buzo, C. Burileanu
“ASR errors in transcribing informal pronunciations of Romanian numbers”

S. Ghannay, N. Camelin, Y. Estève
“Which ASR errors are hard to detect?”

12:30-13:45 Lunch

Session #2
From Variation to Errors in Speech Production and Perception

Keynote R. Hartsuiker
“Conflict monitoring in speech production”

S. Marin, M. Pouplier
“Spontaneously occurring speech errors in German: BAS corpora analysis”

C. Dutrey, M. Adda-Decker, F. Santiago
“Large-scale prosodic-acoustic analysis of ASR errors in French”

15:45-16:00 Break

Keynote M. Zock
“« Errare humanum est ». Refusing to « appreciate » this fact could be a big mistake!”

M. Ernestus, L. ten Bosch
“Modeling language-learners’ errors in understanding casual speech”

I. Chitoran, J. Hualde
“Articulatory undershoot - from misperception to historical sound change”

Gala Dinner

Day 2: Sunday, September 13

Peles Castle Visit (Departure of bus from Mara Hotel at 9:00 am)

Session #3
Errors and dialogue: issues in Human/Human and Human/Machine Interaction

Keynote F. Lefèvre
“Uncertainty management in vocal interaction interfaces, seamlessly modeling and handling errors in interactive processes”

R. Higashinaka, K. Funakoshi, M. Mizukami, H. Tsukahara, Y. Kobayashi, M. Araki
“Analyzing dialogue breakdowns in chat-oriented dialogue systems”

12:30-13:45 Lunch

L. Devillers, S. Rosset
“The communication accommodation of the machine to deal with errors”

A. I. Niculescu, R. Banchs
“Strategies to cope with errors in human-machine speech interactions: using chatbots as back-off mechanism for task-oriented dialogues”

14:45-15:00 Break

Session #4
Errors from multiple sources (syntax, discourse, multimedia...)

Keynote M. Turchi
“Working with human post-edited sentences: old challenges, new solutions.”

M. Kai
“Truncated Speech and Its Repair”

F. Santiago, M. Adda-Decker, C. Dutrey
“Towards a Typology of ASR Errors via Syntax-Prosody Mapping”

17:00-17:15 Break

V. Barbu Mititelu, E. Irimia
“Types of errors in the automatic syntactic parsing of Romanian”

D. Charlet, J. Poignant, H. Bredin, C. Fredouille, S. Meignier
“What Makes a Speaker Recognizable in TV Broadcast? Going Beyond Speaker Identification Error Rate”

Final Discussion

18:45 -19:15
Farewell Drink

Dinner at the Mara hotel

Monday, 14 of September

Departure of Bus #1 from Mara Hotel to Otopeni Airport (1h30m)

Departure of Bus#2 from Mara Hotel to Dracula’s Castle (1h)

Visit of Dracula’s Castle (1h)

11:00–12:00 Picnic Lunch

Departure of Bus #2 from Dracula’s Castle to Otopeni Airport (2h30m)

Registration is now open at

Page Updated: 09-Jul-2015