LINGUIST List 25.1401

Mon Mar 24 2014

Confs: Computational Linguistics/Sweden

Editor for this issue: Xiyan Wang <>

Date: 23-Mar-2014
From: Agata Savary <>
Subject: EACL 2014 Workshop on Multiword Expressions
E-mail this message to a friend

EACL 2014 Workshop on Multiword Expressions
Short Title: MWE 2014

Date: 26-Apr-2014 - 27-Apr-2014
Location: Gothenburg, Sweden
Contact: Valia Kordoni
Contact Email: < click here to access email >
Meeting URL:

Linguistic Field(s): Computational Linguistics

Meeting Description:

The 10th Workshop on Multiword Expressions (MWE 2014)
Workshop at EACL 2014 (Gothenburg, Sweden), April 26-27, 2014

Endorsed by the Special Interest Group on the Lexicon of the Association for Computational Linguistics (SIGLEX;; SIGLEX’s Multiword Expressions Section (SIGLEX-MWE;; and PARSEME, European IC1207 COST Action (

Under the denomination 'multiword expression', one assumes a wide range of linguistic constructions such as idioms (storm in a teacup, sweep under the rug), fixed phrases (in vitro, by and large, rock'n roll), noun compounds (olive oil, laser printer), compound verbs (take a nap, bring about), etc. While easily mastered by native speakers, their interpretation poses a major challenge for computational systems, due to their flexible and heterogeneous nature.

For a start, MWEs are not nearly as frequent in NLP resources as they are in real-word text, and this problem of coverage may impact the performance of many NLP tasks. Moreover, treating MWEs also involves problems like determining their semantics, which is not always compositional (to kick the bucket meaning to die). In sum, MWEs are a key issue and a current weakness for natural language parsing and generation, as well as real-life applications depending on language technology, such as machine translation, just to name a prominent one among many.

Thanks to the joint efforts of researchers from several fields working on MWEs, significant progress has been made in recent years, especially concerning the construction of large-scale language resources. For instance, there is a large number of recent papers that focus on acquisition of MWEs from corpora, and others that describe a variety of techniques to find paraphrases for MWEs. Current methods use a plethora of tools such as association measures, machine learning, syntactic patterns, web queries, etc. A considerable body of techniques, resources and tools to perform these tasks are now available, and are indicative of the growing importance of the field within the NLP community.

Many of these advances are described as part of the annual workshop on MWEs, which attracts the attention of an ever-growing community working on a variety of languages and MWE types. The workshop has been held since 2001 in conjunction with major computational linguistics conferences (ACL, EACL, NAACL, COLING, LREC), providing an important venue for the community to interact, share resources and tools and collaborate on efforts for advancing the computational treatment of MWEs. Additionally, special issues on MWEs have been published by leading journals in computational linguistics. The latest such effort is the special issue on “Multiword Expressions: from Theory to Practice and Use”, which has recently been published by the ACM Transactions on Speech and Language Processing.


Saturday, April 26, 2014

Opening Remarks

Oral Session 1: Detection and Extraction of MWEs

Breaking Bad: Extraction of Verb-Particle Constructions from a Parallel Subtitles Corpus
Aaron Smith

A Supervised Model for Extraction of Multiword Expressions, Based on Statistical Context Features
Meghdad Farahmand and Ronaldo Martins

Oral Session 2: PARSEME I – Parsing MWEs

VPCTagger: Detecting Verb-Particle Constructions with Syntax-Based Methods
István Nagy T. and Veronika Vincze

10:30–11:00 Coffee Break

Invited Talk 1: Preslav Nakov - Title 'The Web as an Implicit Training Set: Application to Noun Compounds Syntax and Semantics'

Oral Session 2: PARSEME I – Parsing MWEs (continued)

The Relevance of Collocations for Parsing
Eric Wehrli

12:30–14:00 Lunch

Oral Session 3: Short papers – PARSEME II

Parsing Modern Greek Verb MWEs with LFG/XLE grammars
Niki Samaridi and Stella Markantonatou

Evaluation of a Substitution Method for Idiom Transformation in Statistical Machine Translation
Giancarlo Salton, Robert Ross and John Kelleher

Encoding MWEs in a Conceptual Lexicon
Aggeliki Fotopoulou, Stella Markantonatou and Voula Giouli

Poster Booster Session (4 minutes per poster)

German Compounds and Statistical Machine Translation. Can they get along?
Carla Parra Escartín, Stephan Peitz and Hermann Ney

Extracting MWEs from Italian Corpora: A Case Study for Refining the POS-pattern Methodology
Sara Castagnoli, Malvina Nissim and Francesca Masini

Mickey Mouse Is Not a Phrase: Improving Relevance in E-Commerce with Multiword Expressions
Prathyusha Senthil Kumar, Vamsi Salaka, Tracy Holloway King and Brian Johnson

Encoding of Compounds in Swedish FrameNet
Karin Friberg Heppin and Miriam R L Petruck

Extraction of Nominal Multiword Expressions in French
Marie Dubremetz and Joakim Nivre

Towards an Empirical Subcategorization of Multiword Expressions
Luigi Squillante

Contexts, Patterns, Interrelations - New Ways of Presenting Multi-word Expressions
Kathrin Steyer and Annelen Brunner

Detecting Change and Emergence for Multiword Expressions
Martin Emms and Arun Jayapal

An Approach to Take Multi-word Expressions
Claire Bonial, Meredith Green, Jenette Preciado and Martha Palmer

15:30–16:00 Coffee Break

Poster Session

Sunday, April 27, 2014

Invited Talk 2: TBA

10:30–11:00 Coffee Break

Oral Session 5: Short papers – MWEs in Multilingual Applications

Paraphrasing Swedish Compound Nouns in Machine Translation
Edvin Ullman and Joakim Nivre

Feature Norms of German Noun Compounds
Stephen Roller and Sabine Schulte im Walde

Identifying Collocations Using Cross-lingual Association Measures
Lis Pereira, Elga Strafella, Kevin Duh and Yuji Matsumoto

Oral Session 6: Issues in Lexicon Construction and Machine Translation

Unsupervised Construction of a Lexicon and a Repository of Variation Patterns for Arabic Modal Multiword Expressions
Rania Al-Sabbagh, Roxana Girju and Jana diesner

12:30–14:00 Lunch

Oral Session 6: Issues in Lexicon Construction and Machine Translation (continued)

Issues in Translating Verb-Particle Constructions from German to English
Nina Schottmüller and Joakim Nivre

Invited Talk 3: Ekaterina Shutova - Title: 'Statistical Modelling of Metaphor'

Closing remarks

Page Updated: 24-Mar-2014