LINGUIST List 22.5089
Sat Dec 17 2011
Calls: Syntax, Computational Ling, Text/Corpus Ling/Canada
Editor for this issue: Alison Zaharee
1. Ines Rehbein, Workshop on Syntactic Analysis of Non-Canonical Language
Message 1: Workshop on Syntactic Analysis of Non-Canonical Language
From: Ines Rehbein <irehbeinuni-potsdam.de>
Subject: Workshop on Syntactic Analysis of Non-Canonical Language
Full Title: Workshop on Syntactic Analysis of Non-Canonical Language
Short Title: SANCL 2012
Date: 08-Jun-2012 - 08-Jun-2012
Location: Montreal, Quebec, Canada
Contact Person: Ines Rehbein
Web Site: https://sites.google.com/site/sancl2012
Linguistic Field(s): Computational Linguistics; Syntax; Text/Corpus Linguistics
Call Deadline: 26-Mar-2012
SANCL 2012 - NAACL-HLT Workshop on Syntactic Analysis of Non-Canonical Language
The first Workshop on Syntactic Analysis of Non-Canonical Language will be held in conjunction with the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2012), which will take place June 3-8, 2012, in Montreal, Canada.
The SANCL workshop aims to provide a forum for all researchers interested in syntactic analysis and parsing of language that is 'non-canonical'. By that term we mean structures with characteristics deviating from the standard written form of the language. Spoken language is a case in point, as are the language of social media, computer-mediated communication in general, the interlanguage produced by language learners, and historical data. All of these pose challenges both for parsing models trained on edited newspaper text and for the theoretical analysis of these structures.
Call for Papers:
Scope and Topics:
We aim to encourage a cross-fertilisation of ideas amongst researchers working on different but related problems, such as:
- What is the best strategy for parsing non-canonical language?
- Should we treat parsing of non-canonical language as a problem of robustness or domain adaptation?
- Or would it be better to develop new training data sets addressing the particular properties of the data?
- What are the pros and cons of a one-size-fits-all annotation approach and of applying annotation schemes developed for standard written text to non-canonical data?
- Can insights gained from parsing one type of non-canonical text help in parsing another?
- What are the challenges of handling the often heterogeneous nature of the data (e.g. code-switching)?
- What role does pre-processing play in the parsing of non-canonical data?
- To what extent is it necessary or desirable to perform full parsing for some kinds of non-canonical text?
- From a theoretical perspective, what are the appropriate analyses for non-canonical structures?
- How should new linguistic forms emerging from social media be analysed, e.g. the use of hashtags on Twitter?
- What is the optimal unit of analysis?
- For non-sentential units (frequent in spoken language) and especially for elliptical utterances: what kind of information is necessary for a meaningful analysis? Depending on the application, categories like 'NP' or 'PP' might not be sufficient.
Contributions to the workshop should address the adequate syntactic representation as well as the unit of analysis for the task at hand. We welcome both theoretical and practical contributions for any grammatical framework, any parsing approach and any language.
Authors are invited to submit long or short papers on original, unpublished work addressing these (or related) topics. Long papers may consist of up to 8 pages of content plus 2 extra pages for references; short papers may consist of up to 4 pages of content including references. Papers should be formatted according to the NAACL 2012 guidelines (for more information, please visit http://www.naaclhlt2012.org/conference/conference.php).
As the reviewing will be blind, the paper must not include the authors' names and affiliations. Furthermore, self-references that reveal the author's identity, e.g., 'We previously showed (Smith, 1991) ...', must be avoided. Instead, use citations such as 'Smith previously showed (Smith, 1991) ...'. Papers that do not conform to these requirements will be rejected without review. In addition, please do not post your submissions on the web until after the review process is complete.
Papers that have been or will be submitted to other meetings or publications must indicate this at submission time. Please visit the workshop web page (https://sites.google.com/site/sancl2012) for more details.
The SANCL 2012 workshop will host the first shared task on parsing English web text organised by Google. A session in the workshop will be devoted to presenting and discussing the results of this shared task. For more details, please visit:
Important Dates:
March 26, 2012: Paper submission deadline
April 23, 2012: Notification of acceptance
May 4, 2012: Camera-ready deadline
June 8, 2012: SANCL workshop at NAACL-HLT 2012
Workshop Organizers:
Ozlem Cetinoglu (IMS Stuttgart, Germany)
Jennifer Foster (NCLT, DCU, Ireland)
Ines Rehbein (Potsdam University, Germany)
Shared Task Organizers:
Slav Petrov (Google Research, USA)
Ryan McDonald (Google Research, USA)
Program Committee:
Bernd Bohnet (IMS Stuttgart, Germany)
Aoife Cahill (Educational Testing Service, USA)
Marie Candito (University of Paris 7, France)
John Carroll (University of Sussex, UK)
Jinho Choi (University of Colorado at Boulder, USA)
Eric de la Clergerie (INRIA, France)
Markus Dickinson (Indiana University, USA)
Steffi Dipper (University of Bochum, Germany)
Gulsen Eryigit (Istanbul Technical University, Turkey)
Stefan Evert (University of Darmstadt, Germany)
Kim Gerdes (University of Paris 3, France)
Ron Kaplan (Microsoft, USA)
Jonas Kuhn (IMS Stuttgart, Germany)
Sandra Kübler (Indiana University, USA)
Joseph Le Roux (Université Paris-Nord, France)
Anke Lüdeling (Humboldt-University of Berlin, Germany)
David McClosky (Stanford University, USA)
Detmar Meurers (University of Tübingen, Germany)
Joakim Nivre (Uppsala University, Sweden)
Lilja Øvrelid (University of Oslo, Norway)
Brian Roark (Oregon Health & Science University, USA)
Kenji Sagae (University of Southern California, USA)
Djamé Seddah (University of Paris 4, France)
Reut Tsarfaty (Uppsala University, Sweden)
Josef van Genabith (Dublin City University, Ireland)
Heike Zinsmeister (University of Konstanz, Germany)
For general questions about the workshop, please email sancl2012contactgmail.com. For specific questions about the shared task, please email the shared task organizers (parsingthewebgmail.com). Additional information about SANCL 2012 can be found on the workshop web page: https://sites.google.com/site/sancl2012
Page Updated: 17-Dec-2011