LINGUIST List 14.765

Mon Mar 17 2003

Calls: Discourse & Dialogue/Empirical Methods in NLP

Editor for this issue: Marie Klopfenstein <>

FUND DRIVE 2003 Thanks to the generosity of our subscribers we were able to reach the $10,000 mark in record time. Because of this we were able to secure a $1,500 donation from John Benjamins Publishing - a big thank you to all the crew at JBs! But we still have a long way to go! Please help us reach our total of $50,000 by making a donation at: The LINGUIST List depends on the generous contributions from subscribers like you; we would not be able to operate without your help. The moderators, staff, and student editors at LINGUIST would like to take this opportunity to thank you for your continuous support. As a matter of policy, LINGUIST discourages the use of abbreviations or acronyms in conference announcements unless they are explained in the text. To post to LINGUIST, use our convenient web form at


  1. Alex Rudnicky, 4th SIGdial Workshop on Discourse and Dialogue
  2. Priscilla Rasmussen, Empirical Methods in Natural Language Processing

Message 1: 4th SIGdial Workshop on Discourse and Dialogue

Date: Thu, 13 Mar 2003 09:55:17 -0500
From: Alex Rudnicky <>
Subject: 4th SIGdial Workshop on Discourse and Dialogue

Please note that the deadline for submissions to the 2003 SIGdial
Workshop in Sapporo has been extended to March 27th, 2003.

The amended Call for Papers follows:

	 4th SIGdial Workshop on Discourse and Dialogue

		 Sapporo, July 5 and 6, 2003

 (immediately preceding the 41st annual meeting of the ACL)

Continuing with a series of successful workshops in Hong Kong, Aalborg
and Philadelphia, this workshop spans the ACL and ISCA SIGdial
interest area of discourse and dialogue. This series provides a
regular forum for the presentation of research in this area to both
the larger SIGdial community as well as researchers outside this
community. The workshop is organized by SIGdial, which is sponsored
jointly by ACL and ISCA.


We welcome formal, corpus-based, implementational and analytical work
on discourse and dialogue, with a focus on the following three themes:

(1) Dialogue Systems
Spoken, multi-modal, and text/web based dialogue systems including
topics such as:
* Dialogue management models (specific sub-problems or general
modeling, in particular models for mixed initiative and user-adaptive
* Speech and gesture, text and graphics integration (for understanding
or generation);
* Context-based interpretation of dialogues and/or response planning,
in particular how this contributes to natural interaction;
* Strategies for preventing, detecting or handling miscommunication
(repair and correction types, clarification and under-specificity,
grounding and feedback strategies);
* Utilizing prosodic information for understanding and for
* Contrasts between task-driven and conversational dialogue.

(2) Corpora, Tools and Methodology
Corpus-based work on discourse and spoken, text-based and multi-modal 
dialogue including its support, in particular:
* Issues and problems in discourse and dialogue annotation;
* Annotation tools and coding schemes; 
* Data resources for discourse and dialogue studies;
* Corpus-based techniques and analysis (including machine learning);
* Tools (XML-based and other) for dialogue system building;
* Evaluation of dialogue systems, including methodology, metrics and 
case studies.

(3) Pragmatic and/or Semantic Modeling
The pragmatics and/or semantics of discourse and dialogue (i.e.,
beyond a single sentence) including the following issues:
* The semantics/pragmatics of dialogue acts (including those which are
less studied in the semantics/pragmatics framework);
* Incremental (plan-based, topic-based, etc.) models of
discourse/dialogue structure integrating referential and relational
* Modeling genre-specific aspects of discourse and dialogue structure,
including the specific structural aspects of (interactive) digital
* Prosody in discourse and dialogue;
* Modeling politeness and non-recursive parts of discourse and
* Models of presupposition and accommodation; operational models of
conversational implicature.


The program committee welcomes the submission of long papers for full
plenary presentation as well as short papers and demonstrations. Short
papers and demo descriptions will be featured in short plenary
presentations, followed by posters and demonstrations.

* Long papers must be no longer than 10 pages, including title page,
examples, references, etc. In addition to this, two additional pages
are allowed as an appendix which may include extended example
discourses or dialogues, algorithms, graphical representations, etc.
* Short papers and demo descriptions should aim to be 5 pages or less
(including title page, examples, references, etc.)

Authors are encouraged to make illustrative materials available, on
the web or otherwise. For example, excerpts of recorded conversations,
recordings of human-computer dialogues, interfaces to working systems,

Both long papers and short papers should be sent electronically to the
e-mail address: and must be received no later
than March 27th. The format to use for papers and abstracts is the
same (and is the 2003 ACL final paper format). Style files and
additional instructions are available at

Papers must be submitted in pdf format.

The title page (no separate title page is needed) should include the
following information:
 Authors' names, affiliations, and email addresses;
 Abstract (short summary up to 5 lines);


Submission March 27, 2003
Notification April 28, 2003
Final submissions May 23, 2003
Workshop	 July 5-6, 2003

Workshop website: 
Sigdial website:
ACL website:

All accepted papers will be published in the workshop proceedings and
will subsequently be available on the SIGdial web site.

Alexander Rudnicky, Carnegie Mellon University (co-chair),
Syun Tutiya, Chiba University (co-chair),
Donna Byron (Ohio State University)
Phil Cohen (Oregon Health University/OGI)
Nils Dahlbeck (Link�pings universitet)
Yasuharu Den (Chiba University)
Joakim Gustafson (Telia)
Masato Ishizaki (JAIST)
Yasuhiro Katagiri (ATR MIC)
Masahito Kawamori (Nippon Telegraph and Telephone Co.)
Andreas Kellner (Philips)
Ali Knott (Otago University)
Ivana Kruijff-Korbayova (Universit�t des Saarlandes)
Tomoko Kumagai (National Institute for Japanese Language)
Alex Lascarides (University of Edinburgh)
Lin-Shan Lee (National Taiwan University)
Oliver Lemon (Stanford University)
Wolfgang Minker (University of Ulm)
Mikio Nakano (Nippon Telegraph and Telephone Co.)
Shrikanth Narayanan (USC)
Roberto Pieraccini (SpeechWorks Int.)
Massimo Poesio (University of Edinburgh)
Alexandros Potamianos (Technical University of Crete)
Norbert Reithinger (DFKI)
Laurent Romary (LORIA)
Yoshinori Sagisaka (Waseda University)
Candace L. Sidner (MERL)
Michael Strube (European Media Laboratory)
Jan Wiebe (Univ. of Pittsburgh)
Bo Xu (Chinese Academy of Science)

Akira Kurematsu, University of Electro-Communications (general chair),
Alexander Rudnicky, Carnegie Mellon University, 
Syun Tutiya, Chiba University,
Laila Dybkj�r, University of Southern Denmark, 
David Traum, USC Institute for Creative Technologies, 
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

Message 2: Empirical Methods in Natural Language Processing

Date: Fri, 14 Mar 2003 17:07:26 EST
From: Priscilla Rasmussen <>
Subject: Empirical Methods in Natural Language Processing

 2003 Conference on Empirical Methods in Natural Language Processing
 (EMNLP 2003) 

 July 11-12
 Sapporo, Japan


SIGDAT, the Association for Computational Linguistics' special
interest group on linguistic data and corpus-based approaches to NLP,
invites submissions to EMNLP 2003. The conference will be held on
July 11-12 in Sapporo, Japan, immediately following the 41st meeting
of the ACL (ACL 2003).


We are interested in papers from academia, government, and industry on
all areas of traditional interest to the SIGDAT community and aligned
fields, including but not limited to:

- information extraction
- information retrieval
- language and dialogue modeling
- lexical acquisition
- machine translation
- multilingual technologies
- question answering
- statistical parsing
- summarization
- generation
- tagging
- term and named entity extraction
- word sense disambiguation
- word, term, and text segmentation
- general NLP-related machine learning techniques: 
 theory, methods and algorithms

In addition to providing a general forum, the theme for this year is

 "Combining Deep Analysis and Statistical Methods for NLP"

We solicit papers that describe attempts to apply statistical analysis
to "deeper" representations of natural language than existing methods.
Some illustrative examples are:

- Statistical approaches to dialogue systems.

- Statistical approaches to generation.

- Corpus-based approaches to machine translation which make use of
deeper analyses than the original IBM alignment models.

- Empirical and statistical work on Natural Language "Understanding".

- Statistical processing of syntactic formalisms which give detailed
analyses, such as LFG, CCG, HPSG, or TAG.

- Extraction of logical form/predicate-argument structure, or
information extraction that goes beyond named entities to relations
between entities.

- Work connecting statistical NLP to research in probabilistic
reasoning and representation in AI, for example reinforcement learning
and belief networks.


Submissions must describe original, completed, unpublished work, and
include concrete evaluation results when appropriate. Papers being
submitted to other meetings must provide this information (see
submission format). In the event of multiple acceptances, authors are
requested to immediately notify the EMNLP program chair
( and to choose which meeting to present and
publish the work at as soon as possible. EMNLP cannot accept for
publication or presentation work that will be (or has been) published


Submissions should take the form of full papers (up to 8 pages in
two-column format). Authors are strongly encouraged to use the style
files originally provided for ACL 2003. This includes a LaTeX style
file acl03.sty, which is intended for use with the acl.bst BibTeX
bibliography style. An MS Word document template <> is also
available. The style files are available from the online version of
this CFP, at

*We strongly prefer submissions to be as PS files.* Any author who
submits in PDF must assume the responsibility for ensuring that fonts
are treated properly so that the paper will print (not just view)
anywhere. (This may involve reading the manual.) DOC/RTF formats
cannot be accepted.

Reviewing will be blind. No information identifying the authors should
be in the paper: this includes not only the authors' names and
affilations, but also self-references that reveal authors' identities;
for example, "We have previously shown (Smith 1999)" should be changed
to "Smith (1999) has previously shown". A separate identification
email is required: see below.


First, an email with submission details is required. Please send an
email to (subject line EMNLP 2003 submission) by
April 4 with the following information:

 Paper title
 Authors' names, affiliations, and email addresses
 Contact author's email address
 A short list of keywords
 A short (no more than 5 lines) summary of the contents
 Whether or not the paper is under consideration for other
 conferences (please specify)

Second, an electronic version of the paper in PostScript format, named
<contact-author-email>.ps must be received by April 4, 23:00 GMT (6pm
EDT) at The paper should be formatted for A4 or
letter-size paper. It should not include information which identifies
the author(s). Please use gzip or plain old zip (or PKZIP) for
compression to ensure nothing is lost during the email transfer.

In case of difficulties sending the PostScript version, please
generate a PDF format instead (and name it accordingly:
<contact-author-email.pdf). Let us stress again that the PostScript
format is nevertheless strongly preferred.

Only in case of really, *REALLY* unsolvable difficulties in sending
the electronic version, please send a single hardcopy of the paper and
the ID page to

 EMNLP 2003 Submissions
 Michael Collins
 MIT Artificial Intelligence Laboratory
 Room NE43-723
 200 (545) Technology Square
 MIT Building NE43
 Cambridge, MA 02139 

The EMNLP committee is not responsible for postal delays or other
e-mail and mail problems. Submissions that do not conform to the
guidelines above are subject to rejection without review.


Submission deadline: April 4, 2003 
Acceptance notification: May 9, 2003 
Camera-ready copy due: June 6, 2003 
Conference: July 11-12, 2003 


Michael Collins (chair), 

Mark Steedman (co-chair), 
University of Edinburgh,

(As of 14th March 2003: there may be future additions)

Srinivas Bangalore			Kevin Knight
Regina Barzilay				Maria Lapata
Thorsten Brants				Gina-Anne Levow
Chris Brew				Liz Liddy
Eric Brill				Hang Li
Ted Briscoe				Dekang Lin
John Carroll				Lluis Marquez
Eugene Charniak				Helen Meng
Stanley Chen				Paola Merlo
Ken Church				Masaaki Nagata
Stephen Clark				Hwee Tou Ng
Radu Florian				Grace Ngai
Rob Gaizauskas				Franz-Josef Och
Louise Guthrie				Miles Osborne
Jan Hajic				Martha Palmer
Sanda Harabagiu				Lance Ramshaw
Marti Hearst				Eric Ringger
James Henderson				Brian Roark
John Henderson				Erik Tjong Kim Sang
Mark Hepple				Anoop Sarkar
Rebecca Hwa				Richard Sproat
Hitoshi Iida				Paola Velardi
Mark Johnson				David Weir

Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue