* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
LINGUIST List 17.621

Sat Feb 25 2006

Calls: Computational Ling/Australia

Editor for this issue: Kevin Burrows <kevinlinguistlist.org>


As a matter of policy, LINGUIST discourages the use of abbreviations or acronyms in conference announcements unless they are explained in the text. To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.html.
Directory
        1.    Timothy Baldwin, Information Extraction Beyond the Document
        2.    Timothy Baldwin, Multilingual Language Resources and Interoperability


Message 1: Information Extraction Beyond the Document
Date: 22-Feb-2006
From: Timothy Baldwin <tim+colacl2006csse.unimelb.edu.au>
Subject: Information Extraction Beyond the Document


Full Title: Information Extraction Beyond the Document

Date: 22-Jul-2006 - 22-Jul-2006
Location: Sydney, Australia
Contact Person: Timothy Baldwin
Meeting Email: tim+colacl2006csse.unimelb.edu.au
Web Site: http://nlp.shef.ac.uk/result/iebd06/

Linguistic Field(s): Computational Linguistics

Call Deadline: 31-Mar-2006

Call for Papers

COLING/ACL 2006 Workshop

INFORMATION EXTRACTION BEYOND THE DOCUMENT

22nd July 2006, Sydney, Australia

Organisers:
Mary Elaine Califf
(Illinois State University)
Mark A. Greenwood
(University of Sheffield)
Mark Stevenson
(University of Sheffield)
Roman Yangarber
(University of Helsinki)


Traditional approaches to the development and evaluation of
Information Extraction (IE) systems have relied on relatively small
collections of up to a few hundred documents tagged with detailed
semantic annotations. While this paradigm has enabled rapid advances
in IE technology, it remains constrained by a dependence on annotated
documents and does not make use of the information available in large
corpora. Alternative approaches, which make use of large text
collections and inter-document information, are now beginning to
emerge -- as evidenced by a parallel emergence of interest in learning
>From unlabelled data in AI in general. For example, some systems
learn extraction patterns by exploiting information about their
distribution across corpora; others exploit the redundancy of the
internet by assuming that facts with multiple mentions are more
reliable. These approaches require large amounts of unannotated text,
which is generally easy to obtain, and employ unsupervised or
minimally supervised learning algorithms, as well as related
techniques such as co-training and active learning. These alternative
approaches are complementary to the established IE paradigm based on
supervised training, and are now forming a cohesive emergent trend in
recent research. They will constitute the focus of this workshop.

There are several advantages to employing large text collections for
IE.
They provide enormous amounts of training data, albeit mostly
unannotated. Facts can be extracted from, or verified across,
multiple documents. Large text collections often contain vast amounts of
redundancy in the form of multiple references to or mentions of
closely related facts. Redundancy can be exploited in the IE setting to
identify trends and patterns within the text, e.g., by means of Data
Mining techniques.

This workshop invites new, original work on learning extraction rules
or identifying facts across document boundaries while exploiting
sizable amounts of unlabelled text in the training stage, in the
extraction stage, or both. The workshop hopes to bring together
researchers from the various related areas, such as Information
Extraction, Data Mining, biomedical text processing, Question
Answering, Information Retrieval, Machine Learning, identification of
lexical relations (hyponymy, meronymy etc.), multi-lingual text
processing and the Semantic Web. This workshop solicits papers on all
relevant aspects, including algorithms, techniques and applications.

Topics of particular interest include:
- Extraction of information described across documents
- Integration and mutual benefits of IE and Data Mining
- Extraction of information from massive corpora (such as the
Internet)
- Mutual applications and interaction between Information Extraction
and the Semantic Web
- Verification of information using external sources
- Exploiting cross-lingual and multi-lingual approaches for improving
performance in IE

IMPORTANT DATES

Submission Deadline: March 31st, 2006
Notification of acceptance: May 12th, 2006
Camera-ready papers due: May 29th, 2006

SUBMISSION INSTRUCTIONS

Authors are invited to submit original, unpublished work on the topic
areas of the workshop. Submissions should follow the standard
two-column formatting instructions for the main COLING/ACL 2006
conference. Submitted papers should be no longer than eight (8) pages
in length, including references. We strongly recommend the use of the
Latex and Microsoft Word style files which will be available on the
main conference website.

As reviewing will be blind, the paper should not include the authors'
names and affiliations. Furthermore, self-references that reveal the
author's identity, e.g., ''We previously showed (Smith, 1991) ...'',
should be avoided. Instead, use citations such as ''Smith previously
showed (Smith, 1991) ...''.

Submission will be electronic. Details will appear on the workshop web
site (http://nlp.shef.ac.uk/result/iebd06).

Questions regarding the submission procedure should be directed to
Mark Greenwood (markdcs.shef.ac.uk).

WORKSHOP ORGANIZERS

Mary Elaine Califf
School of Information Technology, Illinois State University

Mark A. Greenwood
Department of Computer Science, University of Sheffield

Mark Stevenson
Department of Computer Science, University of Sheffield

Roman Yangarber
Department of Computer Science, University of Helsinki

PROGRAM COMMITTEE

Markus Ackermann (University of Leipzig)
Amit Bagga (AskJeeves)
Roberto Basili (University of Rome, Tor Vergata)
Antal van den Bosch (Tilburg Uniersity)
Neus Catala (Universitat Polithcnica de Catalunya)
Walter Daelemans (University of Antwerp)
Jenny Rose Finkel (Stanford University)
Robert Gaizauskas (University of Sheffield)
Ralph Grishman (NYU)
Takaaki Hasegawa (NTT)
Heng Ji (NYU)
Nick Kushmerick (University College Dublin, Ireland)
Alberto Lavelli (ITK-IRST, Italy)
Gideon Mann (John Hopkin's University)
Ion Muslea (Language Weaver Inc.)
Chikashi Nobata (Sharp, Japan)
Ellen Riloff (University of Utah)
Tony Rose (Cognia Ltd.)
Stephen Soderland (University of Washington)
Kiyotaka Uchimoto (CRL, Japan)
Yorick Wilks (University of Sheffield)
Message 2: Multilingual Language Resources and Interoperability
Date: 22-Feb-2006
From: Timothy Baldwin <tim+colacl2006csse.unimelb.edu.au>
Subject: Multilingual Language Resources and Interoperability



Full Title: Multilingual Language Resources and Interoperability

Date: 23-Jul-2006 - 23-Jul-2006
Location: Sydney, Australia
Contact Person: Timothy Baldwin
Meeting Email: tim+colacl2006csse.unimelb.edu.au
Web Site: http://coli.lili.uni-bielefeld.de/mlri06/

Linguistic Field(s): Computational Linguistics

Call Deadline: 03-Apr-2006

Call for Papers

Post COLING/ACL-2006 Workshop:
Multilingual Language Resources and Interoperability
Sydney, Australia July 23rd, 2006
http://coli.lili.uni-bielefeld.de/mlri06

MLRI06: Multilingual Language Resources and Interoperability

In an ever-expanding information society, most information systems are now
facing the ''multilingual challenge''. Lexical resources play an essential role
in these information systems. Such lexical resources need to provide information
on many languages in a common framework and should be (re)usable in many
applications (for automatic or human use).

This workshop aims at bringing together research on using and building different
language resources, such as lexicons, (Semantic Web) ontologies, tagsets,
annotation schemes etc., to achieve multilingual interoperation.

We welcome submissions on recent and innovative work involving such multilingual
ressources. Submissions should focus on any of the following topics:

- models for language/application-independent linguistic data,
- multilingual linguistic data acquisition and maintenance,
- interaction between different kinds of resources (lexicons,
corpora, ontologies, etc.) for different languages,
- lexical architectures to link data between languages
- Internationalization of annotations and representations
- use of linguistic resources for applications in a
- multilingual
context
- reuse of existing lexical resources in a specific
- multilingual
application
- human use of lexical resources in a multilingual context
- quality evaluation of multilingual data
- standardization issues

We welcome submissions concerning multilingual language resources of ANY kind.
However, special attention will be given to submissions presenting uses of XML
to achieve multilingual interoperability. Moreover, as the workshop takes place
in Australia, special attention will also be given to submissions on resources
involving Asian or Australian lesser-used languages.

The intent of the workshop is not only to review academic developments but also
to hear from industry, how these resources are used in new technology
developments and what the current needs are in the field. Presentations, project
notes and position papers are welcome.

SUBMISSION INSTRUCTIONS

Submissions should follow the two-column format of ACL proceedings and should
not exceed eight (8) pages, including references. We strongly recommend the use
of the LaTeX style files or Microsoft Word document template that will be made
available on the COLING/ACL onference Web site. A description of the required
format will be made available to those who are unable to make direct use of
these style files.

Submission will be electronic. The only accepted format for submitted papers is
Adobe PDF. The papers must be submitted no later than April 3, 2006. Papers
submitted after that time will not be reviewed. For details of the submission
procedure, please consult the submission webpage reachable via the workshop website.

Questions regarding the submission procedure should be directed to the
Program Co-Chairs (Gilles.Serassetimag.fr,
andreas.wittuni-bielefeld.de).

IMPORTANT DATES

Deadline for submission of Papers: April 3, 2006
Notification of Acceptance: May 2, 2006
Deadline for final paper submission: May 22, 2006


ORGANIZING COMMITTEE

Susan Armstrong ISSCO, Universiti de Genhve, Switzerland
Jim Breen, Monash University, Australia
Ulrich Heid IMS-CL, University of Stuttgart, Germany
Felix Sasaki, World Wide Web Consortium (Keio Research Institute at
SFC), Japan
Gilles Sirasset, GETA CLIPS-IMAG, Universiti Joseph Fourier, France
Andreas Witt, Bielefeld University, Germany

PROGRAM COMMITTEE

Helen Aristar-Dry, The Linguist List
Susan Armstrong ISSCO, Universiti de Genhve, Switzerland
Pushpak Battacharya IIT, Mumbai, India
Christian Boitet GETA CLIPS-IMAG, Universiti Joseph Fourier, France
Pierrette Bouillon ISSCO, Universiti de Genhve, Switzerland
Jim Breen, Monash University, Australia
Nicoletta Calzolari CNR, Pisa, Italy
Jean Carletta, University of Edinburgh, UK
Dan Cristea, University of Iasi, Romania
Patrick Drouin OLST, University of Montreal,Canada
Scott Farrar, University of Arizona, Tuscon, USA
Ulrich Heid IMS-CL, University of Stuttgart, Germany
Erhard Hinrichs, Eberhard-Karls-Universitdt T|bingen, Germany
Claus Huitfeldt, Bergen University, Norway
Phanh Huy Khan, DATIC, University of Danang, Vietnam
Nancy Ide, Vassar University, Poughkeepsie, NY, USA
Kyo Kageura University of Tokyo, Tokyo, Japan
Chuah Choy Kim, USM, Penang, Malaisie
Anke Luedeling, HU Berlin, Germany
Mathieu Mangeot Universiti de Savoie, France
Dieter Metzing, Bielefeld University, Germany
Massimo Poesio, University of Essex, UK
Alain Polguhre, OLST, University of Montreal,Canada
Andrei Popescu-belis ISSCO, Universiti de Genhve, Switzerland
Goutam Kumar Saha, Centre for Development of Advanced Computing, CDAC,
Kolkata, India
Felix Sasaki, World Wide Web Consortium (Keio Research Institute at
SFC), Japan
Thomas Schmidt, ICSI, Berkeley, USA
Gilles Sirasset, GETA CLIPS-IMAG, Universiti Joseph Fourier, France
Gary Simons, SIL, Dallas, USA
Virach Sornlertlamvanich, Thai Computational Linguistics Laboratory,
NICT, Thailand
C.M. Sperberg-McQueen, MIT Boston and W3C, USA
Manfred Stede, Potsdam University, Germany
Dan Tufis, RACAI, Uni Bucharest, Romania
Jun'ichi Tsujii, University of Tokyo, Japan
Takehiro Utsuro, Kyoto University, Japan
Andreas Witt, Bielefeld University, Germany
Michael Zock, LIMSI, Orsay, France



Respond to list|Read more issues|LINGUIST home page|Top of issue




Please report any bad links or misclassified data

LINGUIST Homepage | Read LINGUIST | Contact us

NSF Logo

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.