LINGUIST List 15.677

Tue Feb 24 2004

Calls: Lang Documentation/USA; Computational Ling

Editor for this issue: Andrea Berez <>

As a matter of policy, LINGUIST discourages the use of abbreviations or acronyms in conference announcements unless they are explained in the text. To post to LINGUIST, use our convenient web form at


  1. fox, E-MELD Workshop on Digital Language Documentation of Endangered Languages
  2. collier, International Joint Workshop on Natural Language Processing in Biomedicine and Its Applications

Message 1: E-MELD Workshop on Digital Language Documentation of Endangered Languages

Date: Mon, 23 Feb 2004 19:01:17 -0500 (EST)
From: fox <>
Subject: E-MELD Workshop on Digital Language Documentation of Endangered Languages

E-MELD Workshop on Digital Language Documentation of Endangered

Date: 15-July-2004 - 18-July-2004
Location: Detroit, MI, United States of America
Contact: Naomi Fox
Contact Email: 
Meeting URL: 

Linguistic Sub-field: Computational Linguistics, Language Description

Call Deadline: 05-Apr-2004 

Meeting Description:

E-MELD Digital Language Documentation Workshop: Databases and Best Practice.

The E-MELD (Electronic Metastructure for Endangered Languages Data)
Project is a five-year project funded by the National Science
Foundation with a dual objective: to aid in the preservation of
endangered languages data and documentation and to aid in the
development of the infrastructure necessary for effective
collaboration among electronic archives. As part of the effort to
promote consensus on best practices in digital language resources,
E-MELD is hosting its fourth workshop on issues in digitizing
endangered language documentation July 15-18, 2004.

The focus for the 2004 workshop will be 'Databases and Best Practice'.
A database can be a valuable utility in handling linguistic data, and
although the use of a database in itself does not promote digital best
practices, there are ways to use databases that are consonant with
best practice. Current recommendations in the E-MELD School of Best
Practice ( include archiving irreplaceable
text data in .txt file format with XML markup; however, this format is
not always optimal for working with linguistic data. Using a database
can be a valuable tool for managing linguistic data.

E-MELD welcomes 1-page abstracts of papers which address the issue of
databases in best practice. We also invite descriptions of electronic
tools which support best practice in linguistic documentation for
inclusion in a demonstration session. The deadline for abstracts is
April 5. Please submit abstracts using the form at Abstracts
should be one page in length, with an additional page allowed for
tables, images, or references.

The workshop will consist of general sessions, demonstrations of tools
useful for linguistic data preparation, and working group sessions in
which the workshop participants will provide input on best practices
in digital language documentation and advise on the expansion of the
School of Best Practice ( We particularly
welcome the participation of field linguists in the workshop.

The conference website is located at For more information, visit the E-MELD
website ( or the School of Best Practice
(, or e-mail us (
Previous topics for E-MELD workshops include Interlinear Text
(, Digitizing Lexical Information
(, and The Need for Standards
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

Message 2: International Joint Workshop on Natural Language Processing in Biomedicine and Its Applications

Date: Mon, 23 Feb 2004 21:47:54 -0500 (EST)
From: collier <>
Subject: International Joint Workshop on Natural Language Processing in Biomedicine and Its Applications

International Joint Workshop on Natural Language Processing in
Biomedicine and Its Applications
Short Title: BioNLP/NLPBA 2004 

Date: 28-Aug-2004 - 29-Aug-2004
Location: Geneva, Switzerland
Contact: Nigel Collier
Contact Email: 
Meeting URL: 

Linguistic Sub-field: Computational Linguistics 

Call Deadline: 14-Apr-2004 

Meeting Description:

International Joint Workshop on Natural Language Processing in
Biomedicine and its Applications at COLING 2004, August 28-29, Geneva,
Switzerland. Deadline for regular papers is April 14th and for
shared-task papers is April 21st. COLING Workshop


Call for Papers

Geneva, Switzerland
August 28-29, 2004

Workshop Description

Recent years have seen a growing interest in the application of NLP
techniques to texts in the domains of biology and medicine. The
problem of information overload that has resulted from the massive
growth in the scientific literature has clearly shown the necessity to
automatically locate, organize and manage facts relating to
experimental results. At the same time clinicians have experienced
greatly improved access to the medical literature and clinical
repositories which needs to be matched by the development of enhanced
information access tools. This year NLPBA
( and BioNLP
( merge for a
joint workshop with the aim of bringing together researchers from
natural language processing, bio-informatics, medicine and ontologies
who are concerned with developing methods and resources for solving
these problems.

Over the last five years we have seen significant steps forward in the
development of language technology and large-scale resources for the
Bio-Medical domain such as linguistically annotated corpora
(e.g. GENIA POS and NE corpora), ontologies (e.g. Gene Ontology),
thesauri (e.g. UMLS Metathesaurus), lexicons and term lists (e.g. UMLS
SPECIALIST) as well as information retrieval collections (e.g. TREC
Genomics track). At the application level we see development of
question answering systems, event recognition, zone (rhetorical
region) identification, as well as term and bio-entity
recognition. The demand for information access tools from domain users
is increasing to support literature survey, often integrated into
online 'portals' where scientists can navigate through related
information resources such as genetics and disease databases. Ongoing
challenges relate to the growing and ambiguous nomenclature, the need
to integrate deep knowledge sources into machine learning, a need to
scale up methods for processing full text articles etc.

The objective of the workshop is to bring together researchers in this
area, to establish common themes and goals between different
groups. We have seen from previous experience in the natural language
learning and information retrieval communities the benefits of sharing
resources and developing common evaluation criteria. In this workshop
we are introducing a special shared task to promote discussion of
these issues as well as the objective of integrating machine learning
with knowledge resources.

We invite submission of papers on topics related to bio-medical NLP
including, but not limited to:

* Information extraction
* Text mining
* Named entity recognition
* Coreference resolution
* Term recognition
* Knowledge-based information retrieval
* Multi-lingual resources and applications
* Ontology construction and ontology mapping
* Visualization tools for viewing clustered or extracted information
or meta-data
* Multi-modal approaches combining text and images, etc.
* Event recognition
* Construction of pathways from literature and databases
* Creation of data-sets of bio-medical entities, coreferences and
* Annotation standards and quality control methodologies
* Resource integration and re-engineering
* Corpus/lexicon construction
* Text summarization and report generation

Shared Task

This year we propose to have a special shared task: bio-medical named
entity recognition from the GENIA corpus. The purpose of this track is
essentially to investigate the integration of statistical machine
learning methods with symbolic knowledge sources from the bio-medical
domain such as ontologies, thesauri and lexicons : shared task

Invited Speaker
To be announced soon

Paper Format and Submission

Papers must follow the COLING 2004 templates, and will be submitted by
email to

Important Dates
Submission deadline for regular workshop papers: April 14th, 2004
Submission deadline for shared task papers: April 21st, 2004
Notification of accepted papers: May 14th, 2004
Deadline for camera ready copies: June 6th, 2004

Workshop Chairs
Nigel Collier (National Institute of Informatics, Japan)
Patrick Ruch (University Hospital of Geneva and EPFL, Switzerland)
Adeline Nazarenko (LIPN, France)

Organizing Committee
Alfonso Valencia (Centro Nacional de Biotecnologia, Spain)
Carol Friedman (CUNY/Columbia University, USA)
Donia Scott (University of Brighton, UK)
Udo Hahn (Albert-Ludwigs University, Freiburg, Germany)
Junichi Tsujii (University of Tokyo, Japan)

Program Committee
Sophia Ananiadou (University of Salford, UK)
Alan Aronson (National Library of Medicine, USA)
Robert Baud (University Hospital of Geneva, Switzerland)
Christian Blaschke (CNB, Spain)
Oliver Bodenreider (National Library of Medicine, USA)
Berry de Bruijn (National Research Center, Canada)
Marc Craven (University of Wisconsin, USA)
Robert Gaizauskas (University of Sheffield, UK)
Eric Gaussier (Xerox, XRCE, France)
Vasileios Hatzivassiloglou (Columbia University, USA)
Lynette Hirschman (MITRE, USA)
Dimitar Hristovski (University of Ljubljana, Slovenia)
Jerry Hobbs (USC/ISI, USA)
Aravind Joshi (University of Pennsylvania, USA)
Su Jian (Institute for Infocomm Research, Singapore)
Asao Fujiyama (National Institute of Informatics, Japan)
Arne Jonsson (University of Linkoping, Sweden)
Frederique Lisacek (GeneBio SA, Switzerland)
Yuji Matsumoto (NAIST, Japan)
Claire Nedellec (INRA, France)
Kousaku Okubo (Kyushu University, Japan)
Jong C. Park (KAIST, Korea)
Thierry Poibeau (LIPN, France)
Denys Proux (Xerox, XRCE, France)
James Pustejovsky (Brandeis University, USA)
Dietrich Rebholz-Schuhmann (European Bioinformatics Institute, EU)
Irena Spasic (UMIST, UK)
Ben Stapley (UMIST, UK)
Padmini Srinivasan (University of Iowa, USA)
Hirotoshi Taira (NTT Communication Science, Japan)
Toshihisa Takagi (University of Tokyo, Japan)
Yuka Tateishi (University of Tokyo, Japan)
Anne-Lise Veuthey (SIB, Switzerland)
Limsoon Wong (Institute for Infocomm Research, Singapore)
Pierre Zweigenbaum (AP-HP, INSERM & INaLCO, France)

Contact Information

General organization :
Shared task organization:
Main page :
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue