LINGUIST List 13.3297

Sat Dec 14 2002

Calls: Multilingual Corpora/Computational Ling

Editor for this issue: Karolina Owczarzak <>

As a matter of policy, LINGUIST discourages the use of abbreviations or acronyms in conference announcements unless they are explained in the text.


  1. hansen, Multilingual Corpora, Lancaster UK
  2. bowen, Pacific Association for Computational Linguistics, Canada

Message 1: Multilingual Corpora, Lancaster UK

Date: Fri, 13 Dec 2002 09:50:45 +0000
From: hansen <>
Subject: Multilingual Corpora, Lancaster UK

Multilingual Corpora: Linguistic Requirements and Technical Perspectives

Location: Lancaster, Great Britain
Date: 27-Mar-2003 - 27-Mar-2003 
Call Deadline: 20-Jan-2003

Web Site:
Contact Person: Stella Neumann
Meeting Email:

Linguistic Subfield(s): Text/Corpus Linguistics 

Meeting Description: 

A pre-conference workshop to be held at Corpus Linguistics 2003
Lancaster, 27 March 2003



Stella Neumann (Department of Applied Linguistics, Translation and

Silvia Hansen (Department of Computational Linguistics)

Saarland University, Saarbr�cken, Germany


How do researchers go about building multilingual corpora? For the
development of a linguistically interpreted corpus on the basis of
more than one language there seem to be two methods: First, the
multilingual corpus is split up into monolingual sub-corpora which are
then annotated independently. For the second method, one language
serves as the basis for building up and interpreting a multilingual
corpus, whereas the other has to be adapted. Both methods, however,
are rather problematic. They do not take sufficiently into account the
differences and commonalities between the languages in question at
each stage of corpus-based research, involving the comparability of
the corpus design, the different kinds of segmentation, the diverging
annotation schemes, the corpus representations and finally the again
converging querying across different languages. Mistakes or
inconsistencies which happen at one stage of the multilingual corpus
development have negative influences on the following steps and result
in worse mistakes or inconsistencies. Not only do these problems arise
at each methodological step. They also multiply with the growing
complexity of the research design. If the research aims at
interpreting linguistic data on several levels, cross-linguistic
comparability has to be taken into account on each level.

The goal of the workshop is to bring together researchers who
formulate specific requirements of how to work with corpora under a
linguistic perspective and engineers who can offer technical solutions
but need the input of users to adapt their tools to the needs of the
linguists. Within this context, questions like the following are to be

- What happens, if the units under investigation diverge on the
different levels? 
- At present, the preferred solution is to use XML at all stages and
on all layers. But is this really practicable? 
- Do linguists get along with stand-off mark-up? 
- Is this maybe a technical compromise?

The workshop should result in a requirement catalogue in combination
with technical solutions. It could thus serve as a starting point for
the development of an annotation typology which takes into account
different languages as well as different annotation layers. On the
basis of this typology, the comparability of a multilingual
multi-layer annotated corpus can be guaranteed. With this in mind, a
multilingual corpus builder should be able to cope with possible
problems in each of the above explained steps in corpus development.

Papers are expected on the following questions:
- linguistic requirements in the different methodological steps
- state-of-the-art technical solutions
- international standards which facilitate the development and
exchange of multilingual corpora


The workshop will take a full day comprising about 8-10 papers. Short
presentations are expected leaving enough time for discussion and
assessment of the used methodologies as well as the development of
possible solutions. This already points to the workshop agenda: The
first third will deal with linguistic fundamentals, the second part
will discuss the technical aspects and the last third will provide a
platform for integrating both perspectives. Workshop proceedings will
be produced.


to be announced!


20 January 2003: Deadline for submitted papers
21 February 2003: Notification of acceptance
7 March 2003: Camera ready copy
27 March 2003: Workshop


Please refer to the main conference web page 
( for registration details.


Please send submissions in English as RTF or plain text files
(preferably by email) to the address below. Paper length should be
8-10 pages, formatted in the same way as for the main conference (see for paper format
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

Message 2: Pacific Association for Computational Linguistics, Canada

Date: Fri, 13 Dec 2002 01:44:43 +0000
From: bowen <>
Subject: Pacific Association for Computational Linguistics, Canada

Pacific Association for Computational LINGuistics

Short Title: PACLING'03
Location: Halifax, Nova Scotia, Canada
Date: 22-Aug-2003 - 25-Aug-2003 
Call Deadline: 07-Mar-2003

Web Site:
Contact Person: Bowen Hui
Meeting Email:

Linguistic Subfield(s): Computational Linguistics 

Meeting Description: 

PACLING'03 will be a low-profile, high-quality, workshop-oriented
meeting whose aim is to promote friendly scientific relations among
Pacific Rim countries, with emphasis on interdisciplinary scientific
exchange demonstrating openness towards good research falling outside
current dominant schools of thought, and on technological transfer
within the Pacific region. The conference represents a unique forum
for scientific and technological exchange, being smaller than ACL,
COLING, or Applied NLP, and also more regional with extensive
representation from the Pacific.

- ----
Original papers are invited on any topic in computational linguistics
(and closely related areas) including, but not limited to, the

* phonology, phonetics, morphology, syntax, semantics, pragmatics,
* dialogues, spoken languages, corpora,
* text and message understanding and generation,
* language translation and translation aids,
* language learning and learning aids,
* question-answering systems and interfaces to (multimedia)
* databases, language and input/output devices,
* natural-language-based software.

Submission of Papers
- ------------------
Authors should prepare extended abstracts, in English, not more than
3000 words including references. The title page must include: author's
name, postal address, e-mail address (if possible), telephone and
facsimile numbers; a brief 100--200 word summary; and some key words
for classifying the submission.

- ------
 Submission deadline: March 14, 2003
 Notification of acceptance: May 15, 2003
 Camera-ready copy due: June 20, 2003 

PACLING (Pacific Association for Computational LINGuistics) has grown
out of the very successful Japan-Australia joint symposia on natural
language processing held in November 1989 in Melbourne, Australia and
in October in Iizuka, Japan in 1991. The first five meetings of the
retitled PACLING, a name designed to express the wider membership,
took place in Vancouver, Canada in 1993, in Brisbane, Australia in
1995, in Ohme, Tokyo, Japan in 1997, in Waterloo, Canada in 1999, and
in Kitakhyshu in 2001. The sixth meeting will be hosted by:

* Canadian Society for Computational Studies of Intelligence
 (CSCSI)/Soci�t� canadienne pour l'�tude de l'intelligence
 par ordinateur (SCEIO)
* Dalhousie University, Canada
* GINIus., Inc.
* The Technical Group on Natural Language Understanding and
 Communication of Institute of Electronics, Information, and
 Communication Engineers of Japan
* The Technical Group on Thought and Language of Institute of
 Electronics, Information, and Communication Engineers of Japan

Papers that are being submitted to other conferences, whether verbatim
or in essence, must reflect this fact after key words. If a paper
appears at another conference, it must be withdrawn from
PACLING'03. Papers that violate these requirements are subject to
rejection without review.

Authors of a selection of representative papers which the Program
Committee identifies will be invited to revise their papers and submit
to a special issue of one of several journals with whom we are
negotiating including Computational Intelligence: An International
Journal, Natural Language Engineering, and Artificial Intelligence

Please send four copies of each submission to:

 Tsutomu ENDO, Professor
 Department of Artificial Intelligence
 Kyushu Institute of Technology
 680-4 Kawazu, Iizuka, Fukuoka 820-8502 Japan
 Tel. +81-948-29-7616
 Fax. +81-948-29-7601
- or -
 Vlado KESELJ, Assistant Professor
 Faculty of Computer Science
 Dalhousie University
 6050 University Avenue,Halifax, NS B3H 1W5
 Tel. 1-902-494-2893
 Fax. 1-902-494-1517

Organizing committee
- ------------------
Members: Hiroshi Sakaki (Meisei University, Japan)
 Robert Dale (Macquarie University, Australia)
 Randy Goebel (University of Alberta, Canada)

Advisory committee
- ----------------
Members: Naoyuki Okada (Kyushu Institute of Technology, Japan)
 Yorick Wilks (Sheffield University, England)
 Charles Fillmore (University of California, Berkeley, USA)

Conference committee
- ------------------
Chairs: Shun Ishizaki (Keio University, Japan)
 Nick Cercone (Dalhousie University, Canada)
Program coordinators:
 Tsutomu Endo (Kyushu Institute of Technology, Japan)
 Vlado Keselj (Dalhousie University, Canada)
Aijun An (York University, Canada)
Chutiporn Anutariya (Institutt for Telematikk, Norway)
Francis Bond (NTT, Japan)
Sandra Carberry (University of Delaware, U.S.A.)
Charlie Clarke (University of Waterloo, Canada)
Robin Cohen (University of Waterloo, Canada)
Veronica Dahl (Simon Fraser University, Canada)
Robert Dale (Macquarie University, Australia)
Hercules Dalianis (Royal Inst. of Tech., DSV-KTH, Sweden)
Amit Dubey (Universitat des Saarlands, Germany)
Chrysanne DiMarco (University of Waterloo, Canada)
Randy Goebel (University of Alberta, Canada)
Michael Higgins (Yamaguchi University, Japan)
Graeme Hirst (University of Toronto, Canada)
Immy Huang (University of Waterloo, Canada)
Bowen Hui (University of Toronto, Canada)
Kentaro Inui (Kyushu Institute of Technology, Japan)
Shun Ishizaki (Keio University, Japan)
Richard Kittredge (University of Montreal, Canada)
Kiyoshi Kogure (ATR Laboratories, Japan)
Guy Lapalme (University of Montreal, Canada)
Dekang Lin (University of Alberta, Canada)
Charles Ling (University of Western Ontario, Canada)
Qin Lu (Hon Kong Polytechnic University, Hong Kong)
Stan Matwin (University of Ottawa, Canada)
Robert Mercer (University of Western Ontario, Canada)
Gordon McCalla (University of Sydney, Australia)
Paul McFetridge (Simon Fraser University, Canada)
Kanlaya Naruedomkul (Mahidol University, Thailand)
T. Pattabhiraman (Conversay, USA)
Fuchun Peng (University of Waterloo, Canada)
Gerald Penn (University of Toronto, Canada)
Fred Popowich (Simon Fraser University, Canada)
Hiroshi Sakaki (Meisei University, Japan)
L. K. Schubert (University of Rochester, USA)
Dale Schuurmans (University of Waterloo,Canada)
Akira Shimazu (NTT, Japan)
Booncharoen Sirinaovakul (KMUTT, Thailand)
Virach Sornlertlamvanich (NECTEC, Thailand)
Tomek Strzalkowski (SUNY Albany, USA)
Ryoichi Sugimura (Matsushita, Japan)
Thomas Trappenberg (Dalhousie University, Canada)
Yorick Wilks (Sheffield University, England)
Jianna Zhang (Western Washington State, USA)
Ning Zhong (Yamaguchi University, Japan)
Ingrid Zukerman (Monash University, Australia)
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue