LINGUIST List 10.1970

Sun Dec 19 1999

Calls: Canadian Ling Association, Corpus Ling/LREC

Editor for this issue: Lydia Grebenyova

As a matter of policy, LINGUIST discourages the use of abbreviations or acronyms in conference announcements unless they are explained in the text.


  1. Nicole Rosen, Canadian Linguistic Association/ Edmonton, Canada
  2. Nancy M. Ide, Corpus Linguistics Workshop - LREC 2000/ Athens, Greece

Message 1: Canadian Linguistic Association/ Edmonton, Canada

Date: Fri, 17 Dec 1999 11:50:50 -0500
From: Nicole Rosen
Subject: Canadian Linguistic Association/ Edmonton, Canada



The Canadian Linguistic Association Inc will hold its 2000 conference
as part of the Congress of the Social Sciences and Humanities at the
University of Alberta, Edmonton, Canada, from Friday 26 May to Monday
29 May 2000. Members are invited to submit abstracts representing all
areas of linguistics. Only members in good standing for 2000 may
submit an abstract.

ABSTRACT REQUIREMENTS (Please read carefully)
1.	Abstracts are to be submitted on 21.6 x 27.9 cm (8.5 x 11 
in) typewritten single-spaced pages. A completed copy of the 
enclosed abstract submission form is to be sent along with five (5) 
copies of the abstract, one of which must be an original suitable for 
reproduction. Abstracts may be no longer than one (1) page, plus 

a) A complete list of references cited in the abstract must also be
submitted (also in five copies).

b) In order to help ensure anonymity, abstracts should not contain
excessive references to their authors' publications. Furthermore, the
abstract itself should not contain acknowledgements of grants or
granting agencies.

c) References in the abstract should normally be limited to published

3. Abstracts should contain 

a) a title;

b) a clear and explicit statement of the theory or hypothesis to be

c) an indication of the main arguments or studies involved, including
key examples, if applicable;

d) a statement of conclusions reached and an assessment of their

Abstracts which, on account of their subject matter, cannot conform to
requirements 3 b) to 3 d) must contain a clear statement of the subject
to be discussed, its relationship to previous studies in the same
discipline, and its general relevance.

4. Authors should identify themselves on the abstract submission form
only, not on the abstract.

5. Members are limited to one individual and one joint abstract

	CLA Program Committee
	Department of Linguistics
	University of Toronto
	130 St. George St.
	Toronto, Ontario

E-MAIL: Abstracts may be submitted by e-mail, preferably as
attachments in one of the following formats, in descending order of
preference: MSWord, Rich Text Format, or WordPerfect 6.0 or lower. If
the abstract is sent in the body of the message, please also send a
hard copy by regular mail at the same time for inclusion in the
abstract booklet. Likewise, if the abstract contains fonts or
diacritics that are not easily transmitted by e-mail, please send it
by regular mail.



For further information about the conference, please visit the CLA's 
Web site:

The CLA receives funding from the Social Sciences and Humanities
Research Council of Canada (SSHRCC) to partially cover participants'
travel expenses. In no case will a grant exceed the
cost of a round-trip economy-class fare. Grants generally only
partially cover travel costs. Only members residing in Canada and in
good standing on 1 February 2000 will be eligible for a travel grant.
Furthermore, applicants must submit their original ticket stubs to the
Secretary-Treasurer (Virginia Hill, University of New Brunswick,
P.O. Box 5050, Saint John, N.B. E2L 4L5; Email: ) before 1
July 2000.

There will be a contest for the best paper at the CLA conference
written and delivered by one or more students. Please indicate on the
abstract submission form if you wish to be considered for this

------------------------------------------------------



Please complete this form and attach it to your abstract. Kindly 
identify yourself on this form only, and not on the abstract itself. 
Please type or print clearly. Deadline for receipt by the Program 
Committee: 1 February 2000. Mail to: CLA Program Committee, 
Department of Linguistics, University of Toronto, 130 St. George St., 
Toronto, ON, CANADA M5S 3H1. E-mail address: 


Address:___________________________	e-mail address: _______________
	___________________________	office phone: ________________

If applicable:	Co-author(s) 	_______________________________________________
		Affiliation(s) ________________________________________________
		Email Address(es) _____________________________________________

Total number of authors: ___

Title of paper: _________________________________________________________

Main area addressed by paper (please choose one area only):
___ discourse analysis	___ 1st/2nd language acquisition
___ historical linguistics
___ morphology	___ neurolinguistics	___ phonetics
___ phonology	___ pragmatics	___ psycholinguistics
___ semantics	___ sign language studies	___ sociolinguistics
___ syntax	___ other : __________________

Will you require audio-visual equipment?
___ yes	___ no	If yes, please specify: _________________________________

Have you given or do you plan to give this paper elsewhere?
___ yes	___ no
If yes, please specify when and where: 

Would you be willing to chair a session?
___ yes	___ no	If yes, in which area: __________________________________

If your paper is accepted, would you like to be considered for the 
Best Student Paper Contest? (All authors must be students.)
___ yes	___ no

Message 2: Corpus Linguistics Workshop - LREC 2000/ Athens, Greece

Date: Sat, 18 Dec 1999 11:39:05 -0500 (EST)
From: Nancy M. Ide
Subject: Corpus Linguistics Workshop - LREC 2000/ Athens, Greece


Second International Conference on Language Resources and Evaluation
 (LREC 2000)

 Athens, Greece

 Pre-Conference Workshop Announcement and Call for Participation

 Data Architectures and Software Support for Large Corpora:
 Towards an American National Corpus

 Monday, May 29, 2000


---------

Several software systems for linguistic annotation, search, and
retrieval of large corpora have been developed within the natural
language processing community over the past several years, including
LT-XML (Edinburgh), GATE (Sheffield), IMS Corpus Workbench
(Stuttgart), Alembic Workbench (Mitre), MATE
(Edinburgh/Odense/Stuttgart), Silfide (Loria/CNRS), SARA (BNC), and
several others. Related to and in support of this development, there
have also been efforts to develop standards for encoding and various
kinds of linguistic annotation, as well as data architectures (e.g.,
TIPSTER, TalkBank). Still other developments, such as the
introduction of XML and the powerful XSL transformation language and
work on semi-structured data (e.g., the work of the Lore group at
Stanford), have also impacted the ways in which corpora and other
linguistic resources can be represented, stored, and accessed. 

Approaches to the fundamental design of the formats, data, and tools
are varied among current systems for the annotation and exploitation
of linguistic corpora. A primary reason for this diversity is that
most developers of formats and systems are concerned with only one
aspect of the creation/annotation/exploitation process. However, in
order to work effectively to develop commonality, the phases of the
process must be considered as a whole. This demands bringing together
researchers and developers from a variety of domains in text, speech,
video, etc., many of whom have previously had little or no contact
with one another. 

This workshop is intended to bring these groups together to look
broadly at the technical issues that bear on the development of
software systems for the annotation and exploitation of linguistic
resources. The goal is to lay the groundwork for the definition of a
data and system architecture to support corpus annotation and
exploitation that can be widely adopted within the community. Among
the issues to be addressed are: 

 o layered data architectures 
 o system architectures for distributed databases 
 o support for plurality of annotation schemes 
 o impact and use of XML/XSL 
 o support for multimedia, including speech and video 
 o tools for creation, annotation, query and access of corpora 
 o mechanisms for linkage of annotation and primary data 
 o applicability of semi-structured data models, search and query 
 systems, etc. 
 o evaluation/validation of systems and annotations 

The motivation for this workshop is the American National Corpus (ANC)
effort, which should begin corpus creation within the year. We
anticipate that the ANC will provide a significant resource for
natural language processing, and we therefore seek to identify
state-of-the-art methods for its creation, annotation, and
exploitation. Also, because the ANC will be a national and freely
available resource, its data and system architecture is likely to
become a de facto standard. We therefore hope to draw together
leading researchers and
developers to establish a basis for the design of a system to support
the creation and use of the ANC. 

A "Birds of a Feather" session for those interested in the ANC project
will be held immediately following the workshop.

Submission information
----------------------

Submissions should address one or more of the listed
topics. Descriptions of planned or existing systems are acceptable, but
they should be situated in the larger context of the issues the
workshop addresses, e.g., by outlining the strengths and/or weaknesses
of the system and/or data formats or by comparing it with alternative
approaches.

A 3000-4500 word abstract in English should be submitted by e-mail to
Nancy Ide in plain ASCII text format and with the
subject line "LREC WORKSHOP SUBMISSION : <First author's name>". Each submission
should include title; author(s); affiliation(s); and contact author's
e-mail address, postal address, telephone and fax numbers.

February 15, 2000 : Submissions due
March 15, 2000 : Results transmitted to authors
April 15, 2000 : Final Papers due
May 29, 2000 : Workshop

Organizing Committee
--------------------

Nancy Ide (contact)
Department of Computer Science
Vassar College
Poughkeepsie, New York 12604-0520 USA
Tel : +1 914 437 5988
Fax : +1 914 437 7498
Email :

Laurent Romary
Campus Scientifique - BP 239
54506 Vandoeuvre-lès-Nancy FRANCE
Tel : +33 (0)3 83 59 30 00
Fax : +33 (0)3 83 27 83 19
Email :

Henry S. Thompson 
Human Communication Research Centre 
2 Buccleuch Place 
Edinburgh EH8 9LW 
Tel : +44 (131) 650 4440 
Fax : +44 (131) 650 4587
Email : 

Program Committee
-----------------

Steven Bird, Linguistic Data Consortium 
Patrice Bonhomme, LORIA/CNRS 
Roy Byrd, IBM Corporation 
Jean Carletta, HCRC Edinburgh 
Ulrich Heid, IMS Stuttgart 
Hamish Cunningham, Sheffield 
David Day, Mitre Corporation 
Robert Gaizauskas, Sheffield 
Ralph Grishman, New York University 
Nancy Ide, Vassar College (Chair) 
Masato Ishizaki, JAIST 
Dan Jurafsky, University of Colorado at Boulder 
Tony McEnery, Lancaster 
David McKelvie, HCRC Edinburgh 
Laurent Romary, LORIA/CNRS 
Gary Simons, Summer Institute of Linguistics 
Henry Thompson, HCRC Edinburgh 
Yorick Wilks, Sheffield 
Peter Wittenburg, Max Planck Institute 
Remi Zajac, New Mexico State University