LINGUIST List 9.83

Mon Jan 19 1998

Calls: ChiPhon '98, NLP workshop

Editor for this issue: Brett Churchill <>

Please do not use abbreviations or acronyms for your conference unless you explain them in your text. Many people outside your area of specialization will not recognize them. Also, if you are posting a second call for the same event, please keep the message short. Thank you for your cooperation.


  1. Rachel Hemphill, Chicago Phonetics and Phonology (ChiPhon) 98 call
  2. Wim Peters, Workshop announcement

Message 1: Chicago Phonetics and Phonology (ChiPhon) 98 call

Date: Sun, 18 Jan 1998 23:08:47 -0600
From: Rachel Hemphill <>
Subject: Chicago Phonetics and Phonology (ChiPhon) 98 call

ChiPhon -- the Phonetics and Phonology Group at the University of
Chicago -- provides a forum for discussing fundamental questions in
spoken language research. 

Last year, over 100 people joined us for a day-long panel on whether
speech is special. This year, as part of the 34th annual meeting of
the Chicago Linguistic Society, we examine how language acquisition
data should be incorporated into phonological and phonetic theory.

 ChiPhon '98
 Saturday, April 18,
 Chicago, Illinois


"the ideal relationship between central and developmental portions of
a field would be one in which the investigators in each... would
construct model children who were capable of developing into adults
and, conversely, model adults who could have developed from children"
(Menn, 1980).

* Which provides a better measure of a speaker's successful
acquisition of speech -- perceiving contrasts or producing them? How
are PERCEPTION & PRODUCTION related to linguistic competence?

* How should acquisition data be used when formulating and evaluating
phononological theories of tone and stress? What are the PROSODIC
PRIMITIVES, and are they innate or can they be bootstrapped from the

* What speech processing mechanisms are responsible for
SECOND-LANGUAGE development? Are they the same as those for adults
processing L1 or for infants acquiring it? What does DISORDERED
PHONOLOGY tell us about these mechanisms?

We invite abstracts that treat the acquisition of spoken language as
integral to the development of linguistic theory.

Invited Speakers:
	James Flege (University of Alabama at Birmingham)
	Peter Jusczyk (Johns Hopkins University)
	Robert Port (Indiana University)

Like last year, this year's symposium will provide ample opportunities
for discussion with talks throughout the day, a box lunch, and an
hour-long discussion at the end.

OTHER EVENTS that weekend include a panel on LANGUAGE ACQUISITION AND
THE LEXICON on April 19.

* Sophisticated CONNECTIONIST MODELS of language and acquisition
provide an ideal framework for constructing linguistic theories that
incorporate the complex interactions between sounds, symbols and

* The ways in which we conceptualize the organization of MORPHOLOGY
have different implications for first and for second language

* Ontogeny recapitulates phylogeny and for mature organisms we look at
LEXICAL ACCESS convinced cognition then recapitulates ontogeny.

Invited speakers:
	Joan Bybee (University of New Mexico)
	David Pisoni (Indiana University)
	Terry Regier (University of Chicago)

The CLS conference will also host a panel on The Status of Constraints
in Linguistic Theory on April 17, with John McCarthy and Jerrold

Papers on phonology, morphology, and syntax, among others, will be
presented during the main sessions on April 17-19, with Diana
Archangeli and David Dowty.

BY JANUARY 31, 1998. 

Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

Message 2: Workshop announcement

Date: Mon, 19 Jan 1998 17:34:40 GMT
From: Wim Peters <>
Subject: Workshop announcement

			Call for papers

	 Distributing and Accessing Linguistic Resources

Workshop immediately before the First International Conference on
language Resources and Evaluation (LREC),
May 27 1998
Granada, Spain

Short description:

This workshop will discuss ways to increase the efficacy of linguistic
resource distribution and programmatic access, and work towards the
definition of a new method for these tasks based on distributed processing
and object-oriented modelling with deployment on the WWW.

Organizers: Yorick Wilks, Hamish Cunningham, Wim Peters, Remi Zajac

Workshop Scope and Aims
- ---------------------

In general the reuse of of NLP data resources (such as lexicons or corpora)
has exceeded that of algorithmic resources (such as lemmatisers or parsers).
However, there are still two barriers to data resource reuse:

1) each resource has its own representation syntax and corresponding
 programmatic access mode (e.g. SQL for CELEX, C or Prolog for Wordnet,
 SGML for the BNC);

2) resources must generally be installed locally to be usable (and of
 course precisely how this happens, what operating systems are supported
 etc. varies from case to case).

The consequences of 1) are that although resources share some structure in
common (lexicons are organised around words, for example) this commonality
is wasted when it comes to using a new resource (the developer has to learn
everything afresh each time) and that work which seeks to investigate or
exploit commonalities between resources (e.g. to link several lexicons to an
ontology) has to first build a layer of access routines on top of each
resources. So, for example, if we wish to do task-based evaluation of lexicons
by measuring the relative performance of an information extraction system
with different instantiations of lexical resource, we might end up writing
code to translate several different resources into SQL or SGML.

The consequence of 2) is that there is no way to "try before you buy": no
way to examine a data resource for its suitability for your needs before
licencing it. Correspondingly there is no way for a resource provider to
expose limitted access to their products for advertising purposes, or gain
revenue through piecemeal supply of sections of a resource.

This workshop will discuss ways to overcome these barriers. The proposers 
will discuss a new method for distributing and accessing language resources 
involving the development of a common programmatic model of the various 
resources types, implemented in CORBA IDL and/or Java, along with a 
distributed server for non-local access. This model is being designed as
part of the GATE project (General Architecture for Text Engineering: and goes under the
provisional title of an Active CREOLE Server. (CREOLE: Collection of REusable
Objects for Language Engineering. Currently CREOLE supports only algortihmic
objects, but will be extended to data objects.)

A common model of language data resources would be a set of inheritance
hierarchies making up a forest or set of graphs. At the top of the hierarchies
would be very general abstractions from resources (e.g. lexicons are about
words); at the leaves would be data items that were specific to individual
resources. Programmatic access would be available at all levels, allowing
the developer to select an appropriate level of commonality for each

Note that although an exciting element of the work could be to provide
algorithms to dynamically merge common resources (e.g. connect WordNet to 
Celex), what we're suggesting initially is not to develop anything 
substantively new, but simply to improve access to existing resources. This 
is NOT a new standards initiative, but a way to build on previous initiatives.

Of course, the production of a common model that fully expressed all the
subtleties of all resources would be a large undertaking, but we believe
that it can be done incrementally, with useful results at each stage. Early
versions will stop decomposing the object structure of resources at a fairly
high level, leaving the developer to handle the data structures native to
the resources at the leaves of the forest. There should still be a
substantial benefit in uniform access to higher level strucures.

Draft Program Committee
- ---------------------

Yorick Wilks
Hamish Cunningham
Wim Peters
Remi Zajac
Roberta Catizone
Paola Velardi
Maria Teresa Pazienza
Louise Guthrie
Roberto Basili
Bran Boguraev
Sergei Nirenburg
James Pustejowsky
Ralph Grishman
Christiane Fellbaum

Paper Submission
- --------------


Papers should not exceed 4000 words or 10 pages.


Three hard copies should be sent to:

Gill Callaghan, FAO Yorick Wilks
Dept. Computer Science
University of Sheffield
Regent Court
211 Portobello St.,
Sheffield S1 4DP


Electronic submission will be allowed in Poscript or HTML.
An ftp site will be available on demand.
Authors should send an info email to (Yorick Wilks) even
if they submit in paper form. An electronic submission should be
accompanied by a plain ascii text.

# NAME : Name of first author
# TITLE: Title of the paper
# PAGES: Number of pages
# FILES: Name of file (if also submitted electronically)
# NOTE : Anything you'd like to add
# KEYS : Keywords
# EMAIL: Email of the first author
# ABSTR: Abstract of the paper
# . . . . . .


Paper Submission Deadline (Hard Copy/Electronic) 	February 15th 1998
Paper Notification 				April 1st
Camera-Ready Papers Due 				May 1st
DALR workshop 					May 27st
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue