LINGUIST List 9.35

Sun Jan 11 1998

Calls: LREC Workshop, 6th Workshop/Very Large Corpora

Editor for this issue: Martin Jacobsen <>

Please do not use abbreviations or acronyms for your conference unless you explain them in your text. Many people outside your area of specialization will not recognize them. Also, if you are posting a second call for the same event, please keep the message short. Thank you for your cooperation.


  1. Svetlana Sheremetyeva, LREC Workshop
  2. Eugene Charniak, 6th Workshop on Very Large Corpora

Message 1: LREC Workshop

Date: Fri, 9 Jan 1998 13:56:29 -0700 (MST)
From: Svetlana Sheremetyeva <>
Subject: LREC Workshop

 Call for Participation
 A Workshop on Minimizing the Effort
 for Language Resource Acquisition
 Granada, Spain, 26 May, 1998
 in conjunction with 
 The First International Conference
 on Language Resources and Evaluation
 Granada, Spain, 28-30, May 1998
 An applied NLP system must produce adequate results and must be
made deployable within reasonable time. Gathering and acquiring
language resources to build an application system is very
time-consuming, and it is imperative to find ways of speeding up
acquisition of high quality, useful static knowledge sources such as a
variety of grammars, lexicons, corpora, etc. Viability of avoiding
masive resource acquisition, if possible, must also be carefully
 Resource acquisition should include methods, based both on sound
theoretical principles and practical experience, of deciding, among
other things, on the amount of knowledge one *really* needs for a
given application. Increasing the size of knowledge sources or their
number and variety does not necessarily lead to a commensurate
improvement of output quality in an application, though a correlation
between the two certainly exists, but it definitely needs to much
increased costs.
 No matter how large the acquired resources are and how many of them
have been acquired, there will always remain a residue of language
processing problems which can be tackled only by foregoing the
requirement of full automation and involving expensive semi-automatic
or even manual acquisition. It becomes imperative, therefore, to
assess when the static knowledge source acquisition is NO LONGER
PROFITABLE. Thus, in a system for interactive authoring and automatic
generation of patent claim texts, the lexical knowledge base can be
restricted to a lexicon of domain-related verbs marked for
subcategorization (as the nominals are provided interactively by the
 The proposed workshop will be devoted to ANY TECHNOLOGICAL AND
 The technological issues to be discussed at the conference include,
- minimization of effort in acquiring monolingual and multilingual
	text corpora;
- minimization of effort in acquiring computational lexicons,
	including phonological, morphological, syntactic, semantic 
	and other (including application-specific) information;
- minimization of effort in acquisition of resources for the support
	of corpus-based language engineering methods;
- minimization of effort in acquiring grammatical coverage of
	languages and sublanguages;
- methods of determining levels of reusability of existing language
- balancing the needs of the application and the grain size of
	language description;
- minimization of effort through balancing automatic and interactive
	methods of knowledge acquisition;
- evaluation of potential utility of resources to applications;
We particularly encourage reports about actual practical large-scale
resource acquisition efforts in which economy of effort has been a
conscious choice.
Organizing Committee:
 Svetlana Sheremetyeva, NMSU CRL, USA (Chair)
 Eduard Hovy, USC ISI, USA
 Bernardo Magnini, IRST, Italy
 Sergei Nirenburg, NMSU CRL, USA
 Victor Raskin, Purdue University, USA
 Frederique Segonde, Xerox Research Centre Europe, France
 Leo Wanner, University of Stuttgart, Germany
Papers should not exceed 4000 words or 10 pages. Presentations will be
selected on the basis of a review of papers and project reports.
 Each submission should include a title page containing the title,
author(s), affiliation(s), submitting author's mailing address,
telephone number, fax number and e-mail address.
The authors may submit three hard copies OR submit ELECTRONICALLY in
postscript form to:
 Svetlana Sheremetyeva
 Computing Research Laboratory 
 New Mexico State University, USA
 Box30001/Dept.3CRL/Las Cruces
 New Mexico 88003-8001
 Receipt of submissions will be acknowledged. 
 Thursday, February 19, 1998 Submissions due
 Monday, March, 16 1998 Acceptances and rejections 
 Friday, April 10 1998 Final papers due
 Tuesday, May 26, 1998 Workshop date
 Registration for the workshop will be:
 10,000 pesetas for those not attending LREC
 5,000 pesetas for those attending LREC
These fees will include a coffee break and the proceedings of the
Participation in the workshop will be limited by the venue. Requests
for participation will be processed on the first come first served
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

Message 2: 6th Workshop on Very Large Corpora

Date: Thu, 8 Jan 1998 14:51:04 -0500 (EST)
From: Eugene Charniak <>
Subject: 6th Workshop on Very Large Corpora



WHEN: August 15-16, 1998 (immediately following ACL/COLING-98)
WHERE: University of Montreal, Montreal, Quebec, Canada


As in past years, the workshop will offer a general forum for new
research in corpus-based and statistical natural language processing.
Areas of interest include (but are not limited to):

 - robust parsing, phrase structure analysis
 - part of speech tagging
 - term and name identification
 - word sense disambiguation
 - morphological analysis
 - anaphora resolution
 - event categorization
 - discourse structure identification
 - alignment of parallel texts and bilingual terminology
 - language modelling
 - lexicography
 - machine translation
 - spelling and grammar correction


	Eugene Charniak 	Brown University

	Steven Abney			Lillian Lee
	Eric Brill			Christopher Manning
	Ted Briscoe			Dan Melamed
	Rebecca Bruce			Scott Miller
	Claire Cardie			Raymond Mooney
	Bob Carpenter			James Pustejovksy
	Glen Carroll			Lance Ramshaw
	Ken Church			Adwait Rathnaparkhi
	Michael Collins			Ellen Riloff
	Joshua Goodman			Hinrich Schutze
	Vasilis Hatzivassiloglou	Ralph Weischedel
	Mark Johnson 			Janyce Wiebe
	Andrew Kehler			Dekai Wu
	John Lafferty			David Yarowsky

SPONSOR: SIGDAT (ACL's special interest group for linguistic data
 and corpus-based approaches to NLP)



Only hard-copy submissions will be accepted. Authors should submit
six (6) copies of their full-length paper (3500-8000 words) to Eugene
Charniak at the Johns Hopkins University address below. Papers should
describe original work. A paper accepted for presentation cannot be
presented or have been presented at any other meeting. Papers
submitted to other conferences will be considered, as long as this
fact is clearly indicated in the submission.


 Submission Deadline: April 20, 1998
 Notification Date: June 1, 1998
 Camera ready copy due: June 22, 1998 


 Eugene Charniak

 Address:	Before February 1, 1998	and After June 1, 1998

	Department of Computer Science
	Brown University
	Providence RI 02912-1910

 Address:	From February 1, 1998 until June 1, 1998

	Department of Computer Science
	Johns Hopkins University
	NEB 224, 3400 N. Charles Street
	Baltimore, MD 21218-2694 
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue