Editor for this issue: Martin Jacobsen <marty
linguistlist.org>
Call for Participation A Workshop on Minimizing the Effort for Language Resource Acquisition Granada, Spain, 26 May, 1998 in conjunction with The First International Conference on Language Resources and Evaluation Granada, Spain, 28-30, May 1998 An applied NLP system must produce adequate results and must be made deployable within reasonable time. Gathering and acquiring language resources to build an application system is very time-consuming, and it is imperative to find ways of speeding up acquisition of high quality, useful static knowledge sources such as a variety of grammars, lexicons, corpora, etc. Viability of avoiding masive resource acquisition, if possible, must also be carefully considered. Resource acquisition should include methods, based both on sound theoretical principles and practical experience, of deciding, among other things, on the amount of knowledge one *really* needs for a given application. Increasing the size of knowledge sources or their number and variety does not necessarily lead to a commensurate improvement of output quality in an application, though a correlation between the two certainly exists, but it definitely needs to much increased costs. No matter how large the acquired resources are and how many of them have been acquired, there will always remain a residue of language processing problems which can be tackled only by foregoing the requirement of full automation and involving expensive semi-automatic or even manual acquisition. It becomes imperative, therefore, to assess when the static knowledge source acquisition is NO LONGER PROFITABLE. Thus, in a system for interactive authoring and automatic generation of patent claim texts, the lexical knowledge base can be restricted to a lexicon of domain-related verbs marked for subcategorization (as the nominals are provided interactively by the author). The proposed workshop will be devoted to ANY TECHNOLOGICAL AND ADMINISTRATIVE FACET OF ECONOMY OF ACQUISITION EFFORT. The technological issues to be discussed at the conference include, BUT ARE NOT LIMITED TO: - minimization of effort in acquiring monolingual and multilingual text corpora; - minimization of effort in acquiring computational lexicons, including phonological, morphological, syntactic, semantic and other (including application-specific) information; - minimization of effort in acquisition of resources for the support of corpus-based language engineering methods; - minimization of effort in acquiring grammatical coverage of languages and sublanguages; - methods of determining levels of reusability of existing language resources; - balancing the needs of the application and the grain size of language description; - minimization of effort through balancing automatic and interactive methods of knowledge acquisition; - evaluation of potential utility of resources to applications; We particularly encourage reports about actual practical large-scale resource acquisition efforts in which economy of effort has been a conscious choice. Organizing Committee: Svetlana Sheremetyeva, NMSU CRL, USA (Chair) Eduard Hovy, USC ISI, USA Bernardo Magnini, IRST, Italy Sergei Nirenburg, NMSU CRL, USA Victor Raskin, Purdue University, USA Frederique Segonde, Xerox Research Centre Europe, France Leo Wanner, University of Stuttgart, Germany SUBMISSION OF PAPERS Papers should not exceed 4000 words or 10 pages. Presentations will be selected on the basis of a review of papers and project reports. SUBMISSION MODE Each submission should include a title page containing the title, author(s), affiliation(s), submitting author's mailing address, telephone number, fax number and e-mail address. The authors may submit three hard copies OR submit ELECTRONICALLY in postscript form to: Svetlana Sheremetyeva Computing Research Laboratory New Mexico State University, USA Box30001/Dept.3CRL/Las Cruces New Mexico 88003-8001 lanaMail to author|Respond to list|Read more issues|LINGUIST home page|Top of issuecrl.nmsu.edu Receipt of submissions will be acknowledged. IMPORTANT DATES Thursday, February 19, 1998 Submissions due Monday, March, 16 1998 Acceptances and rejections Friday, April 10 1998 Final papers due Tuesday, May 26, 1998 Workshop date Registration for the workshop will be: 10,000 pesetas for those not attending LREC 5,000 pesetas for those attending LREC These fees will include a coffee break and the proceedings of the workshop. Participation in the workshop will be limited by the venue. Requests for participation will be processed on the first come first served basis.
CALL FOR PAPERS SIXTH WORKSHOP ON VERY LARGE CORPORA WHEN: August 15-16, 1998 (immediately following ACL/COLING-98) WHERE: University of Montreal, Montreal, Quebec, Canada WORKSHOP DESCRIPTION: As in past years, the workshop will offer a general forum for new research in corpus-based and statistical natural language processing. Areas of interest include (but are not limited to): - robust parsing, phrase structure analysis - part of speech tagging - term and name identification - word sense disambiguation - morphological analysis - anaphora resolution - event categorization - discourse structure identification - alignment of parallel texts and bilingual terminology - language modelling - lexicography - machine translation - spelling and grammar correction PROGRAM CHAIR: Eugene Charniak Brown University PROGRAM COMMITTEE: Steven Abney Lillian Lee Eric Brill Christopher Manning Ted Briscoe Dan Melamed Rebecca Bruce Scott Miller Claire Cardie Raymond Mooney Bob Carpenter James Pustejovksy Glen Carroll Lance Ramshaw Ken Church Adwait Rathnaparkhi Michael Collins Ellen Riloff Joshua Goodman Hinrich Schutze Vasilis Hatzivassiloglou Ralph Weischedel Mark Johnson Janyce Wiebe Andrew Kehler Dekai Wu John Lafferty David Yarowsky SPONSOR: SIGDAT (ACL's special interest group for linguistic data and corpus-based approaches to NLP) WEB SITES: COLING-ACL'98 - http://coling-acl98.iro.umontreal.ca/ FORMAT FOR SUBMISSION: Only hard-copy submissions will be accepted. Authors should submit six (6) copies of their full-length paper (3500-8000 words) to Eugene Charniak at the Johns Hopkins University address below. Papers should describe original work. A paper accepted for presentation cannot be presented or have been presented at any other meeting. Papers submitted to other conferences will be considered, as long as this fact is clearly indicated in the submission. SCHEDULE: Submission Deadline: April 20, 1998 Notification Date: June 1, 1998 Camera ready copy due: June 22, 1998 CONTACT: Eugene Charniak e-mail ecMail to author|Respond to list|Read more issues|LINGUIST home page|Top of issuecs.brown.edu Address: Before February 1, 1998 and After June 1, 1998 Department of Computer Science Brown University Providence RI 02912-1910 Address: From February 1, 1998 until June 1, 1998 Department of Computer Science Johns Hopkins University NEB 224, 3400 N. Charles Street Baltimore, MD 21218-2694