THE THIRD WORKSHOP ON VERY LARGE CORPORA
-----------------------------------------

Friday, 30 June 1995
8:45 AM - 5:25 PM
MIT, Cambridge, Massachusetts, USA
at ACL-95

(Sponsored by ACL's SIGDAT and SIGNLL, LEXIS-NEXIS)

The workshop will present original research in corpus-based and statistical natural language processing. Topics will include sense disambiguation, grammar induction, part-of-speech tagging, information retrieval, language modeling, and machine translation.

This year's theme is:

    Supervised Training vs. Self-organizing Methods

Historically, annotated corpora have made a significant contribution to tasks such as part-of-speech tagging and sense disambiguation. But annotated corpora are expensive and generally unavailable for languages other than English. Self-organizing methods offer the hope that annotated corpora might not be necessary. Can we achieve comparable performance using little or no tagged training data? What are the tradeoffs?

Organizers: Ken Church and David Yarowsky

REGISTRATION:

Registration fees are $35 for participants who register by 19 May 1995, $40 for payment received by 15 June 1995, and $45 at the door. Registration includes a copy of the proceedings, catered lunch, and refreshments during the day.

Acceptable forms of payment are US$ cheques payable to "ACL" or credit card (VISA/Mastercard) payment. E-mail registrations are encouraged.

Please submit the following form along with payment:

----------------------------------------------------
Name:
Institution (for name tag):
Postal address:
Email address:
Payment (specify cheque or credit card):
Credit card info
  - Name on card:
  - Card number:
  - Expiration date:
Dietary requirements (vegetarian, etc.):
-------------------------------------------------------

Please send to:

David Yarowsky
Dept. of Computer and Information Science
University of Pennsylvania
200 S. 33rd St.
Philadelphia, PA 19104-6389
USA
email: yarowsky@unagi.cis.upenn.edu

PROGRAM

 8:15 -  8:45   Registration. Coffee, danish, etc. available
 8:45 -  8:50   Welcome
 8:50 -  9:35   INVITED TALK (Mark Liberman)
 9:35 -  9:50   Break
 9:50 - 10:15   Eric Brill
                Unsupervised Learning of Disambiguation Rules for Part of Speech Tagging
10:15 - 10:40   Carl de Marcken
                Lexical Heads, Phrase Structure and the Induction of Grammar
10:40 - 11:05   Michael Collins and James Brooks
                Prepositional Phrase Attachment through a Backed-off Model
11:05 - 11:15   Break
11:15 - 11:40   Andrew Golding
                A Bayesian Hybrid Method for Context-sensitive Spelling Correction
11:40 - 12:05   Philip Resnik
                Disambiguating Noun Groupings with Respect to Wordnet Senses
12:05 -  1:05   CATERED LUNCH
 1:05 -  1:30   Dekai Wu
                Trainable Coarse Bilingual Grammars for Parallel Text Bracketing
 1:30 -  1:55   Lance Ramshaw and Mitch Marcus
                Text Chunking using Transformation-Based Learning
 1:55 -  2:05   Break
 2:05 -  3:00   INVITED TALK (Henry Kucera and Nelson Francis)
 3:00 -  3:10   Break
 3:10 -  3:35   Fernando Pereira, Yoram Singer and Naftali Tishby
                Beyond Word N-Grams
 3:35 -  4:00   Jing-Shin Chang, Yi-Chung Lin and Keh-Yih Su
                Automatic Construction of a Chinese Electronic Dictionary
 4:00 -  4:10   Break
 4:10 -  4:35   Ken Church and Bill Gale
                Inverse Document Frequency (IDF): A Measure of Deviations from Poisson
 4:35 -  5:00   Joe Zhou and Pete Dapkus
                Automatic Suggestion of Significant Terms for a Predefined Topic
 5:00 -  5:25   Ellen Riloff and Jay Shoen
                Automatically Acquiring Conceptual Patterns without an Annotated Corpus

More Information: http://www.cis.upenn.edu/~yarowsky/wvlc3.html
ACL-95 Homepage: http://www.ai.mit.edu/people/cgdemarc/acl/acl-info.html