LINGUIST List 11.1924

Wed Sep 13 2000

FYI: Corpus/Middle English, Penn Ling Colloquium

Editor for this issue: Lydia Grebenyova <lydialinguistlist.org>


Directory

  • Tony Kroch, The Penn-Helsinki Parsed Corpus of Middle English
  • Alexander Williams, Penn Linguistics Colloquium 24 - Proceedings

    Message 1: The Penn-Helsinki Parsed Corpus of Middle English

    Date: Mon, 11 Sep 2000 17:56:34 -0400
    From: Tony Kroch <krochchange.ling.upenn.edu>
    Subject: The Penn-Helsinki Parsed Corpus of Middle English


    The second edition of the Penn-Helsinki Parsed Corpus of Middle English (PPCME2) is now publicly available under the conditions outlined below. It consists of 55 text samples containing 1.3 million words of syntactically annotated Middle English prose and ranging over four time periods, from 1150 to 1500.

    Like the first edition of the PPCME, the PPCME2 is based on the Middle English portion of the Helsinki Corpus of English Texts that was created at the University of Helsinki under the direction of Matti Rissanen and Ossi Ihalainen. The size of the text samples in the second edition has been enlarged so that the total corpus size is nearly three times larger. In addition, the corpus is now tagged for part of speech and the syntactic annotation system is richer. For the earliest time period, all texts except one are complete; the exception is the Ancrene Riwle sample, which contains approximately 50,000 words. For the later time periods, two texts per time period were expanded to approximately 50,000 words. The remaining texts are represented by the Helsinki Corpus sample.

    The PPCME2 is being distributed on a CD-ROM that includes several files for each text in the corpus:

    - a file with unannotated text - a file with philological and other information about the text (manuscript and edition used, date, dialect, genre, and word count of the sample) - a file in which individual words are tagged for part of speech - a file that is annotated for syntactic structure

    Available with the corpus is CorpusSearch, a Java program written by Beth Randall that runs under Unix, Linux, MacOS and Windows. CorpusSearch uses standard syntactic predicates like ``(immediately) precedes'', ``(immediately) dominates'', and Boolean combinations thereof, and it allows outputs of previous search as inputs to further searches.

    To order the PPCME2, please go to:

    http://www.ling.upenn.edu/mideng

    and follow the instructions there.

    The cost of a subscription to the corpus is $200 and the cost of a license for CorpusSearch is $50. The items may be purchased together or separately. Proceeds from the sale of the corpus will pay for improving the corpus and for increasing its size over time. Proceeds from the sale of CorpusSearch will go to the author.

    The PPCME2 was designed and built by Anthony Kroch and Ann Taylor at the University of Pennsylvania. Supplementary assistance was provided by Beatrice Santorini. The PPCME2 is part of of a larger project to produce a parsed diachronic corpus of English from 800 to 1800. The Old English part is under construction at York under the direction of Anthony Warner, Susan Pintzuk, and Ann Taylor and the Early Modern English part is under construction at the University of Pennsylvania under the direction of Kroch and Santorini.

    Message 2: Penn Linguistics Colloquium 24 - Proceedings

    Date: Tue, 12 Sep 2000 14:56:33 -0400 (EDT)
    From: Alexander Williams <alexand3babel.ling.upenn.edu>
    Subject: Penn Linguistics Colloquium 24 - Proceedings


    The University of Pennsylvania Working Papers in Linguistics is pleased to announce the publication of

    ************************

    PWPL 7.1 (2000) Proceedings of the 24th Annual Penn Linguistics Colloquium M.Minnick-Fox, A.Williams & E.Kaiser (eds.)

    ************************

    - ---------------------------------------------------------------------------- [Ordering information follows the table of contents.]

    TABLE OF CONTENTS

    Bakovic', Eric "Nasal Place Assimilation in Spanish"

    Bender, Emily "Non-Categorical Constraints in Perception"

    Bergen, Benjamin K. "Implications of syntax-phonology interactions for phonological models"

    Bleam, Tonia, Chung-hye Han and Jeffrey Lidz "Grammatical Downsizing and the Redistribution of Semantic Wealth"

    Chang Woohyeok "Geminate vs. Non-Geminate Consonants in Italian: Evidence from a phonetic analysis"

    Citko, Barbara "Reconstruction and (Free) Relatives"

    Grohmann, Kleanthes K. "Movement issues in left-dislocation constructions"

    Grommes, Patrick "Contributing to Coherence: An Empirical Study on OR-Team Communication"

    Han, Chunghye, Mee-sook Kim and Juntae Yoon "Getting Rid of `Double Relative Clauses' in Korean"

    Idsardi, William J & Sun-Hoi Kim "The phonology-syntax interface in North Kyungsang Korean"

    Kennedy, Becky "Acquisition of Constraints on Forward Coreference"

    Kulick, Seth "A Unified Account of Clitic Climbing and Long Distance Scrambling"

    Lee, Minkyung "Focus and morpho-syntax in Korean accentual phrasing"

    Massey, Victoria W. "Perfect, Passive and Reflexive in Albanian"

    Meroni, Luisa, Andrea Gualmini and Stephen Crain "A conservative account of quantification in child language"

    Pekarek Doehler, Simona "Anaphora in Conversation: Grmmatical coding and preference organization."

    Pylkkanen, Liina "What Applicative Heads Apply to"

    Rathert, Monica "Universal/existential ambiguities in German"

    San Martin, Itziar "The Control basis for Obviation in Basque."

    Sharvit, Yael "Tense in copular constructions: Identifying three types of copular sentences."

    Sichel, Ivy "Demonstratives, the binding theory and identity"

    Sugahara, Mariko "The Japanese Major Phrase Formation and NONFINALITY"

    Tokizaki Hisao "Syntax-Phonology Mapping and the Length of Constituents"

    - -----------------------------------------------------------------------

    ORDERING INFORMATION

    Issues are US$15.00 each. Shipping is free, even interationally. Checks and money-orders should be made out to: "Penn Linguistics Club." Payment must be received before an order can be processed.

    Please send orders, with payment, to:

    PWPL 619 Williams Hall Department of Linguistics University of Pennsylvania Philadelphia, PA 19104-6305 USA

    Please direct queries to: working-papersling.upenn.edu For general information about the PWPL, see: www.ling.upenn.edu/papers/pwpl.html



    Thank you, Alexander Williams Editor, PWPL

    ===========================

    Alexander Williams Department of Linguistics 619 Williams Hall University of Pennsylvania Philadelphia, PA 19104

    dept phone: (215) 898-6046 home phone: (215) 386-6961

    ===========================