LINGUIST List 7.1666

Sun Nov 24 1996

Sum: Lancaster Corpus

Editor for this issue: Susan Robinson <robinsonemunix.emich.edu>


Directory

  1. Guenter Schubert, Sum: Lancaster Corpus

Message 1: Sum: Lancaster Corpus

Date: Fri, 22 Nov 1996 17:14:00 +0100
From: Guenter Schubert <0431659800-0001t-online.de>
Subject: Sum: Lancaster Corpus

Recently I issued questions about the existence and nature of the
Lancaster Corpus, also about access possibilities and availability on
a CD-ROM. I'm indebted to the following people all over the world for
their helpful hints:

1)Tracy Cameron Mansfield ( mansfieldmailworldnet.att.net)
2) Yael Maschler ( yaelmvms.huji.ac.il)
3) Donald C. Freeman (dfreemanbcf.usc.edu)
4)M.Hundt (hundtrcs.urz.tu-dresden.de)
5)Jonathan Swift (jonsais.co.uk)
6) Sung-Ho Ahn (shahnemail.hanyang.ac.kr)
7) Ingo Plag (plagmailer.uni-marburg.de)
8) Jane Setter (egjanespolyu.edu.hk)
9) Martha Jo McGinnis ( marthajoMIT.edu)
10) Suzanne E Kemmer ( kemmerruf.rice.edu)
11) Dennis Newson ( dnewsondosunil.rz.uni-osnabrueck.de)
12) Lex Olorenshaw (lexolsi.sel.sony.com)

These helpful people sent me information as to specialists involved in
the completion of the corpus or to people who could help for other
reasons. I got relevant WWW- and e-mail-addresses, and hints at other
existing corpora.

The Corpus in question is a joint enterprise of Lancaster - Oslo -
Bergen ( the LOB - Corpus). It was compiled at the Norwegian Computing
Centre for the Humanities in Bergen; it is a 30-mill. word archive of
the English published in 1961. There is a (slightly expensive) CD-ROM
available that also includes LOB's American equivalent, the BROWN-
Corpus (Univ.,Providence, Rhode Island, USA; also known as
FRANCIS/KUCHERA- Corpus),and the LONDON-LUND-Corpus.

Information about the CD-ROM can be acquired from the International
Computer Archive of Modern English in Oslo: ICAME (e-mail:
icamehd.uib.no)

Other corpora mentioned in the correspondence were: MARSEC (Leeds
Univ.) and PENN TREEBANK (Univ. of Pennsylvania).

Relevant internet addresses:

http://www.awl-elt.com/dictionaries/lasde.html (= The Longman's
 dictionary web site)
http://www.ruf.rice.edu/~barlow/corpus.html ( = available corpora)
http://www.cis.upenn.edu/~treebank/home.html (= Treebank Corpus)
http://www.ldc.upenn.edu/ (= Linguistic Data Consortium : American
 English data)
http://www.dcs.shef.ac.uk/research/ilash/info/papers/ShATR (=for
 British Engl.)

The Univ. of Freiburg/Germany is presently working on a corpus which
will be an update (1990s) of LOB and BROWN.

Thank you all very much, I appreciate your help.

Guenter Schubert
Kiel/Germany
e-mail: guenter.schubertt-online.de
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue