LINGUIST List 13.595

Mon Mar 4 2002

FYI: New Corpus Release, 1 Year M.Phil in Ling

Editor for this issue: Marie Klopfenstein <>


  1. LDC Office, New ELDA/LDC Corpus Release
  2. K.M. Jaszczolt, M.Phil. in Cambridge

Message 1: New ELDA/LDC Corpus Release

Date: Thu, 28 Feb 2002 13:18:30 -0500
From: LDC Office <>
Subject: New ELDA/LDC Corpus Release

Cooperation Between ELDA and LDC - Distribution of Language Resources

Networking Data Centers, "Net-DC", (MLIS-5017), aims to improve the
infrastructure for language resources, by designing and implementing new
modes of cooperation between the Linguistic Data Consortium (LDC) and
the European Language Resources Distribution Agency (ELDA). In the
framework of this cooperation, LDC and ELDA are happy to announce the
following joint distribution of language resources.

Translanguage English Database (TED)
ELRA reference:
LDC reference:

The Translanguage English Database (TED) is a corpus of recordings made
of oral presentations at Eurospeech'93 in Berlin. The corpus name
derives from the high percentage of oral presentations given in English
by non-native speakers of English. Two hundred twenty-four (224) oral
presentations at the conference were successfully recorded, providing a
total of about 75 hours of speech material. These recordings provide a
large number of presenters, speaking multiple variants of English, over
a relatively large amount of time (15 minutes for each presentation + 5
minutes of discussion), on a specific topic. This release of TED (6
CDROMs) includes 188 speeches, without the ensuing discussion periods.
This database was produced with the support of ELSNET. Associated text
materials consist of ASCII versions of over 400 proceedings papers and
oral preparations that were supplied by the authors, as well as, 250
speaker questionnaires.

Translanguage English Database (TED) Transcripts
ELRA reference:
LDC reference:

The Translanguage English Database (TED) Transcripts corpus contains
transcriptions of thirty-nine of the 188 speeches of the TED Corpus
(ELRA ref.: ; LDC
ref.: made at
Eurospeech'93 in Berlin. The thirty-nine transcripts in this publication
are in Universal Transcription Format (UTF) and were prepared by the
LDC. All utf files in the transcript publication were validated against
an included utf.dtd. Tables containing speaker demographic information
and a cross-reference of file names from the TED audio corpus are

For further information, please contact ELRA/ELDA or LDC at:

55-57 rue Brillat-Savarin
F-75013 Paris, France
Tel: +33 01 43 13 33 33
Fax: +33 01 43 13 33 30
Email: or

LDC - Linguistic Data Consortium
3615 Market Street, Suite 200
PA 19104-2608 Philadelphia, USA
Tel: (215) 898-0464
Fax: (215) 573-2175
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

Message 2: M.Phil. in Cambridge

Date: Thu, 28 Feb 2002 16:50:16 -0000
From: K.M. Jaszczolt <>
Subject: M.Phil. in Cambridge

Applications are invited for a one-year M.Phil. course in Linguistics 
offered by the Department of Linguistics, University of Cambridge, UK. The 
taught component of the course provides training in phonetics, phonology, 
morphology, syntax and semantics, as well as research methods, with options 
also available in historical linguistics, pragmatics and Romance 
linguistics. Students also conduct two pieces of research on topics of 
their choice, selected from a wide range of areas of expertise offered by 
the linguists of the Faculty of Modern and Medieval Languages. The course 
can be taken as a one-year programme ('M.Phil. only')or can serve as a 
preparation for a PhD ('M.Phil. in the first instance'). For further 
details and application procedure see

Dr Kasia Jaszczolt
Director of the M.Phil.
Department of Linguistics
Faculty of Modern and Medieval Languages
University of Cambridge
United Kingdom


Newnham College
Cambridge CB3 9DF

tel +44 1223 335744
fax +44 1223 335062

Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue