Publishing Partner: Cambridge University Press CUP Extra Wiley-Blackwell Publisher Login
amazon logo
More Info

New from Oxford University Press!


Language Planning as a Sociolinguistic Experiment

By: Ernst Jahr

Provides richly detailed insight into the uniqueness of the Norwegian language development. Marks the 200th anniversary of the birth of the Norwegian nation following centuries of Danish rule

New from Cambridge University Press!


Acquiring Phonology: A Cross-Generational Case-Study

By Neil Smith

The study also highlights the constructs of current linguistic theory, arguing for distinctive features and the notion 'onset' and against some of the claims of Optimality Theory and Usage-based accounts.

New from Brill!


Language Production and Interpretation: Linguistics meets Cognition

By Henk Zeevat

The importance of Henk Zeevat's new monograph cannot be overstated. [...] I recommend it to anyone who combines interests in language, logic, and computation [...]. David Beaver, University of Texas at Austin

Summary Details

Query:   Sum: Speech Corpus for Neural Network Training
Author:  Scott Drellishak
Submitter Email:  click here to access email
Linguistic LingField(s):   Text/Corpus Linguistics

Summary:   A few weeks ago, I posted a request for information about speech corpora of a particular kind to both the Linguist List and the Corpora-List. This is the (somewhat belated) summary.

I described the corpora we are seeking as follows:

''We are looking for a corpus that contains samples of many speakers producing many vowels (preferably in a less reduced register) that also contains human-validated pitch and formant (F1, F2, and F3) tracks and, if possible, bandwidth information. A corpus that contains more than just vowels is fine, since we can discard sections of the samples that do not suit our needs.''

I received five replies:

1) John Lawler suggested MICASE (Michigan Corpus of Academic
Spoken English), which is available here:

2) Lesley Carmichael suggested I post my request to the

3) Jane Edwards pointed me at the Switchboard Transcription

4) Susana Sotillo wrote, ''At a recent conference (CALICO) I
saw a demonstration of the Speechcalator (Allen Blackwell
and associates). Why don't you write him at Carnegie-

5) Linda Bawcom offered an hour and a half of taped
conversation that she used in her MA research.

Many thanks to everyone who replied.

Scott Drellishak
University of Washington
Seattle, WA

LL Issue: 15.2363
Date Posted: 23-Aug-2004
Original Query: Read original query


Sums main page