Sum: Speech Corpus for Neural Network Training
|Submitter Email:||click here to access email|
A few weeks ago, I posted a request for information about speech corpora of a particular kind to both the Linguist List and the Corpora-List. This is the (somewhat belated) summary.
I described the corpora we are seeking as follows:
''We are looking for a corpus that contains samples of many speakers producing many vowels (preferably in a less reduced register) that also contains human-validated pitch and formant (F1, F2, and F3) tracks and, if possible, bandwidth information. A corpus that contains more than just vowels is fine, since we can discard sections of the samples that do not suit our needs.''
I received five replies:
1) John Lawler suggested MICASE (Michigan Corpus of Academic
Spoken English), which is available here:
2) Lesley Carmichael suggested I post my request to the
3) Jane Edwards pointed me at the Switchboard Transcription
4) Susana Sotillo wrote, ''At a recent conference (CALICO) I
saw a demonstration of the Speechcalator (Allen Blackwell
and associates). Why don't you write him at Carnegie-
5) Linda Bawcom offered an hour and a half of taped
conversation that she used in her MA research.
Many thanks to everyone who replied.
University of Washington
|Original Query:||Read original query|
Sums main page