Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info

New from Oxford University Press!


Oxford Handbook of Corpus Phonology

Edited by Jacques Durand, Ulrike Gut, and Gjert Kristoffersen

Offers the first detailed examination of corpus phonology and serves as a practical guide for researchers interested in compiling or using phonological corpora

New from Cambridge University Press!


The Languages of the Jews: A Sociolinguistic History

By Bernard Spolsky

A vivid commentary on Jewish survival and Jewish speech communities that will be enjoyed by the general reader, and is essential reading for students and researchers interested in the study of Middle Eastern languages, Jewish studies, and sociolinguistics.

New from Brill!


Indo-European Linguistics

New Open Access journal on Indo-European Linguistics is now available!

Query Details

Query Subject:   Initial Training for Speech Recognition Software
Author:   Anna Haberko
Submitter Email:  click here to access email

Linguistic LingField(s):  Computational Linguistics

Query:   My company is developing software for doctors to dictate reports. Our
software relies on a speech recognition engine that is trained to
recognize words. To improve on the current model, I am redesigning
the initial speech training component. As I would like to develop
effective material, I am looking for insight on the following questions:

What are the requirements for initial speech training text (to be read by
the user of speech recognition in order to initially train the speech
engine, and start working with a satisfactory level of recognition)?
Does it have to include all possible phonemes of a language?
Do they have to repeat certain number of times?
If the full phonemic inventory is not required, what would be necessary
for a language such as English?
What other requirements should I consider for such a text?

While I have attempted to do some research on this subject, I have had
trouble finding adequate guidelines for this, and speech corpora have
not really been searchable for texts like this. I have an exemplary text
of SpeechMagic software (provided by Nuance), but I would be grateful
for any additional examples people could provide. Any other resources
or guidelines for speech recognition development would also be greatly
LL Issue: 22.2903
Date posted: 15-Jul-2011


Sums main page