* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
LINGUIST List 22.2903

Fri Jul 15 2011

Qs: Initial Training for Speech Recognition Software

Editor for this issue: Brent Woo <bwoolinguistlist.org>

We'd like to remind readers that the responses to queries are usually best posted to the individual asking the question. That individual is then strongly encouraged to post a summary to the list. This policy was instituted to help control the huge volume of mail on LINGUIST; so we would appreciate your cooperating with it whenever it seems appropriate.

In addition to posting a summary, we'd like to remind people that it is usually a good idea to personally thank those individuals who have taken the trouble to respond to the query.

To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.cfm.
        1.     Anna Haberko , Initial Training for Speech Recognition Software

Message 1: Initial Training for Speech Recognition Software
Date: 13-Jul-2011
From: Anna Haberko <ahaberkogmail.com>
Subject: Initial Training for Speech Recognition Software
E-mail this message to a friend

My company is developing software for doctors to dictate reports. Our
software relies on a speech recognition engine that is trained to
recognize words. To improve on the current model, I am redesigning
the initial speech training component. As I would like to develop
effective material, I am looking for insight on the following questions:

What are the requirements for initial speech training text (to be read by
the user of speech recognition in order to initially train the speech
engine, and start working with a satisfactory level of recognition)?
Does it have to include all possible phonemes of a language?
Do they have to repeat certain number of times?
If the full phonemic inventory is not required, what would be necessary
for a language such as English?
What other requirements should I consider for such a text?

While I have attempted to do some research on this subject, I have had
trouble finding adequate guidelines for this, and speech corpora have
not really been searchable for texts like this. I have an exemplary text
of SpeechMagic software (provided by Nuance), but I would be grateful
for any additional examples people could provide. Any other resources
or guidelines for speech recognition development would also be greatly

Linguistic Field(s): Computational Linguistics

Read more issues|LINGUIST home page|Top of issue

Page Updated: 15-Jul-2011

Supported in part by the National Science Foundation       About LINGUIST    |   Contact Us       ILIT Logo
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed on its pages, it cannot vouch for their contents.