* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
 
E-mail this message to a friend
Title: Accent Features and Idiodictionaries: On improving accuracy for accented speakers in ASR
Author: Michael Tjalve
Email: click here to access email
Degree Awarded: University College London , PhD in Experimental Phonetics
Degree Date: 2007
Linguistic Subfield(s): Phonetics
Director(s): Mark Huckvale

Abstract:

One of the most widespread approaches to dealing with the problem of accent
variation in ASR has been to choose the most appropriate pronunciation
dictionary for the speaker from a predefined set of dictionaries. This
approach is weak in two ways: firstly that accent types are more numerous
and more variable than can be captured in a few dictionaries, even if the
knowledge were available to create them; and secondly, accents vary in the
composition and phonotactics of the phone inventory not just in which
phones are used in which word.

In this work, we identify not the speaker's accent, but accent features
which allow us to predict by rule their likely pronunciation of all words
in the dictionary. Any given speaker is associated with a set of accent
features, but it is not a requirement that those features constitute a
known accent. We show that by building a pronunciation dictionary for an
individual, an idiodictionary, recognition accuracy can be improved over a
system using standard accent dictionaries.

The idiodictionary approach could be further enhanced by extending the set
of phone models to improve the modelling of phone inventory and variation
across accents. However an extended phoneme set is difficult to build since
it requires specially-labelled training material, where the labelling is
sensitive to the speaker's accent. An alternative is to borrow phone models
of a suitable quality from other languages. In this work, we show that this
phonetic fusion of languages can improve the recognition accuracy of the
speech of an unknown accent.

This work has practical application in the construction of speech
recognition systems that adapt to speakers' accents. Since it demonstrates
the advantages of treating speakers as individuals rather than just as
members of a group, the work also has potential implications for how
accents are studied in phonetic research generally.
Add a dissertation
Update dissertation
Page Updated: 26-Nov-2009

Please report any bad links or misclassified data

LINGUIST Homepage | Read LINGUIST | Contact us

NSF Logo

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.