|
|
E-mail this message to a friend
|
|
Title:
|
Accent Features and Idiodictionaries: On improving accuracy for accented speakers in ASR
|
|
Author:
|
Michael Tjalve
|
|
Email:
|
click here to access email
|
|
Degree Awarded:
|
University College London
, PhD in Experimental Phonetics
|
|
Degree Date:
|
2007
|
|
Linguistic Subfield(s):
|
Phonetics
|
|
Director(s):
|
Mark Huckvale
|
|
|
Abstract:
|
|
One of the most widespread approaches to dealing with the problem of accent
variation in ASR has been to choose the most appropriate pronunciation
dictionary for the speaker from a predefined set of dictionaries. This
approach is weak in two ways: firstly that accent types are more numerous
and more variable than can be captured in a few dictionaries, even if the
knowledge were available to create them; and secondly, accents vary in the
composition and phonotactics of the phone inventory not just in which
phones are used in which word.
In this work, we identify not the speaker's accent, but accent features
which allow us to predict by rule their likely pronunciation of all words
in the dictionary. Any given speaker is associated with a set of accent
features, but it is not a requirement that those features constitute a
known accent. We show that by building a pronunciation dictionary for an
individual, an idiodictionary, recognition accuracy can be improved over a
system using standard accent dictionaries.
The idiodictionary approach could be further enhanced by extending the set
of phone models to improve the modelling of phone inventory and variation
across accents. However an extended phoneme set is difficult to build since
it requires specially-labelled training material, where the labelling is
sensitive to the speaker's accent. An alternative is to borrow phone models
of a suitable quality from other languages. In this work, we show that this
phonetic fusion of languages can improve the recognition accuracy of the
speech of an unknown accent.
This work has practical application in the construction of speech
recognition systems that adapt to speakers' accents. Since it demonstrates
the advantages of treating speakers as individuals rather than just as
members of a group, the work also has potential implications for how
accents are studied in phonetic research generally.
|
|
|
|
|
Page Updated: 26-Nov-2009

Please report any bad links or misclassified data
LINGUIST Homepage | Read
LINGUIST | Contact us

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed on its pages, it cannot vouch for their contents.
|
|