LINGUIST List 18.2951
Wed Oct 10 2007
Diss: Comp Ling: Turk: 'Cross-Lingual Voice Conversion'
Editor for this issue: Luiza Newlin Lukowicz
<luiza linguistlist.org>
Directory
1. Oytun Turk, Cross-Lingual Voice Conversion
Message 1: Cross-Lingual Voice Conversion
Date: 10-Oct-2007
From: Oytun Turk <oytunturk gmail.com>
Subject: Cross-Lingual Voice Conversion
Institution: Boğaziçi University
Program: Electrical and Electronics Engineering
Dissertation Status: Completed
Degree Date: 2007
Author: Oytun Turk
Dissertation Title: Cross-Lingual Voice Conversion
Linguistic Field(s): Computational Linguistics
Dissertation Director: Prof. Dr. Levent Mustafa Arslan
Dissertation Abstract:
Cross-lingual voice conversion refers to the automatic transformation of a source speaker's voice into a target speaker's voice in a language that the target speaker cannot speak. It involves a set of statistical analysis, pattern recognition, machine learning, and signal processing techniques. This study addresses the problems of cross-lingual voice conversion by discussing open research questions, presenting new methods, and comparing them with state-of-the-art techniques. For the training stage, a phonetic hidden Markov model based automatic segmentation and alignment method is developed for cross-lingual applications that support text-independent and text-dependent modes. The vocal tract transformation function is estimated in more detail using weighted speech frame mapping. By adjusting the weights, similarity to the target voice and output quality can be balanced according to the requirements of the cross-lingual voice conversion application. A context-matching algorithm is developed to reduce one-to-many mapping problems and to enable non-parallel training. A further set of improvements is proposed for prosody transformation, including stylistic modeling and the transformation of pitch and speaking rate. A high-quality cross-lingual voice conversion database is designed for evaluating the proposed methods. The database consists of recordings from bilingual speakers of American English and Turkish. It is used in objective and subjective evaluations, and in case studies for testing new ideas in cross-lingual voice conversion.