Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info


New from Oxford University Press!

ad

A History of the Irish Language: From the Norman Invasion to Independence

By Aidan Doyle

This book "sets the history of the Irish language in its political and cultural context" and "makes available for the first time material that has previously been inaccessible to non-Irish speakers."


New from Cambridge University Press!

ad

The Cambridge Handbook of Pragmatics

Edited By Keith Allan and Kasia M. Jaszczolt

This book "fills the unquestionable need for a comprehensive and up-to-date handbook on the fast-developing field of pragmatics" and "includes contributions from many of the principal figures in a wide variety of fields of pragmatic research as well as some up-and-coming pragmatists."


Academic Paper


Title: Paraphrasing spoken Chinese using a paraphrase corpus
Author: Jie Zhang
Email: click here TO access email
Institution: Pennsylvania State University
Author: Kazuhide Yamamoto
Institution: Nagaoka University of Technology
Linguistic Field: Computational Linguistics; Text/Corpus Linguistics; Translation
Abstract: One of the key issues in spoken-language translation is how to deal with unrestricted expressions in spontaneous utterances. We have developed a paraphraser for use as part of a translation system, and in this paper we describe the implementation of a Chinese paraphraser for a Chinese-Japanese spoken-language translation system. When an input sentence cannot be translated by the transfer engine, the paraphraser automatically transforms the sentence into alternative expressions until one of these alternatives can be translated by the transfer engine. Two primary issues must be dealt with in paraphrasing: how to determine new expressions, and how to retain the meaning of the input sentence. We use a pattern-based approach in which the meaning is retained to the greatest possible extent without deep parsing. The paraphrase patterns are acquired from a paraphrase corpus and human experience. The paraphrase instances are automatically extracted and then generalized into paraphrase patterns. A total of 1719 paraphrase patterns obtained using this method and an implemented paraphraser were used in a paraphrasing experiment. The results showed that the implemented paraphraser generated 1.7 paraphrases on average for each test sentence and achieved an accuracy of 88%.

CUP AT LINGUIST

This article appears IN Natural Language Engineering Vol. 11, Issue 4, which you can READ on Cambridge's site or on LINGUIST .



Add a new paper
Return to Academic Papers main page
Return to Directory of Linguists main page