Academic Paper


Title: Interlingual annotation of parallel text corpora: a new framework for annotation and evaluation
Author: Bonnie J. Dorr
Homepage: http://www.umiacs.umd.edu/~bonnie
Institution: University of Maryland
Author: Rebecca J. Passonneau
Institution: Columbia University
Author: David Farwell
Institution: New Mexico State University
Author: Rebecca Green
Institution: Online Computer Library Center
Author: Nizar Habash
Institution: Columbia University
Author: Stephen Helmreich
Institution: New Mexico State University
Author: Eduard Hovy
Homepage: http://www.isi.edu/natural-language/people/hovy.html
Institution: University of Southern California
Author: Lori S Levin
Institution: Carnegie Mellon University
Author: Keith J Miller
Institution: MITRE Corporation
Author: Teruko Mitamura
Institution: Carnegie Mellon University
Author: Owen Rambow
Institution: Columbia University
Author: Advaith Siddharthan
Institution: University of Aberdeen
Linguistic Field: Applied Linguistics; Computational Linguistics; Text/Corpus Linguistics
Abstract: This paper focuses on an important step in the creation of a system of meaning representation and the development of semantically annotated parallel corpora, for use in applications such as machine translation, question answering, text summarization, and information retrieval. The work described below constitutes the first effort of any kind to annotate multiple translations of foreign-language texts with interlingual content. Three levels of representation are introduced: deep syntactic dependencies (IL0), intermediate semantic representations (IL1), and a normalized representation that unifies conversives, nonliteral language, and paraphrase (IL2). The resulting annotated, multilingually induced, parallel corpora will be useful as an empirical basis for a wide range of research, including the development and evaluation of interlingual NLP systems and paraphrase-extraction systems as well as a host of other research and development efforts in theoretical and applied linguistics, foreign language pedagogy, translation studies, and other related disciplines.

This article appears in Natural Language Engineering, Vol. 16, Issue 3.


