Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info


New from Oxford University Press!

ad

The Language Hoax

By John H. McWhorter

The Language Hoax "argues that that all humans process life the same way, regardless of their language."


New from Cambridge University Press!

ad

Language and Development in Africa

By H. Ekkehard Wolff

Language and Development in Africa "discusses the resourcefulness of languages, both local and global, in view of the ongoing transformation of African societies as much as for economic development.. "


The LINGUIST List is dedicated to providing information on language and language analysis, and to providing the discipline of linguistics with the infrastructure necessary to function in the digital world. LINGUIST is a free resource, run by linguistics students and faculty, and supported primarily by your donations. Please support LINGUIST List during the 2016 Fund Drive.

Academic Paper


Title: Bootstrapping parsers via syntactic projection across parallel texts
Author: Rebecca Hwa
Institution: University of Pittsburgh
Author: Philip Resnik
Email: click here TO access email
Homepage: http://umiacs.umd.edu/~resnik
Institution: University of Maryland
Author: Amy Weinberg
Email: click here TO access email
Homepage: http://www.umiacs.umd.edu/%7Eweinberg/
Institution: University of Maryland
Author: Clara Cabezas
Institution: University of Maryland
Author: Okan Kolak
Institution: University of Maryland
Linguistic Field: Computational Linguistics
Abstract: Broad coverage, high quality parsers are available for only a handful of languages. A prerequisite for developing broad coverage parsers for more languages is the annotation of text with the desired linguistic representations (also known as "treebanking"). However, syntactic annotation is a labor intensive and time-consuming process, and it is difficult to find linguistically annotated text in sufficient quantities. In this article, we explore using parallel text to help solving the problem of creating syntactic annotation in more languages. The central idea is to annotate the English side of a parallel corpus, project the analysis to the second language, and then train a stochastic analyzer on the resulting noisy annotations. We discuss our background assumptions, describe an initial study on the "projectability" of syntactic relations, and then present two experiments in which stochastic parsers are developed with minimal human intervention via projection from English.

CUP AT LINGUIST

This article appears IN Natural Language Engineering Vol. 11, Issue 3, which you can READ on Cambridge's site or on LINGUIST .



Add a new paper
Return to Academic Papers main page
Return to Directory of Linguists main page