Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info

New from Oxford University Press!


It's Been Said Before

By Orin Hargraves

It's Been Said Before "examines why certain phrases become clichés and why they should be avoided -- or why they still have life left in them."

New from Cambridge University Press!


Sounds Fascinating

By J. C. Wells

How do you pronounce biopic, synod, and Breughel? - and why? Do our cake and archaic sound the same? Where does the stress go in stalagmite? What's odd about the word epergne? As a finale, the author writes a letter to his 16-year-old self.

Book Information


Title: A resource-light approach to morpho-syntactic tagging
Written By: Anna Feldman
Jirka Hana
Series Title: Language and Computers 70

While supervised corpus-based methods are highly accurate for different NLP
tasks, including morphological tagging, they are difficult to port to other
languages because they require resources that are expensive to create. As a
result, many languages have no realistic prospect for morpho-syntactic
annotation in the foreseeable future. The method presented in this book
aims to overcome this problem by significantly limiting the necessary data
and instead extrapolating the relevant information from another, related
language. The approach has been tested on Catalan, Portuguese, and Russian.
Although these languages are only relatively resource-poor, the same method
can be in principle applied to any inflected language, as long as there is
an annotated corpus of a related language available. Time needed for
adjusting the system to a new language constitutes a fraction of the time
needed for systems with extensive, manually created resources: days instead
of years.

This book touches upon a number of topics: typology, morphology, corpus
linguistics, contrastive linguistics, linguistic annotation, computational
linguistics and Natural Language Processing (NLP). Researchers and students
who are interested in these scientific areas as well as in cross-lingual
studies and applications will greatly benefit from this work. Scholars and
practitioners in computer science and linguistics are the prospective
readers of this book.

Publication Year: 2010
Publisher: Rodopi
Review: Read the review
BibTex: View BibTex record
Linguistic Field(s): Computational Linguistics
Text/Corpus Linguistics
Subject Language(s): English
Issue: All announcements sent out by The LINGUIST List are emailed to our subscribers and archived with the Library of Congress.
Click here to see the original emailed issue.

Format: Electronic
ISBN-13: 9789042027695
Pages: 199
Prices: Europe EURO 40
Format: Hardback
ISBN-13: 9789042027688
Pages: 199
Prices: Europe EURO 40