Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info

New from Oxford University Press!


May I Quote You on That?

By Stephen Spector

A guide to English grammar and usage for the twenty-first century, pairing grammar rules with interesting and humorous quotations from American popular culture.

New from Cambridge University Press!


The Cambridge Handbook of Endangered Languages

Edited By Peter K. Austin and Julia Sallabank

This book "examines the reasons behind the dramatic loss of linguistic diversity, why it matters, and what can be done to document and support endangered languages."

Book Information

Sun Image

Title: Definition Extraction for Glossary Creation
Subtitle: A study on extracting definitions for semi-automatic glossary creation in Dutch
Written By: Eline Westerhout
Series Title: LOT Dissertation Series

The central topic of this thesis is the automatic extraction of definitions
from text. Definition extraction can play a role in various applications
including the semi-automatic development of glossaries in an eLearning
context, which constitutes the main focus of this dissertation. A glossary
provides definitions
for the most important terms that are discussed in a text. The
semi-automatic extraction approach presented in this study consists of two
phases. As a first step, a method entirely based on lexico-syntactic
patterns has been used to distinguish between definitions and
non-definitions. A corpus consisting of 600 definitions has been employed
to identify recurrent definition patterns. Since many of these patterns are
not unique to definitions, a second step was employed to reduce the number
of non-definitions identified. It has been investigated whether other
textual characteristics can contribute to the correct classification of
definitions, in addition to the lexico-syntactic patterns. The properties
that have been examined vary from the importance of the defined word
(phrase) within a text to the layout of the definition. Machine learning
techniques have been employed to identify which are the most relevant
(combinations of) definition properties. The results of this dissertation
are relevant for researchers in linguistics and lexicography as well as for
the development of language technology applications.

Publication Year: 2010
Publisher: Netherlands Graduate School of Linguistics / Landelijke (LOT)
Review: Not available for review. If you would like to review a book on The LINGUIST List, please login to view the AFR list.
BibTex: View BibTex record
Linguistic Field(s): Computational Linguistics
Subject Language(s): Dutch
Issue: All announcements sent out by The LINGUIST List are emailed to our subscribers and archived with the Library of Congress.
Click here to see the original emailed issue.

Format: Paperback
ISBN-13: 9789460930348
Prices: U.K. £ 24.86