Book Information

   

Title: Definition Extraction for Glossary Creation
Subtitle: A study on extracting definitions for semi-automatic glossary creation in Dutch
Written By: Eline Westerhout
Series Title: LOT Dissertation Series
Description:

The central topic of this thesis is the automatic extraction of definitions
from text. Definition extraction can play a role in various applications
including the semi-automatic development of glossaries in an eLearning
context, which constitutes the main focus of this dissertation. A glossary
provides definitions for the most important terms that are discussed in a
text. The
semi-automatic extraction approach presented in this study consists of two
phases. As a first step, a method entirely based on lexico-syntactic
patterns has been used to distinguish between definitions and
non-definitions. A corpus consisting of 600 definitions has been employed
to identify recurrent definition patterns. Since many of these patterns are
not unique to definitions, a second step was employed to reduce the number
of non-definitions identified. It has been investigated whether other
textual characteristics can contribute to the correct classification of
definitions, in addition to the lexico-syntactic patterns. The properties
that have been examined vary from the importance of the defined word
(phrase) within a text to the layout of the definition. Machine learning
techniques have been employed to identify which are the most relevant
(combinations of) definition properties. The results of this dissertation
are relevant for researchers in linguistics and lexicography as well as for
the development of language technology applications.
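
The two-phase idea described above can be illustrated with a minimal sketch. It is not the method or the pattern set used in the thesis: the patterns below are hypothetical English examples (the thesis works on Dutch and derives its patterns from a corpus of 600 definitions), and the second phase uses a simple term-frequency threshold as a stand-in for a trained machine learning classifier. All function names, the sample sentences, and the `min_frequency` parameter are assumptions made for illustration.

```python
import re

# Hypothetical lexico-syntactic patterns (English, illustration only).
PATTERNS = [
    # "A/An/The TERM is a/an/the ..."
    re.compile(
        r"^(?:An?|The)\s+(?P<term>[\w-]+(?:\s[\w-]+){0,2})\s+(?:is|are)\s+(?:an?|the)\s+\w+",
        re.IGNORECASE,
    ),
    # "TERM is defined as / refers to / means ..."
    re.compile(
        r"^(?P<term>[A-Z][\w-]*(?:\s[\w-]+){0,2})\s+(?:is defined as|refers to|means)\s+\w+"
    ),
]


def find_candidates(sentences):
    """Phase 1: return (term, sentence) pairs for sentences matching a pattern."""
    candidates = []
    for sentence in sentences:
        for pattern in PATTERNS:
            match = pattern.match(sentence)
            if match:
                candidates.append((match.group("term").lower(), sentence))
                break  # one matching pattern is enough
    return candidates


def term_frequency(term, sentences):
    """Count whole-word occurrences of the term across all sentences."""
    term_re = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
    return sum(len(term_re.findall(s)) for s in sentences)


def filter_candidates(candidates, sentences, min_frequency=2):
    """Phase 2 (stand-in): keep candidates whose term is frequent in the text.

    A real system would feed several properties (term importance, layout,
    and so on) to a trained classifier instead of using a single threshold.
    """
    return [
        (term, sentence)
        for term, sentence in candidates
        if term_frequency(term, sentences) >= min_frequency
    ]


SENTENCES = [
    "A glossary is a list of terms with their definitions.",
    "The weather is a nuisance today.",
    "Each glossary entry should be short.",
    "Tokenization refers to splitting text into tokens.",
    "Students apply tokenization before parsing.",
]

candidates = find_candidates(SENTENCES)
definitions = filter_candidates(candidates, SENTENCES)
for term, sentence in definitions:
    print(f"{term}: {sentence}")
```

In this toy input, the sentence about the weather matches a definition pattern but is discarded in the second phase because its term occurs only once in the text, which mirrors the motivation for the second step: patterns alone are not unique to definitions.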

Publication Year: 2010
Publisher: Netherlands Graduate School of Linguistics / Landelijke (LOT)
Linguistic Field(s): Computational Linguistics; Lexicography
Subject Language(s): Dutch

Versions:
Format: Paperback
ISBN-13: 9789460930348
Prices: U.K. £ 24.86