Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info


New from Oxford University Press!

ad

Oxford Handbook of Corpus Phonology

Edited by Jacques Durand, Ulrike Gut, and Gjert Kristoffersen

Offers the first detailed examination of corpus phonology and serves as a practical guide for researchers interested in compiling or using phonological corpora


New from Cambridge University Press!

ad

The Languages of the Jews: A Sociolinguistic History

By Bernard Spolsky

A vivid commentary on Jewish survival and Jewish speech communities that will be enjoyed by the general reader, and is essential reading for students and researchers interested in the study of Middle Eastern languages, Jewish studies, and sociolinguistics.


New from Brill!

ad

Indo-European Linguistics

New Open Access journal on Indo-European Linguistics is now available!


Book Information

   
Sun Image

Title: Definition Extraction for Glossary Creation
Subtitle: A study on extracting definitions for semi-automatic glossary creation in Dutch
Written By: Eline Westerhout
Series Title: LOT Dissertation Series
Description:

The central topic of this thesis is the automatic extraction of definitions
from text. Definition extraction can play a role in various applications
including the semi-automatic development of glossaries in an eLearning
context, which constitutes the main focus of this dissertation. A glossary
provides definitions
for the most important terms that are discussed in a text. The
semi-automatic extraction approach presented in this study consists of two
phases. As a first step, a method entirely based on lexico-syntactic
patterns has been used to distinguish between definitions and
non-definitions. A corpus consisting of 600 definitions has been employed
to identify recurrent definition patterns. Since many of these patterns are
not unique to definitions, a second step was employed to reduce the number
of non-definitions identified. It has been investigated whether other
textual characteristics can contribute to the correct classification of
definitions, in addition to the lexico-syntactic patterns. The properties
that have been examined vary from the importance of the defined word
(phrase) within a text to the layout of the definition. Machine learning
techniques have been employed to identify which are the most relevant
(combinations of) definition properties. The results of this dissertation
are relevant for researchers in linguistics and lexicography as well as for
the development of language technology applications.

Publication Year: 2010
Publisher: Netherlands Graduate School of Linguistics / Landelijke (LOT)
Review: Not available for review. If you would like to review a book on The LINGUIST List, please login to view the AFR list.
BibTex: View BibTex record
Linguistic Field(s): Computational Linguistics
Lexicography
Subject Language(s): Dutch
Issue: All announcements sent out by The LINGUIST List are emailed to our subscribers and archived with the Library of Congress.
Click here to see the original emailed issue.

Versions:
Format: Paperback
ISBN-13: 9789460930348
Prices: U.K. £ 24.86