Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info

New from Oxford University Press!


It's Been Said Before

By Orin Hargraves

It's Been Said Before "examines why certain phrases become clichés and why they should be avoided -- or why they still have life left in them."

New from Cambridge University Press!


Sounds Fascinating

By J. C. Wells

How do you pronounce biopic, synod, and Breughel? - and why? Do our cake and archaic sound the same? Where does the stress go in stalagmite? What's odd about the word epergne? As a finale, the author writes a letter to his 16-year-old self.

Book Information

Sun Image

Title: Creating and Digitizing Language Corpora Volume 2
Subtitle: Diachronic Databases
Edited By: Joan C. Beal
Karen P Corrigan
Hermann L. Moisl

A range of electronic corpora has become increasingly accessible via the
WWW and CD-ROM. This development coincided with improvements in the
standards governing the collecting, encoding and archiving of such data.
Less attention, however, has been paid to making other types of digital
data available - especially that which one might describe as
'unconventional', namely, the fragmentary texts and voices left to us as
accidents of history. Advances in technology have enabled the collection
and organisation of such data sets into a growing number of user-friendly
electronic corpora. The latter have the potential to offer new insights
into linguistic universals, for instance, since they allow, for the first
time, rapid and systematic comparisons across genres as well as social,
temporal and geographical space. This book provides state-of-the-art
methods and guidelines for creating and digitising these resources taking
full advantage of the dramatic recent improvements in computing and
analytical tools.

Publication Year: 2007
Publisher: Palgrave Macmillan
Review: Not available for review. If you would like to review a book on The LINGUIST List, please login to view the AFR list.
BibTex: View BibTex record
Linguistic Field(s): Computational Linguistics
Text/Corpus Linguistics
Discipline of Linguistics
Issue: All announcements sent out by The LINGUIST List are emailed to our subscribers and archived with the Library of Congress.
Click here to see the original emailed issue.

Format: Hardback
ISBN: 1403943672
ISBN-13: 9781403943675
Pages: 50.00