Book Information

Title: Creating and Digitizing Language Corpora Volume 1
Subtitle: Synchronic Databases
Edited By: Joan C. Beal
Karen P. Corrigan
Hermann L. Moisl

A range of electronic corpora has become increasingly accessible via the
WWW and CD-ROM. This development has coincided with improvements in the
standards governing the collecting, encoding and archiving of such data.
Less attention, however, has been paid to making other types of digital
data available, especially data that one might describe as
'unconventional', namely dialect, child language and bilingual databases.
Advances in technology have enabled the collection and organisation of such
data sets into a growing number of user-friendly electronic corpora. The
latter have the potential to offer new insights into linguistic universals,
for instance, since they allow, for the first time, rapid and systematic
comparisons between first and second languages/dialects across both social
and geographical space. This book provides state-of-the-art methods and
guidelines for creating and digitising these resources, taking full
advantage of the dramatic recent improvements in computing and analytical

Publication Year: 2007
Publisher: Palgrave Macmillan
Linguistic Field(s): Computational Linguistics
Text/Corpus Linguistics
Discipline of Linguistics
Format: Hardback
ISBN: 1403943664
ISBN-13: 9781403943668
Pages: 272
Prices: U.K. £50.00