Book Information

   

Title: Data-Intensive Text Processing with MapReduce
Written By: Jimmy Lin, Chris Dyer
Edited By: Graeme Hirst
URL: http://www.morganclaypool.com/doi/abs/10.2200/S00274ED1V01Y201006HLT007
Series Title: Synthesis Lectures on Human Language Technologies
Description:

Our world is being revolutionized by data-driven methods: access to large
amounts of data has generated new insights and opened exciting new
opportunities in commerce, science, and computing applications. Processing
the enormous quantities of data necessary for these advances requires large
clusters, making distributed computing paradigms more crucial than ever.
MapReduce is a programming model for expressing distributed computations
on massive datasets and an execution framework for large-scale data
processing on clusters of commodity servers. The programming model
provides an easy-to-understand abstraction for designing scalable algorithms,
while the execution framework transparently handles many system-level
details, ranging from scheduling to synchronization to fault tolerance. This
book focuses on MapReduce algorithm design, with an emphasis on text
processing algorithms common in natural language processing, information
retrieval, and machine learning. We introduce the notion of MapReduce
design patterns, which represent general reusable solutions to commonly
occurring problems across a variety of problem domains. This book not only
aims to help the reader "think in MapReduce", but also discusses the
limitations of the programming model.

Table of Contents: Introduction / MapReduce Basics / MapReduce Algorithm
Design / Inverted Indexing for Text Retrieval / Graph Algorithms / EM
Algorithms for Text Processing / Closing Remarks

Publication Year: 2010
Publisher: Morgan & Claypool Publishers
Linguistic Field(s): Computational Linguistics

Versions:
Format: Electronic
ISBN-13: 9781608453436
Pages: 177
Prices: U.S. $ 30.00
 
Format: Paperback
ISBN-13: 9781608453429
Pages: 177
Prices: U.S. $ 40.00