Hadoop in Action (Paperback)

Chuck Lam

  • 出版商: Manning
  • 出版日期: 2010-12-15
  • 售價: $1,770
  • 貴賓價: 9.5$1,682
  • 語言: 英文
  • 頁數: 325
  • 裝訂: Paperback
  • ISBN: 1935182196
  • ISBN-13: 9781935182191
  • 相關分類: Hadoop
  • 無法訂購

買這商品的人也買了...

相關主題

商品描述

Hadoop in Action teaches readers how to use Hadoop and write MapReduce programs. The intended readers are programmers, architects, and project managers who have to process large amounts of data offline. Hadoop in Action will lead the reader from obtaining a copy of Hadoop to setting it up in a cluster and writing data analytic programs.

The book begins by making the basic idea of Hadoop and MapReduce easier to grasp by applying the default Hadoop installation to a few easy-to-follow tasks, such as analyzing changes in word frequency across a body of documents. The book continues through the basic concepts of MapReduce applications developed using Hadoop, including a close look at framework components, use of Hadoop for a variety of data analysis tasks, and numerous examples of Hadoop in action.

Hadoop in Action will explain how to use Hadoop and present design patterns and practices of programming MapReduce. MapReduce is a complex idea both conceptually and in its implementation, and Hadoop users are challenged to learn all the knobs and levers for running Hadoop. This book takes you beyond the mechanics of running Hadoop, teaching you to write meaningful programs in a MapReduce framework.

This book assumes the reader will have a basic familiarity with Java, as most code examples will be written in Java. Familiarity with basic statistical concepts (e.g. histogram, correlation) will help the reader appreciate the more advanced data processing examples.

商品描述(中文翻譯)

《Hadoop in Action》教讀者如何使用Hadoop並撰寫MapReduce程式。目標讀者為程式設計師、架構師和專案經理,他們需要處理大量離線資料。《Hadoop in Action》將引導讀者從獲取Hadoop的副本到在叢集中設置並撰寫資料分析程式。

本書首先通過將預設的Hadoop安裝應用於一些易於理解的任務,例如分析文件集中單詞頻率的變化,來使讀者更容易理解Hadoop和MapReduce的基本概念。本書繼續介紹使用Hadoop開發的MapReduce應用程序的基本概念,包括對框架組件的詳細介紹,使用Hadoop進行各種數據分析任務的方法,以及大量Hadoop實例。

《Hadoop in Action》將解釋如何使用Hadoop並介紹MapReduce編程的設計模式和實踐。MapReduce在概念上和實現上都是一個複雜的想法,Hadoop使用者需要學習所有運行Hadoop的旋鈕和操縱桿。本書將帶領讀者超越運行Hadoop的機制,教授在MapReduce框架中撰寫有意義的程式。

本書假設讀者對Java有基本的熟悉,因為大部分程式碼範例將使用Java編寫。對基本統計概念(例如直方圖、相關性)的熟悉將有助於讀者理解更高級的數據處理範例。