Hadoop Real World Solutions Cookbook (Paperback)

Jonathan R. Owens, Brian Femiano

  • 出版商: Packt Publishing
  • 出版日期: 2013-01-14
  • 定價: $1,650
  • 售價: 6.0$990
  • 語言: 英文
  • 頁數: 316
  • 裝訂: Paperback
  • ISBN: 1849519129
  • ISBN-13: 9781849519120
  • 相關分類: Hadoop
  • 相關翻譯: Hadoop實戰手冊 (簡中版)
  • 立即出貨(限量) (庫存=1)



Ever felt you could use some no-nonsense, practical help when developing applications with Hadoop? Well, you've just found it. This real-world solutions cookbook is packed with handy recipes you can apply to your own everyday issues.


  • Solutions to common problems when working in the Hadoop environment.
  • Recipes for (un)loading data, analytics, and troubleshooting.
  • In depth code examples demonstrating various analytic models, analytic solutions, and common best practices.

In Detail

Helping developers become more comfortable and proficient with solving problems in the Hadoop space. People will become more familiar with a wide variety of Hadoop related tools and best practices for implementation.

Hadoop Real World Solutions Cookbook will teach readers how to build solutions using tools such as Apache Hive, Pig, MapReduce, Mahout, Giraph, HDFS, Accumulo, Redis, and Ganglia.

Hadoop Real World Solutions Cookbook provides in depth explanations and code examples. Each chapter contains a set of recipes that pose, then solve, technical challenges, and can be completed in any order. A recipe breaks a single problem down into discrete steps that are easy to follow. The book covers (un)loading to and from HDFS, graph analytics with Giraph, batch data analysis using Hive, Pig, and MapReduce, machine learning approaches with Mahout, debugging and troubleshooting MapReduce, and columnar storage and retrieval of structured data using Apache Accumulo.

Hadoop Real World Solutions Cookbook will give readers the examples they need to apply Hadoop technology to their own problems.

What you will learn from this book

  • Data ETL, compression, serialization, and import/export.
  • Simple and advanced aggregate analysis.
  • Graph analysis.
  • Machine learning.
  • Troubleshooting and debugging.
  • Scalable persistence.
  • Cluster administration and configuration.


Cookbook recipes demonstrate Hadoop in action and then explain the concepts behind the code.

Who this book is written for

This book is ideal for developers who wish to have a better understanding of Hadoop application development and associated tools, and developers who understand Hadoop conceptually but want practical examples of real world applications.



- 解決在Hadoop環境中工作時常見問題的解決方案。
- (解)載入數據、分析和疑難排解的食譜。
- 深入的代碼示例,展示各種分析模型、分析解決方案和常見的最佳實踐。


《Hadoop實際解決方案食譜》將教讀者如何使用Apache Hive、Pig、MapReduce、Mahout、Giraph、HDFS、Accumulo、Redis和Ganglia等工具構建解決方案。

《Hadoop實際解決方案食譜》提供了深入的解釋和代碼示例。每個章節都包含一系列食譜,提出並解決技術挑戰,可以按任意順序完成。每個食譜將一個問題分解為易於跟隨的獨立步驟。本書涵蓋了與HDFS的(解)載入、使用Giraph進行圖形分析、使用Hive、Pig和MapReduce進行批量數據分析、使用Mahout進行機器學習方法、MapReduce的調試和疑難排解,以及使用Apache Accumulo進行結構化數據的列存儲和檢索。


- 數據ETL、壓縮、序列化和導入/導出。
- 簡單和高級的聚合分析。
- 圖形分析。
- 機器學習。
- 疑難排解和調試。
- 可擴展的持久性。
- 集群管理和配置。