Pro Apache Hadoop, 2/e (Paperback)

Jason Venner, Sameer Wadkar, Madhu Siddalingaiah

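  • Publisher: Apress
  • Publication date: 2014-09-09
  • List price: $1,485
  • Sale price: $1,411 (5% off)
  • VIP price: $1,337 (10% off)
  • Language: English
  • Pages: 444
  • Binding: Paperback
  • ISBN: 1430248637
  • ISBN-13: 9781430248637
  • Related category: Hadoop
  • Related translation: 深入理解Hadoop(原書第2版) (Simplified Chinese edition)
  • Ships immediately (in stock: 1)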

Product description

Pro Apache Hadoop, Second Edition brings you up to speed on Hadoop – the framework of big data. Revised for Hadoop 2.0, the book covers the latest developments such as YARN (aka MapReduce 2.0), new HDFS high-availability features, and increased scalability in the form of HDFS Federation. All the old content has been revised too, giving the latest on the ins and outs of MapReduce, cluster design, the Hadoop Distributed File System, and more.

This book covers everything you need to build your first Hadoop cluster and begin analyzing and deriving value from your business and scientific data. Learn to solve big-data problems the MapReduce way, by breaking a big problem into chunks and creating small-scale solutions that can be distributed across thousands of nodes to analyze large data volumes in a short amount of wall-clock time. Learn how to let Hadoop distribute and parallelize your software: you focus on the code, and Hadoop takes care of the rest.

  • Covers all that is new in Hadoop 2.0
  • Written by a professional involved in Hadoop since day one
  • Takes you quickly to the seasoned pro level on the hottest cloud-computing framework  

What you’ll learn

  • Build a resilient and scalable Hadoop compute cluster.
  • Analyze large volumes of data in amazingly short time.
  • Optimize Hadoop tasks like a seasoned professional.
  • Implement bulletproof patterns that are proven successful.
  • Scale out using the new HDFS Federations feature set.
  • Chunk large problems into highly parallel MapReduce modules.

Who this book is for

This book is aimed at IT professionals investigating Hadoop and implementing it in their organizations. Existing Hadoop users will deepen their toolkits and come up to speed on what’s new in Hadoop 2.0. New Hadoop users will quickly move to the seasoned professional level in their use of the toolset.

Table of Contents

1. Motivation for Big Data

2. Hadoop Concepts

3. Getting Started with the Hadoop Framework

4. Hadoop Administration

5. Basics of MapReduce Development

6. Advanced MapReduce Development

7. Hadoop Input/Output

8. Testing Hadoop Programs

9. Monitoring Hadoop

10. Data Warehousing using Hadoop

11. Data Processing using Pig

12. HCatalog and Hadoop in the Enterprise

13. Log Analysis using Hadoop

14. Building Real-Time Systems using HBase

15. Data Science With Hadoop

16. Hadoop in the Cloud

17. Building a YARN Application

18. Appendix A

19. Appendix B

20. Appendix C
